Member of Technical Staff, AI - Pre-Training

1 Month ago • 4-10 Years • Artificial Intelligence • $117,200 PA - $294,000 PA

Job Summary

Job Description

This role involves developing algorithms, model architectures, and scaling laws for large-scale AI model training. Responsibilities include algorithmic implementation, conducting experiments, and overseeing training runs on a distributed system. Close collaboration with cross-functional teams is required. The ideal candidate will have expertise in deep learning, large-scale distributed systems, and a strong publication record. The team aims to deliver one of the world's best foundational AI models, impacting various Microsoft AI initiatives. The position requires proficiency in languages like C, C++, C#, Java, JavaScript, or Python and a passion for conversational AI and its deployment. The role demands strong analytical, communication, and collaborative skills.
Must have:
  • Expertise in deep learning and large-scale systems
  • Proficiency in C/C++/Java/Python etc.
  • Strong publication record and technical leadership
  • Experience with large-scale AI model training
  • Excellent communication and collaboration skills
Good to have:
  • Passion for conversational AI
  • Experience with cloud computing platforms
  • Familiarity with multimodal AI models
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Job Details

Overview

Help deliver one of the best foundational models in the world at Microsoft AI. 

At Microsoft AI, we are on a mission to train the world’s most capable AI frontier models, pushing the boundaries of scale, performance and product deployment. The Pre-Training team at Microsoft AI tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, forming the foundation of many initiatives across Microsoft AI. 

 

We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. We are looking for candidates who: 

  • Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects 
  • Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making 
  • Have experience and/or in-depth understandings about large-scale distributed systems 
  • Demonstrate an ability to work collaboratively in a fast-paced, innovative environment

 

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

 

Qualifications

Required Qualifications 

  • Bachelor's Degree in Computer Science, Machine Learning, Mathematics, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python 
    • OR equivalent experience. 

Preferred Qualifications 

  • Bachelor's Degree in Computer Science or related technical field AND 10+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python 
    • OR Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python 
    • OR equivalent experience. 
  • Demonstrated experience in large-scale AI. 
  • Passionate about conversational AI and its deployment. 
  • Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers.   
  • Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI.  
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.  

 

Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $117,200 - $229,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $153,600 - $250,200 per year.

 

Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $137,600 - $267,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $180,400 - $294,000 per year.


Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:

Microsoft will accept applications and processes offers for these roles on an ongoing basis.

 

Responsibilities

  • Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations  
  • Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack  
  • Collaborate closely with teams on infrastructure, data, post-training, and multimodality 
  • Embody our and . 
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect

Similar Jobs

Salesforce - Performance Engineering - MTS/SMTS/LMTS

Salesforce

Hyderabad, Telangana, India (On-Site)
3 Months ago
Aristocrat Gaming - Team Lead

Aristocrat Gaming

Noida, Uttar Pradesh, India (Hybrid)
3 Months ago
Balbix - Senior Software Engineer - Lakehouse

Balbix

Bengaluru, Karnataka, India (On-Site)
3 Months ago
ION - Technical Consultant - Endur

ION

Uniondale, New York, United States (On-Site)
4 Months ago
Luxoft - Java Technical Lead - Microservices

Luxoft

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Unity - Principal Applied Research Machine Learning Engineer

Unity

London, England, United Kingdom (On-Site)
4 Months ago
ByteDance - Research Scientist - Multimodal Foundation Model - 2025 Start

ByteDance

Singapore (On-Site)
3 Months ago
ByteDance - Research Engineer (Foundation Model) - Machine Learning Systems

ByteDance

Singapore (On-Site)
3 Months ago
Google - Staff Software Engineer, Core Machine Learning, Google Cloud

Google

Kirkland, Washington, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Hitachi - Java Developers

Hitachi

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Tesla - Controls Engineer Paint (m/w/d) - Gigafactory Berlin - Brandenburg

Tesla

Grünheide (Mark), Brandenburg, Germany (On-Site)
1 Month ago
eBay - Manager, Software Development

eBay

Toronto, Ontario, Canada (Hybrid)
4 Months ago
Paytm - Android - Senior Software Engineer

Paytm

Noida, Uttar Pradesh, India (On-Site)
3 Months ago
Luxoft - Senior Java Developer (for Trading Application)

Luxoft

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (Remote)
2 Months ago
PwC - Data Engineer – Senior Associate - P&T Labs

PwC

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Razer - Senior API Developer

Razer

Singapore (On-Site)
4 Months ago
PwC - Python Developer (freelance)

PwC

Warsaw, Masovian Voivodeship, Poland (Hybrid)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Redmond, Washington, United States

Activision - Associate Cinematic Director

Activision

Santa Monica, California, United States (On-Site)
2 Months ago
Meta - Software Engineer (Leadership) - Machine Learning

Meta

Burlingame, California, United States (Remote)
3 Months ago
Allied Machine - Field Sales Engineer - Eastern PA, NJ, NYC, Long Island, Delaware

Allied Machine

Philadelphia, Pennsylvania, United States (Remote)
3 Months ago
Crunchyroll - Staff Site Reliability Engineer - Data Engineering, Platform

Crunchyroll

San Francisco, California, United States (Remote)
2 Months ago
Corsair - Director, Web Experience

Corsair

Milpitas, California, United States (On-Site)
1 Month ago
ByteDance - Research Scientist Graduates, Large Language Model (Doubao) - 2025 Start

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Blizzard Entertainment - Senior Materials Artist - Unannounced Game

Blizzard Entertainment

Irvine, California, United States (On-Site)
4 Months ago
Nintendo - CONTRACT - Product Specialist (Portuguese)

Nintendo

Redmond, Washington, United States (Hybrid)
2 Months ago
Lightbox animation-studios - Mid CFX Artist

Lightbox animation-studios

Dallas, Texas, United States (Remote)
6 Months ago
SEGA US - Lifestyle & Partnerships Manager

SEGA US

Burbank, California, United States (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Paypal - Sr. Manager, AI Tech Product Manager

Paypal

San Jose, California, United States (On-Site)
4 Months ago
Genies - Lead NLP Research Scientist

Genies

San Mateo, California, United States (On-Site)
7 Months ago
Microsoft - Gen AI Principal Applied Scientist

Microsoft

Mountain View, California, United States (On-Site)
1 Month ago
Level AI - Principal Software Engineer

Level AI

Noida, Uttar Pradesh, India (Hybrid)
4 Months ago
The Walt Disney Company - Principal Machine Learning Engineer, Research - Ad Platforms

The Walt Disney Company

San Francisco, California, United States (On-Site)
2 Months ago
Kokotree - Artificial Intelligence Developers

Kokotree

Wilmington, North Carolina, United States (On-Site)
3 Months ago
NeST Digital - 1730 - Data Scientist

NeST Digital

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Zoox - Senior/Staff Motion Planning Engineer, Teleguidance

Zoox

Foster City, California, United States (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Redmond, Washington, United States (On-Site)

Mountain View, California, United States (On-Site)

London, England, United Kingdom (Hybrid)

London, England, United Kingdom (On-Site)

Jakarta, Jakarta, Indonesia (On-Site)

Prague, Prague, Czechia (On-Site)

Montreal, Quebec, Canada (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Hyderabad, Telangana, India (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug