Member of Technical Staff, AI Data

1 Month ago • All levels • Data Analyst

Job Summary

Job Description

Microsoft AI seeks a Member of Technical Staff to contribute to building the world's most advanced multimodal dataset. Responsibilities include designing and developing data pipelines for massive multi-modal data (text, audio, images, video); building and maintaining infrastructure for petabytes of data; partnering with pre-training and post-training teams to refine data; and collaborating with product teams and researchers. The role requires expertise in data engineering, pipeline development, and large-scale data processing. The successful candidate will be passionate about data's role in AI model training, thrive in a collaborative environment, and possess a strong attention to detail. They will work closely with teams developing the Copilot experience.
Must have:
  • Design & develop data pipelines
  • Build & maintain data infrastructure
  • Improve data recipes via experimentation
  • Collaborate with product & research teams
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Job Details

Overview

Help build the world’s most advanced multimodal dataset at Microsoft AI.

We are on a mission to create the largest and most advanced multimodal dataset in the world. This dataset, spanning all modalities from across the web and beyond, will power the training of the world’s most capable AI frontier models, pushing the boundaries of scale, performance, and product deployment.  

The AI Data team at Microsoft AI is responsible for all aspects of data preparation to support our model pre-training operations, including collecting data from the source, extracting and transforming the most useful data, and understanding the impact of changes to data by training and evaluating new models. We are an interdisciplinary team of engineers and scientists, learning from each other, and collaborating to create the best models and products. We work closely with the teams that transform pre-trained models into the models that power the consumer Copilot experience 

We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. In particular, we are looking for candidates who: 

  • Are passionate about the role of data in large-scale AI model training 
  • Will thrive in a highly collaborative, fast-paced environment 
  • Have a high degree of craftsmanship and pay close attention to details 
  • Demonstrate a proactive attitude and enthusiasm for exploring new methods and technologies 
  • Effectively manage multiple responsibilities and can adjust to shifting priorities.  

Qualifications

Required/Minimum Qualifications

  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modeling or data engineering work 
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work 
  • OR equivalent experience. 

 

 

 

 

#Copilot #MicrosoftAI

Responsibilities

  • Design and develop data pipelines that ingest enormous amounts of multi-modal training data (text, audio, images, video). 
  • Build and maintain cutting-edge infrastructure that can store and process the petabytes of data needed to power models. 
  • Partner with the pretraining and post-training teams to improve our data recipe by rigorous and careful experimentation. 
  • Collaborate with the product team and other engineers and researchers across Microsoft AI to identify gaps in the current generation of models. 
  • Embody our and . 
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect

Similar Jobs

Scopely - Data Science Manager, Marketing Analytics

Scopely

Barcelona, Catalonia, Spain (Hybrid)
2 Days ago
Coursera - Senior Product Designer, Core Foundation

Coursera

India (Remote)
2 Weeks ago
The Walt Disney Company - Principal Machine Learning Engineer, Research - Ad Platforms

The Walt Disney Company

Santa Monica, California, United States (On-Site)
2 Months ago
Microsoft - Research Intern - M365 Copilot - LLM Reasoning

Microsoft

Redmond, Washington, United States (On-Site)
1 Month ago
Notion - Software Engineer, Data Platform

Notion

San Francisco, California, United States (On-Site)
4 Months ago
Sinch - Manager, Data Engineering

Sinch

Malmö, Skåne County, Sweden (On-Site)
3 Months ago
Wildlife Studios - Data Engineer

Wildlife Studios

São Paulo, State Of São Paulo, Brazil (On-Site)
4 Weeks ago
Zendesk - Senior Data Scientist

Zendesk

Bengaluru, Karnataka, India (Hybrid)
2 Months ago
Epic Games - Senior Data Scientist - Product Analytics

Epic Games

Montreal, Quebec, Canada (On-Site)
1 Month ago
Inkittt - Senior Data Engineer (m/f/d)

Inkittt

Krakow Am See, Mecklenburg-Vorpommern, Germany (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ARHS - NIFI Data Engineer (NDE)

ARHS

Warsaw, Masovian Voivodeship, Poland (Remote)
3 Months ago
Microsoft - Product Management IC4

Microsoft

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Clipwire Games - Senior Game Designer, Clipwire Games

Clipwire Games

Toronto, Ontario, Canada (On-Site)
6 Months ago
DraftKings - Senior Data Science Engineer

DraftKings

London, England, United Kingdom (On-Site)
3 Days ago
Hitachi - Datawarehouse Data Architect - Data & Analytics team (remote / Costa Rica- or LATAM-based)

Hitachi

San José Province, Costa Rica (Remote)
3 Months ago
Inkittt - Senior Machine Learning Engineer, Recommendations

Inkittt

San Francisco, California, United States (Hybrid)
2 Weeks ago
Applike Group - Senior Go Backend Developer (Anti-Fraud) (f/m/d)

Applike Group

Hamburg, Hamburg, Germany (Hybrid)
3 Months ago
Twitch - Data Scientist

Twitch

San Francisco, California, United States (Remote)
5 Months ago
Microsoft - Research Intern - IMAIS - Adaptive Closed-Loop Interaction

Microsoft

Cambridge, Massachusetts, United States (On-Site)
1 Month ago
Netflix - Principal Researcher, Talent Analytics

Netflix

Los Angeles, California, United States (Remote)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in London, England, United Kingdom

version 1 - Senior Outsystems Developer

version 1

Belfast, Northern Ireland, United Kingdom (On-Site)
1 Month ago
ESL FACEIT Group - EFG - Senior Software Engineer - Backend (Go)

ESL FACEIT Group - EFG

London, England, United Kingdom (Remote)
2 Months ago
LeoVegas - Customer Experience Advisor - Danish Market

LeoVegas

Newcastle Upon Tyne, England, United Kingdom (On-Site)
3 Months ago
Netflix - Counsel, CMOs & Collective Music Licensing

Netflix

London, England, United Kingdom (On-Site)
3 Weeks ago
WebMD - Digital Project Manager (m/w/d)

WebMD

United Kingdom (On-Site)
3 Months ago
Critical mass - Senior Project Manager

Critical mass

London, England, United Kingdom (On-Site)
3 Months ago
Cirrus Logic - Systems Engineer / Product Definer

Cirrus Logic

Edinburgh, Scotland, United Kingdom (Hybrid)
3 Months ago
PTW - Project Coordinator | Video Games

PTW

United Kingdom (Remote)
1 Week ago
Hudl - Engineering Manager

Hudl

London, England, United Kingdom (Hybrid)
2 Months ago
Maverick Games - Online Engineer

Maverick Games

Warwick, England, United Kingdom (On-Site)
4 Weeks ago

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

DraftKings - Lead Data Science Engineer

DraftKings

London, England, United Kingdom (On-Site)
3 Months ago
Dream Sports - LiveOps Manager

Dream Sports

Pune, Maharashtra, India (On-Site)
5 Months ago
Rockstar Games - Senior Data Scientist, GTA+ Subscriptions

Rockstar Games

New York, New York, United States (On-Site)
5 Months ago
Microsoft - Principal Data Scientist

Microsoft

Bengaluru, Karnataka, India (On-Site)
1 Month ago
VGW - AML Analyst

VGW

Valletta, Malta (On-Site)
2 Weeks ago
PlayStation Global - Product Manager, Data Foundations

PlayStation Global

San Francisco, California, United States (On-Site)
1 Week ago
The Walt Disney Company - Principal Data Engineer

The Walt Disney Company

Santa Monica, California, United States (On-Site)
1 Week ago
Activision - Stagiaire - Coordonateur.trice de produit

Activision

San Francisco, California, United States (On-Site)
3 Months ago
N-iX - Data Engineer (Databricks) (#2168)

N-iX

Romania (Remote)
3 Months ago
PhonePe - Business Intelligence, Associate Manager

PhonePe

Bengaluru, Karnataka, India (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

London, England, United Kingdom (On-Site)

London, England, United Kingdom (Hybrid)

London, England, United Kingdom (On-Site)

Jakarta, Jakarta, Indonesia (On-Site)

Gurugram, Haryana, India (On-Site)

Prague, Prague, Czechia (On-Site)

Montreal, Quebec, Canada (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Hyderabad, Telangana, India (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug