Member of Technical Staff, AI - Reinforcement Learning (RL) Platform

1 Month ago • All levels • Research & Development • Undisclosed

About the job

Job Description

This role involves building and enhancing the world's most advanced reinforcement learning (RL) platform at Microsoft AI. Responsibilities include designing and developing the core infrastructure of the RL platform, focusing on systematizing and extending RL algorithms for LLMs across various environments. The position requires collaborating with cross-functional teams to deliver new agentic AI product capabilities, developing new algorithms, and onboarding team members to state-of-the-art techniques. The ideal candidate possesses strong coding, software engineering, and API design skills, a background in machine learning and scientific computing, and thrives in a collaborative environment. They must excel at managing multiple responsibilities and adapting to changing priorities.
Must have:
  • Design and develop RL platform infrastructure
  • Extend RL algorithms for LLMs
  • Collaborate with cross-functional teams
  • Develop new algorithms
  • Strong coding, software engineering, and API design skills
  • Machine learning and scientific computing background
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Overview

Help build the world’s most advanced reinforcement learning platform at Microsoft AI. 

We're on a mission to create trustworthy agents capable of autonomous action and decision-making on behalf of our users. As part of our team, you’ll help advance state-of-the-art algorithms for model alignment and develop tools to extend model capabilities to numerous product domains within Microsoft. 

We are looking for candidates who are both scientists and software engineers. The ideal candidate will be able to build robust systems that help our team solve the next generation of AI problems. They would: 

  • Excel in coding, software engineering, and API design 
  • Have a background in machine learning and scientific computing 
  • Thrive in a highly collaborative, fast-paced environment 
  • Have a high degree of craftsmanship and pay close attention to details 
  • Effectively manage multiple responsibilities and can adjust to shifting priorities.   

Qualifications

Required/Minimum Qualifications  

  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modeling or data engineering work 
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work 
  • OR equivalent experience. 

 

 

 

 

#Copilot #MicrosoftAI

Responsibilities

  • Design and develop the core infrastructure of the RL Platform, focusing on systematizing and extending RL algorithms for LLMs to a variety of present and future environments. 
  • Assist in development of new algorithms and help onboard other team members to state-of-the-art techniques. 
  • Collaborate with cross-functional teams to ship new agentic AI product capabilities. 
  • Embody our of collaboration, innovation, and excellence. 
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect
View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Gurugram, Haryana, India (On-Site)

Redmond, Washington, United States (On-Site)

Redmond, Washington, United States (On-Site)

Vancouver, British Columbia, Canada (On-Site)

Redmond, Washington, United States (On-Site)

Barcelona, Catalonia, Spain (On-Site)

Prague, Prague, Czechia (On-Site)

Montreal, Quebec, Canada (On-Site)

Dublin, County Dublin, Ireland (On-Site)

London, England, United Kingdom (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Similar Jobs

The Walt Disney Company - Principal Software Engineer

The Walt Disney Company, United States (On-Site)

Paypal - Sr MTS Software Engineer

Paypal, India (On-Site)

Paypal - Machine Learning Engineer

Paypal, United States (Hybrid)

Microsoft - Design Verification Engineer 2

Microsoft, India (On-Site)

Power Integrations - Software Developer (Backend)

Power Integrations, Philippines (On-Site)

Meta - Software Engineer, Machine Learning

Meta, United States (Remote)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Playtech - Java Developer

Playtech, United Kingdom (On-Site)

Meta - Design Verification Engineer

Meta, United States (On-Site)

Centripetal - Cyber Data Scientist

Centripetal, United States (On-Site)

Fliff  Inc  - Software Engineer II

Fliff Inc , Bulgaria (On-Site)

Warner Bros Discovery - Customer Data Manager - Digital/ VOD

Warner Bros Discovery, Poland (Hybrid)

Get notifed when new similar jobs are uploaded

Jobs in London, England, United Kingdom

version 1 - .Net Technical Lead

version 1, United Kingdom (On-Site)

Push Gaming - Infrastructure Engineer

Push Gaming, United Kingdom (Hybrid)

THE GAME - SENIOR PEOPLE & CULTURE MANAGER

THE GAME, United Kingdom (Hybrid)

Alphasense - Product Specialist

Alphasense, United Kingdom (On-Site)

Assystems - Cost Engineer

Assystems, United Kingdom (Hybrid)

Bazaar Voice - Senior Software Engineer (Backend)

Bazaar Voice, United Kingdom (Hybrid)

DraftKings - Associate Director, Product, Global Sports

DraftKings, United Kingdom (On-Site)

Hyper Luminal Games  - Console Programmer

Hyper Luminal Games , United Kingdom (On-Site)

Climax Studios - Senior Games Designer

Climax Studios, United Kingdom (On-Site)

Orange Tree Theatre - Casual Audio Describers

Orange Tree Theatre, United Kingdom (On-Site)

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Ubisoft India Studios - QC Technical Manager

Ubisoft India Studios, India (Hybrid)

Intel Corporation - Senior Performance Verification Architect

Intel Corporation, Israel (Hybrid)

Luxoft - Senior Information Architect

Luxoft, Sweden (On-Site)

Microsoft - Data and Applied Scientist II

Microsoft, India (On-Site)

Samsung Semiconductor - Intern, High Capacity SSD Software Ecosystem

Samsung Semiconductor, United States (Hybrid)

Virtuos - Software Engineer Trainee

Virtuos, China (On-Site)

Intel Corporation - SOC Architect

Intel Corporation, United States (Hybrid)

Get notifed when new similar jobs are uploaded