Research Intern - Multimodal AI Research

2 Weeks ago • 1 Years + • Artificial Intelligence • $78,600 PA - $154,560 PA

Job Summary

Job Description

Microsoft's AI Platform team seeks Research Interns for its Multimodal Intelligence (MMI) team. The internship involves cutting-edge research in multimodal AI, focusing on video, image, and document understanding. Responsibilities include collaborating with researchers, presenting findings, and contributing to projects such as video understanding, information retrieval, and key-value extraction. Candidates should possess a PhD background in a relevant field (AI, NLP, CV) and at least one year of hands-on deep learning experience. Familiarity with LLMs/VLMs is a plus. The internship is a 12-week program, with interns paired with mentors and expected to contribute to the team's vibrant research community.
Must have:
  • PhD in relevant field
  • 1+ years deep learning experience
  • NLP/CV/AI background
  • Proficient in Python
  • Collaboration skills
Good to have:
  • LLM/VLM familiarity
  • Publications in top conferences
  • Experience with PyTorch
  • C/C++ proficiency
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Networking opportunities

Job Details

Overview

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

The AI Platform team is on a mission to advance the state of the art in AI and deliver on our company’s vision for how intelligent cloud and intelligent edge will shape the next phase of innovation. The team includes top scientists and researchers from across Microsoft who are creating a center of excellence in speech, computer vision, and natural language.

 

Within the AI Platform, the Multi-modal Intelligence team (MMI) mission is to make fundamental contributions to advancing the state-of-the-art in AI technology related to Video, Image, Document, and other multimodality inputs. “Documents”, for example, stand at the intersection between NLP and Vision research. To fully understand a document, one needs to borrow from both language and visual (Layout) elements of the document. We explore both single and multimodality inputs – and their synergy - to conduct research on forward-looking topics such as Video Understanding, Information Retrieval, Key-Value extraction, few-shot Named Entity Recognition (NER), hierarchical layout analysis, and many others. 

 

We are looking for Research Interns to work on cutting edge research in Multimodal AI. We are particularly interested in Research Interns with background in AI, NLP, and/or CV, including topics like Video/image understanding, document layout analysis, chart understanding, multi-page multi-document question answering, novel ways of leveraging LLMs for document/video/image understanding and solving problems inherent to large language models (grounding, retrieval-based generation, etc.). Familiarity with modern LLMs/VLMs is a plus, but not required.  

 

Qualifications

Required Qualifications

  • Currently enrolled in a PhD program in Computer Vision, Natural Language Processing, Deep Learning, Machine Learning, AI, or a related field.
  • At least 1 year of experience in NLP, computer vision, Deep learning, or multimodal research with hands-on deep learning experience.

Other Requirements

  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter. 

Preferred Qualifications

  • Proficient algorithmic problem solving and software development skills (Python, C/C++, etc.).
  • Experience with open-source tools such as PyTorch, etc.
  • Publication(s) in top-tier conferences or journals in related fields (e.g., ACL, CVPR, ECCV, ICCV, EMNLP, NAACL, NIPS, ICML, ICLR, IJCV, PAMI, etc.). 

The base pay range for this internship is USD $6,550 - $12,880 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,480 - $13,920 per month.

 

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: 

Microsoft accepts applications and processes offers for these roles on an ongoing basis.

Responsibilities

Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect

Similar Jobs

Rockstar Games - UI Programmer (C++)

Rockstar Games

Dundee, Scotland, United Kingdom (On-Site)
5 Months ago
Microsoft - Software Engineer - Storage

Microsoft

Bengaluru, Karnataka, India (On-Site)
3 Weeks ago
Zoox - Software Engineer - 3D World Generation Pipelines

Zoox

Seattle, Washington, United States (Hybrid)
3 Months ago
ByteDance - Backend Software Engineer - Global E-Commerce Supply Chain Operation Platform

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
2K - Technical Director of Gameplay

2K

Vancouver, British Columbia, Canada (On-Site)
7 Months ago
Canva - Head of AI Research

Canva

San Francisco, California, United States (Remote)
1 Month ago
Level AI - Senior Backend Engineer - CX

Level AI

Noida, Uttar Pradesh, India (Hybrid)
4 Months ago
Zoox - Technical Program Manager - Artificial Intelligence

Zoox

Foster City, California, United States (Hybrid)
3 Months ago
Level AI - Backend Engineer - Customer Engineering

Level AI

Noida, Uttar Pradesh, India (Hybrid)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

The Walt Disney Company - Staff Production Engineer - Platform

The Walt Disney Company

Sydney, New South Wales, Australia (On-Site)
3 Months ago
Rockstar Games - Senior Build & Release Engineer

Rockstar Games

San Diego, California, United States (On-Site)
1 Month ago
Marvell - Senior Product Engineer

Marvell

Singapore (On-Site)
3 Months ago
Bungie - Marathon Senior Software Engineer - Commerce

Bungie

(Hybrid)
3 Months ago
Larian Studios - DEVOPS BUILD ENGINEER

Larian Studios

Quebec, Canada (On-Site)
1 Month ago
Meta - Software Engineer, Systems ML - SW/HW Co-design

Meta

New York, New York, United States (Remote)
3 Months ago
Activision - Senior Build Engineer

Activision

Malmö, Skåne County, Sweden (Hybrid)
1 Month ago
Microsoft - Silicon Engineering: Internship Opportunities

Microsoft

Penang, Malaysia (On-Site)
1 Month ago
Luxoft - Senior SW developer Functional Safety and C++

Luxoft

Gothenburg, Västra Götaland County, Sweden (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Redmond, Washington, United States

Evolution - Casino Game Presenter (Live Chat Agent Alternative) - up to $25/hr

Evolution

Atlantic City, New Jersey, United States (On-Site)
3 Months ago
Stardock - Senior Game Designer

Stardock

Plymouth, Michigan, United States (On-Site)
1 Month ago
Workco - Senior Designer

Workco

United States (Remote)
1 Month ago
Crunchyroll - Staff Partner Engineer - Data & Services

Crunchyroll

San Francisco, California, United States (Hybrid)
2 Months ago
ByteDance - Image Sensor Architect - Pico - San Jose

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
Riot Games - Principal Software Engineer, Product Tech-Lead - Unpublished R&D Product

Riot Games

Los Angeles, California, United States (On-Site)
2 Months ago
Flow - Senior/Staff Backend Software Engineer

Flow

New York, New York, United States (Hybrid)
4 Months ago
Blizzard Entertainment - Principal Software Engineer, Gameplay AI | Unannounced Game

Blizzard Entertainment

Irvine, California, United States (Hybrid)
3 Months ago
On Location - Marketing Cloud Engineer

On Location

Austin, Texas, United States (On-Site)
4 Months ago
IGT - Senior Internal Auditor, IT

IGT

Providence, Rhode Island, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Keywords Studios (Player Support) - Research Associate - AI

Keywords Studios (Player Support)

Katowice, Silesian Voivodeship, Poland (On-Site)
1 Month ago
Spell Brush - AI Infrastructure Engineer

Spell Brush

San Francisco, California, United States (On-Site)
4 Months ago
Interface AI - Sr. Implementation Engineer

Interface AI

United States (Remote)
2 Months ago
Rackspace Technology - Principal MLOPs Engineer

Rackspace Technology

United States (Remote)
3 Months ago
Meta - Research Scientist, Computer Vision for Generative AI (PhD)

Meta

Menlo Park, California, United States (On-Site)
3 Months ago
CloudHire - Machine Learning - Engineer

CloudHire

India (Remote)
3 Months ago
ByteDance - Research Engineer (Foundation Model) - Machine Learning Systems

ByteDance

Singapore (On-Site)
3 Months ago
Amazon Games - Senior ML Scientist, Amazon Games AI Research

Amazon Games

San Diego, California, United States (On-Site)
1 Month ago
Meta - Software Engineer, Systems ML - SW/HW Co-design

Meta

Fremont, California, United States (Remote)
3 Months ago
Ubisoft - Programmeur senior ML _ Groupe Technologique Création de Contenu

Ubisoft

Montreal, Quebec, Canada (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

London, England, United Kingdom (On-Site)

London, England, United Kingdom (Hybrid)

London, England, United Kingdom (On-Site)

Jakarta, Jakarta, Indonesia (On-Site)

Gurugram, Haryana, India (On-Site)

Prague, Prague, Czechia (On-Site)

Montreal, Quebec, Canada (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Hyderabad, Telangana, India (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug