Director of AI Research

15 Minutes ago • 15 Years + • Artificial Intelligence

Job Summary

Job Description

NVIDIA's Conversational AI research team seeks a Director of AI Research to lead the development of new deep learning algorithms and techniques for efficient LLM inference. Responsibilities include designing new architectures for advanced LLMs, adapting foundation AI models to downstream tasks (math, code reasoning), contributing to the NeMo framework, curating datasets, and collaborating with product and hardware teams. The ideal candidate will possess an MSc or PhD in Computer Science/Electrical Engineering, 15+ years of machine learning/deep learning experience, 10+ years of management experience, expertise in NLP and speech processing, proficiency in Python and PyTorch, and a strong publication record. This role involves leading a team to build and deploy cutting-edge AI solutions.
Must have:
  • MSc or PhD in CS/EE
  • 15+ years ML/DL experience
  • 10+ years management experience
  • NLP/Speech processing knowledge
  • Python & PyTorch expertise
  • Strong publication record
  • Lead LLM inference research
Perks:
  • Competitive salary
  • Generous benefits package

Job Details

We are looking for a Director of AI Research to lead a team working on new deep learning algorithms and techniques for efficient LLM inference.

NVIDIA is searching for world-class researchers in deep learning and natural language processing (NLP) to join our Conversational AI research team. Our team is pushing the boundaries of generative AI by building state-of-the-art large language models (LLMs). We work on new neural architectures that enable LLMs to handle very long context, on applying LLMs to solve complicated math and coding problems, and on improving LLM robustness. If you are passionate about the latest research and technologies revolutionizing generative AI and want to explore creative new paradigms for applied foundation models, such as reasoning agents, this team will be a great fit for you. After building prototypes that demonstrate the promise of your research, you will collaborate with product teams to apply your ideas to industry-leading real-world applications.

What you will do:

  • Lead the team working on new deep learning algorithms and techniques for efficient LLM inference.
  • Develop new architectures for advanced large language models.
  • Design and adapt foundation AI models to downstream tasks such as math and code reasoning.
  • Contribute these new models to the NeMo framework.
  • Construct and curate datasets for large-scale machine learning, for learning from human preferences, and for specific application domains.
  • Work closely with product and hardware architecture teams to integrate your research and developments into products.

What we need to see:

  • MSc or PhD in Computer Science or Electrical Engineering.
  • 15+ years overall of extensive machine learning / deep learning research or work experience, and 10+ years of management experience.
  • Knowledge of application areas such as natural language processing and speech processing.
  • Excellent programming skills in a rapid prototyping environment such as Python.
  • Expertise with deep learning frameworks such as PyTorch.
  • A track record of research excellence demonstrated in publications at leading conferences and journals.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

