Senior Deep Learning Systems Software Engineer - AI Infrastructure

1 Month ago • 5 Years + • Full Stack Development

Job Summary

Job Description

NVIDIA seeks a Senior Deep Learning Systems Software Engineer to optimize deep learning workloads on cutting-edge hardware and software. Responsibilities include analyzing, profiling, and optimizing workloads; building automation tools; collaborating with cross-functional teams; identifying and resolving performance bottlenecks; designing performance benchmarks; and providing guidance on cloud application optimization. The ideal candidate will have 5+ years of experience in application performance engineering, experience with large-scale GPU infrastructure, deep learning model architectures (PyTorch), application profiling tools (NVIDIA Nsight, Intel VTune), and strong programming skills (Python, C/C++). The role involves working across the hardware/software stack to achieve peak performance in deep learning training and inference.
Must have:
  • 5+ years application performance engineering experience
  • Large-scale multi-node GPU infrastructure experience
  • Deep learning model architectures & PyTorch expertise
  • Application profiling tools (NVIDIA Nsight, Intel VTune)
  • Strong understanding of computer architecture and GPU architecture
  • Proficiency in Python and C/C++
Good to have:
  • CUDA or OpenCL experience
  • NVIDIA server and software ecosystem understanding
  • Experience with large-scale distributed systems
  • Hands-on experience with NVIDIA GPUs, HPC storage, networking, and cloud computing
  • In-depth understanding of storage systems, Linux file systems, and RDMA networking

Job Details

NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence, and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science-fiction inventions, from artificial intelligence to autonomous cars. NVIDIA is seeking senior engineers with a focus on performance analysis and optimization to help us squeeze every last clock cycle out of all facets of deep learning, including training and inference, some of today's most important workloads. If you are unafraid to work across all layers of the hardware/software stack, from GPU architecture to deep learning frameworks, to achieve peak performance, we want to hear from you! This role offers an opportunity to directly impact the hardware and software roadmap at a fast-growing technology company that leads the AI revolution, while helping deep learning users around the globe enjoy ever-higher training speeds.

What you'll be doing:

  • Understand, analyze, profile, and optimize deep learning workloads on state-of-the-art hardware and software platforms.

  • Build tools to automate workload analysis, workload optimization, and other critical workflows.

  • Collaborate with cross-functional teams to analyze and optimize cloud application performance on diverse GPU architectures.

  • Identify bottlenecks and inefficiencies in application code and propose optimizations to enhance GPU utilization.

  • Drive end-to-end platform optimization, from the hardware level to the application and service levels.

  • Design and implement performance benchmarks and testing methodologies to evaluate application performance.

  • Provide guidance and recommendations on optimizing cloud-native applications for speed, scalability, and resource efficiency.

  • Share knowledge and best practices with domain expert teams as they transition applications to distributed environments.

What we need to see:

  • Master's degree in CS, EE, or CSEE, or equivalent experience

  • 5+ years of experience in application performance engineering

  • Experience using large-scale multi-node GPU infrastructure, on premises or with cloud service providers (CSPs)

  • Background in deep learning model architectures and experience with PyTorch and large-scale distributed training

  • Experience with application profiling tools such as NVIDIA Nsight and Intel VTune

  • Deep understanding of computer architecture and familiarity with the fundamentals of GPU architecture. Experience with NVIDIA's infrastructure and software stacks.

  • Proven experience analyzing, modeling and tuning DL application performance.

  • Proficiency in Python and C/C++ for analyzing and optimizing application code

Ways to stand out from the crowd:

  • Strong fundamentals in algorithms and GPU programming experience (CUDA or OpenCL)

  • Understanding of NVIDIA's server and software ecosystem

  • Hands-on experience in performance optimization and benchmarking on large-scale distributed systems

  • Hands-on experience with NVIDIA GPUs, HPC storage, networking, and cloud computing.

  • In-depth understanding of storage systems, Linux file systems, and RDMA networking

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

