Senior DevOps Engineer, Deep Learning Frameworks

1 Month ago • 5 Years + • DevOps • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA's Deep Learning Optimized Frameworks Group seeks a Senior DevOps Engineer to enhance their high-performing deep learning software stacks (TensorFlow, PyTorch). Responsibilities include automating build, test, integration, and release processes; configuring and maintaining industry-standard tools (Gitlab, Jenkins, Docker, etc.); developing shared utilities; leading best practices; and identifying infrastructure needs. The ideal candidate will have strong experience with CI systems, SCM, build systems, and Python programming, along with a passion for automation.
Must have:
  • 5+ years relevant experience
  • CI/CD automation expertise
  • SCM & build system fluency (Git, CMake, Bazel)
  • Python programming skills
  • Problem-solving & collaboration
Good to have:
  • CUDA & Deep Learning experience
  • Container & cluster tech (Kubernetes, Jenkins)
  • GPU computing systems knowledge
  • Experience with new tech incorporation
  • Contribution to large SW projects
Perks:
  • Competitive salary
  • Comprehensive benefits package
  • Equity

Job Details

NVIDIA's Deep Learning Optimized Frameworks Group is looking for an excellent DevOps Engineer to enable the next wave of NVIDIA’s highest performing deep learning software stacks. Your role spans multiple products such as TensorFlow and PyTorch and is instrumental for streamlining development, build, and releases with modern DevOps tools. Join our technically hardworking team of software engineers and infrastructure authorities to design the systems that enable NVIDIA to stay ahead of the competition as we deliver the world's fastest deep learning frameworks.

What you'll be doing:

  • Automating and optimizing build, test, integrate, and release processes for optimized NVIDIA Deep Learning Frameworks

  • Configuring, maintaining, and building upon deployments of industry-standard tools (e.g. Gitlab, Jenkins, Docker, LXC, HyperV, CMake, Bazel)

  • Developing shared utilities for setting up systems, running tests, and recording results

  • Lead best-practices for building, testing, and releasing software

  • Identifying infrastructure needs and translating them into action

What we need to see:

  • BS or higher degree in computer science (or equivalent experience)

  • 5+ years of relevant experience

  • Strong experience setting up, maintaining, and automating continuous integration systems

  • Fluency in SCM (e.g. Github, Gitlab, Git) and build systems (e.g. Make, CMake, Bazel, Docker)

  • Adept programming skills in Python (or Perl, Shell scripting, like bash, tcsh, sh)

  • Pragmatic approach to solving problems and collaboration

  • Real passion for “it just works” automation and enabling team members

Ways to stand out from the crowd:

  • Experience with CUDA and Deep Learning Software Stack

  • Good knowledge of container and cluster technologies like slurm, kubernetes, jenkins, gitlab-ci, and zabbix

  • Experience with GPU computing systems

  • Track record of identifying useful new technologies and incorporating them into SW development flows

  • Experience as an active contributor to a SW project involving many developers

With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us and, due to unprecedented growth, our special engineering teams are growing fast. If you're a creative and autonomous engineer with a genuine passion for technology, we want to hear from you.

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Rackspace Technology - Principal MLOPs Engineer

Rackspace Technology

United States (Remote)
4 Months ago
Twitch - Applied Scientist - Safety ML

Twitch

San Francisco, California, United States (On-Site)
2 Months ago
Canva - Research Engineering Manager - Image Generation (m/f/x) - Canva Austria

Canva

Vienna, Vienna, Austria (Remote)
3 Months ago
Trendyol - Data Science Professionals - Trendyol GO

Trendyol

İzmir, İzmir, Türkiye (Hybrid)
3 Months ago
ByteDance - Research Scientist Intern (Doubao (Seed) - Music Foundation Model) - 2024 Summer (PhD)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Rackspace Technology - OpenStack Cloud Engineer IV

Rackspace Technology

(Remote)
2 Weeks ago
Anthology  Inc  - Associate Software Engineer II

Anthology Inc

Brno, South Moravian Region, Czechia (On-Site)
1 Week ago
Rackspace Technology - AWS Engineer IV-IN (R-20541)

Rackspace Technology

Gurugram, Haryana, India (Remote)
2 Months ago
Axon - Senior Site Reliability Engineer II

Axon

Seattle, Washington, United States (Remote)
3 Days ago
Info Stretch - Lead Data Engineer

Info Stretch

Pune, Maharashtra, India (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

GoTo Group - Senior Data Scientist - Computer Vision - KYC

GoTo Group

Singapore (On-Site)
4 Months ago
Google - Student Researcher, PhD, Winter/Summer 2025

Google

(On-Site)
3 Months ago
Paypal - Machine Learning Manager

Paypal

San Jose, California, United States (Hybrid)
3 Months ago
Meta - Software Engineer, Computer Vision (Technical Leadership)

Meta

San Francisco, California, United States (Remote)
3 Months ago
NVIDIA - Deep Learning Performance Architect

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
Omnissa - Staff Data Scientist

Omnissa

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
Enterprise Bot - Data Scientist

Enterprise Bot

Bengaluru, Karnataka, India (On-Site)
3 Months ago
ByteDance - Technical Expert, Large Language Model

ByteDance

Singapore (On-Site)
3 Months ago
ByteDance - Senior Machine Learning Engineer

ByteDance

San Jose, California, United States (On-Site)
1 Day ago
NVIDIA - Developer Relations Manager - GenAI for Automotive

NVIDIA

Santa Clara, California, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

Epic Games - Senior Tools Programmer, Applied Usability

Epic Games

Cary, North Carolina, United States (On-Site)
2 Weeks ago
ZeniMax Media - Senior Animator (Faces)

ZeniMax Media

Rockville, Maryland, United States (On-Site)
5 Months ago
NVIDIA - Senior Circuit Design Engineer

NVIDIA

Santa Clara, California, United States (Remote)
1 Month ago
Info Stretch - Database Administrator 2

Info Stretch

Lansing, Michigan, United States (On-Site)
1 Month ago
Postman - Backend and Systems Engineer, Flows

Postman

New York, New York, United States (On-Site)
4 Months ago
Electronic Arts - Staff Quant Researcher

Electronic Arts

California, United States (On-Site)
1 Month ago
Netflix - Senior Producer, Game Studio

Netflix

Los Angeles, California, United States (On-Site)
1 Month ago
The Walt Disney Company - Water Sciences Project Specialist

The Walt Disney Company

Lake Buena Vista, Florida, United States (On-Site)
1 Week ago
ByteDance - Office Administration - DCar (Third-party Contractor)

ByteDance

Los Angeles, California, United States (On-Site)
3 Months ago
Kokotree - Artificial Intelligence Developers

Kokotree

Wilmington, North Carolina, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Intel Corporation - DevOps infra - k8s Engineer

Intel Corporation

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
1 Month ago
ION - Senior DevSecOps Engineer, Italy

ION

Milan, Lombardy, Italy (On-Site)
4 Months ago
Anthology  Inc  - Platform Engineer II

Anthology Inc

Bogotá, Bogota, Colombia (Remote)
2 Months ago
Microsoft - ROP - Senior Software Engineer

Microsoft

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Larian Studios - DevOps Build Engineer

Larian Studios

Dublin, County Dublin, Ireland (On-Site)
3 Months ago
Hashlist - Senior Data Engineer

Hashlist

Pune, Maharashtra, India (Hybrid)
3 Months ago
Imagineio - MLOps / DevOps Engineer

Imagineio

New Delhi, Delhi, India (Hybrid)
8 Months ago
GoTo Group - Site Reliability Engineer - EP (SE4)

GoTo Group

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Microsoft - Senior Software Engineer

Microsoft

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Quorum Software - Site Reliability Engineer (Hybrid Work Schedule)

Quorum Software

Pune, Maharashtra, India (Hybrid)
4 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Yokne'am Illit, North District, Israel (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

United States (Remote)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Bengaluru, Karnataka, India (Hybrid)

Bengaluru, Karnataka, India (Hybrid)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug