Staff Software Engineer, Machine Learning Infrastructure

2 Months ago • 8 Years + • Artificial Intelligence

Job Summary

Job Description

As a Staff Software Engineer on the Machine Learning Infrastructure team at Thumbtack, you will contribute to the design, implementation, and maintenance of scalable ML systems. Responsibilities include defining and driving the technical vision for Thumbtack's next-generation ML infrastructure, leading cross-functional initiatives, architecting critical ML infrastructure components (model serving and RAG systems), establishing technical standards and best practices, mentoring engineering teams, and partnering with senior leadership to align ML capabilities with business objectives. The role involves working with technologies like Go, Python, and modern ML frameworks (PyTorch, TensorFlow).
Must have:
  • 8+ years of engineering experience in distributed systems
  • 4+ years building ML infrastructure at scale
  • Expertise in Go or Python
  • Strong architectural skills
  • Experience mentoring teams
  • Deep understanding of ML workflows
Good to have:
  • Experience with hundreds of production models
  • Expertise with PyTorch/TensorFlow and MLOps tools
  • Generative AI implementation experience
  • High-performing team building experience
  • Cloud-native architectures expertise (AWS, GCP)
  • Experience in fast-growing tech companies
Perks:
  • Virtual-first working model
  • 20 company holidays
  • WiFi reimbursement
  • Cell phone reimbursement
  • Employee Assistance Program

Job Details

A home is the biggest investment most people make, and yet, it doesn’t come with a manual. That's why we’re building the only app homeowners need to effortlessly manage their homes —  knowing what to do, when to do it, and who to hire. With Thumbtack, millions of people care for what matters most, and pros earn billions of dollars through our platform. And as one of the fastest-growing companies in a $600B+ industry — we must be doing something right. 

We are driven by a common goal and the deep satisfaction that comes from knowing our work supports local economies, helps small businesses grow, and brings homeowners peace of mind. We’re seeking people who continually put our purpose first: advocating for pros and customers, embracing change, and choosing teamwork every day.

At Thumbtack, we're creating a new era of home care. If making an impact and the chance to do good inspires you, join us. Imagine what we’ll build together. 

Thumbtack by the Numbers

  • Available nationwide in every U.S. county
  • Over 85 million projects started on Thumbtack
  • More than 11 million 5-star reviews and counting
  • Pros earn billions on our platform
  • 1000+ employees 
  • $3.2 billion valuation (June, 2021) 

About the Machine Learning Infrastructure Team

At Thumbtack, we're solving complex technical challenges across search, ranking, recommendations, pricing optimization, and spam detection. Our ML Infrastructure team leads the architectural vision and implementation of enterprise-wide machine learning capabilities, enabling teams to effectively experiment with and deploy ML models at scale. We're building next-generation infrastructure that powers Thumbtack's AI-first future. For insights into our engineering challenges, visit our engineering blog.

Challenge 

As a Principal ML Infrastructure Engineer, you'll drive the technical vision and strategic direction of Thumbtack's machine learning platform. You'll architect solutions that democratize ML capabilities across the organization while establishing best practices and technical standards. Working closely with senior leadership, you'll shape our technical roadmap for generative AI adoption, feature platform evolution, and ML operational excellence.

Responsibilities

  • Define and drive the technical vision and architecture for Thumbtack's next-generation ML infrastructure
  •  Lead cross-functional initiatives spanning engineering, data science, and product teams to build scalable, enterprise-grade ML systems
  •  Architect and oversee implementation of critical ML infrastructure components including model serving systems and RAG systems that can scale. 
  •  Establish technical standards and best practices for ML engineering across the organization
  •  Mentor and provide technical leadership to engineering teams on ML infrastructure best practices
  •  Partner with senior leadership to align ML infrastructure capabilities with business objectives

What you’ll need

If you don't think you meet all of the criteria below but still are interested in the job, please apply. Nobody checks every box, and we're looking for someone excited to join the team.

  •  8+ years of engineering experience with significant focus on distributed systems
  •  4+ years of hands-on experience building ML infrastructure or ML platforms at scale
  •  Deep expertise in at least one major programming language; proficiency in our core stack (Go, Python) preferred
  •  Proven track record of technical leadership on complex, cross-functional projects
  •  Strong architectural skills with experience designing scalable, reliable distributed systems
  •  Deep understanding of ML workflows, common frameworks, and operational challenges
  •  Experience mentoring teams and driving engineering excellence
  •  Track record of making strategic technical decisions with organization-wide impact

Bonus points if you have

  •  Experience building AI platforms that support hundreds of models in production
  •  Deep expertise with modern ML frameworks (PyTorch, TensorFlow) and MLOps tools
  •  Experience implementing generative AI capabilities at enterprise scale
  •  Track record of building high-performing technical teams
  •  Expertise with cloud-native architectures and major cloud providers (AWS, GCP)
  •  Experience driving technical strategy at fast-growing technology companies

Thumbtack is a virtual-first company, meaning you can live and work from any one of our approved locations across the United States, Canada or the Philippines.* Learn more about our virtual-first working model here.

#LI-Remote

Benefits & Perks
  • Virtual-first working model coupled with in-person events
  • 20 company-wide holidays including a week-long end-of-year company shutdown
  • Library (optional use collaboration & connection hub) in San Francisco
  • WiFi reimbursements 
  • Cell phone reimbursements (North America) 
  • Employee Assistance Program for mental health and well-being 

Learn More About Us

Thumbtack embraces diversity. We are proud to be an equal opportunity workplace and do not discriminate on the basis of sex, race, color, age, pregnancy, sexual orientation, gender identity or expression, religion, national origin, ancestry, citizenship, marital status, military or veteran status, genetic information, disability status, or any other characteristic protected by federal, provincial, state, or local law. We also will consider for employment qualified applicants with arrest and conviction records, consistent with applicable law. 

Thumbtack is committed to working with and providing reasonable accommodation to individuals with disabilities. If you would like to request a reasonable accommodation for a medical condition or disability during any part of the application process, please contact: recruitingops@thumbtack.com

If you are a California resident, please review information regarding your rights under California privacy laws contained in Thumbtack’s Privacy policy available at https://www.thumbtack.com/privacy/ .

Similar Jobs

Orion Innovation - Data Engineer-AI,ML

Orion Innovation

Chennai, Tamil Nadu, India (On-Site)
3 Months ago
Microsoft - Research Intern - Multi-Agent Models

Microsoft

Redmond, Washington, United States (On-Site)
1 Month ago
Velotio Technologies - Lead Data Scientist

Velotio Technologies

Maharashtra, India (Remote)
2 Weeks ago
Trendyol - Senior Software Engineer - Machine Learning

Trendyol

Ankara, Ankara, Türkiye (Hybrid)
3 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Speech Understanding) - 2025 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Trend Micro - NLP / Prompt Engineer (VicOne_Automotive Security)

Trend Micro

Taipei City, Taiwan (On-Site)
4 Months ago
Keywords Studios (Player Support) - AI Engineer (AI-Powered Agents)

Keywords Studios (Player Support)

Pune, Maharashtra, India (On-Site)
1 Month ago
ByteDance - Research Scientist Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
GoTo Group - Senior Data Scientist - KYC

GoTo Group

Jakarta, Jakarta, Indonesia (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Machine Learning Engineer-Model Serving Infrastructure (AML-Engine)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Every matrix - AI/ML Lead Engineer

Every matrix

Lviv, Lviv Oblast, Ukraine (Hybrid)
1 Week ago
Takeda - Senior AI/ML Engineer

Takeda

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Meta - Postdoctoral Researcher, Embodied AI (PhD)

Meta

Seattle, Washington, United States (On-Site)
2 Months ago
ION - Senior AI Engineer, Italy

ION

Pisa, Tuscany, Italy (On-Site)
4 Months ago
Netflix - Senior Engineering Manager, Data Infra, Machine Learning Platform

Netflix

United States (Remote)
3 Months ago
ByteDance - Research Scientist in Foundation Model, Speech Understanding - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Rackspace Technology - Principal MLOPs Engineer

Rackspace Technology

San Antonio, Texas, United States (Remote)
3 Months ago
Impact Analytics - R&D Architect/Sr. Architect - Artificial Intelligence

Impact Analytics

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Meta - Research Scientist, Machine Learning (PhD)

Meta

Sunnyvale, California, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Ontario, Canada

PwC - PwC Private, Philanthropic Tax, Manager (Bilingual FR/EN)

PwC

Montreal, Quebec, Canada (Hybrid)
4 Months ago
Maxis Studios - Technical Artist - The Sims

Maxis Studios

Vancouver, British Columbia, Canada (Hybrid)
2 Months ago
Snowed In Studios - Lead Software Developer - Montreal

Snowed In Studios

Quebec, Canada (Remote)
3 Months ago
Ingenuity Studios, LLC - Pipeline TD

Ingenuity Studios, LLC

Vancouver, British Columbia, Canada (Remote)
7 Months ago
Cloud Chamber - Senior Hard Surface Artist

Cloud Chamber

Montréal, Québec, Canada (Hybrid)
1 Month ago
RaceRocks - Engineering Manager (distributed learning platform)

RaceRocks

British Columbia, Canada (Remote)
5 Days ago
Skybox Labs - Design Director

Skybox Labs

Burnaby, British Columbia, Canada (Hybrid)
4 Months ago
Electronic Arts - Release Manager - Battlefield

Electronic Arts

Quebec, Canada (On-Site)
1 Month ago
Scanline VFX - Creative Editor

Scanline VFX

Vancouver, British Columbia, Canada (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Scanline VFX - Research Intern (Summer 2025)

Scanline VFX

Los Angeles, California, United States (Hybrid)
3 Months ago
Canva - Machine Learning Engineering Manager (m/f/x) - Canva Austria

Canva

Vienna, Vienna, Austria (Remote)
3 Months ago
Mashgin - Senior Software Engineer, Computer Vision and Deep Learning

Mashgin

Palo Alto, California, United States (Hybrid)
3 Months ago
Microsoft - Member of Technical Staff, High Performance Computing Engineer

Microsoft

Mountain View, California, United States (Hybrid)
1 Month ago
Netomi - Data Scientist - I

Netomi

Gurugram, Haryana, India (Hybrid)
3 Months ago
Level AI - Lead Solutions Architect - Post Sales (Remote - US)

Level AI

United States (Remote)
4 Months ago
Chiselon Technologies   - Data Scientist

Chiselon Technologies

Hyderabad, Telangana, India (On-Site)
2 Months ago
Arrise Solutions (India)   - Senior ML Engineer

Arrise Solutions (India)

Hyderabad, Telangana, India (On-Site)
4 Months ago
Keywords Studios (Player Support) - Research Associates - Fresher

Keywords Studios (Player Support)

Gurugram, Haryana, India (On-Site)
1 Week ago
Inworld AI - Staff / Principal AI Researcher - USA

Inworld AI

Mountain View, California, United States (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded