Machine Learning System Tooling Tech Lead, Silicon

1 Week ago • 5 Years + • Artificial Intelligence • Research & Development

About the job

Job Description

This role involves designing, developing, and maintaining tools and infrastructure for analyzing Machine Learning (ML) workloads and hardware performance at Google. Responsibilities include developing power and performance models, creating visualizations and dashboards, building benchmarks for workload analysis, and collaborating with cross-functional teams. The ideal candidate possesses a Bachelor's degree (Master's or Ph.D. preferred) in a relevant field, 5+ years of experience with computer architecture concepts, and experience with ML accelerators and tooling development. The position requires expertise in compiler flows and translating high-level languages (like TensorFlow) into hardware instructions. This role is crucial for shaping the next generation of hardware experiences for Google's direct-to-consumer products, delivering unparalleled performance, efficiency, and integration.
Must have:
  • Bachelor's degree in relevant field
  • 5+ years experience with computer architecture
  • Experience with ML accelerators
  • Tooling development for power/performance analysis
  • Develop and maintain performance models
  • Collaborate with cross-functional teams
Good to have:
  • Master's/Ph.D. in performance evaluation for ML systems
  • Experience writing ML algorithms
  • Experience in architecting and optimizing compilers
  • Understanding of compiler flows

Minimum qualifications:

  • Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, a related field, or equivalent practical experience.
  • 5 years of experience with computer architecture concepts, including microarchitecture, cache hierarchy, pipelining, and memory subsystems.

Preferred qualifications:

  • Master's Degree or Ph.D. with an emphasis on performance evaluation for Machine Learning (ML) systems.
  • Experience with ML accelerators (e.g. having worked on ML software models or accelerator architectures).
  • Experience writing ML algorithms for e.g. recommendation systems, Natural Language Processing (NLP), image and vision.
  • Experience in tooling development for power, performance and architecture analysis.
  • Experience in architecting and optimizing compilers.
  • Understanding of compiler flows, software involved in translating a high-level language (e.g. TensorFlow) to hardware instructions.

About the job

Be part of a diverse team that pushes boundaries, developing custom silicon solutions that power the future of Google's direct-to-consumer products. You'll contribute to the innovation behind products loved by millions worldwide. Your expertise will shape the next generation of hardware experiences, delivering unparalleled performance, efficiency, and integration.

Google's mission is to organize the world's information and make it universally accessible and useful. Our team combines the best of Google AI, Software, and Hardware to create radically helpful experiences. We research, design, and develop new technologies and hardware to make computing faster, seamless, and more powerful. We aim to make people's lives better through technology.

Responsibilities

  • Design, develop, and maintain tools and infrastructure for analyzing Machine Learning (ML) workloads and hardware performance.
  • Develop and maintain power and performance models.
  • Develop visualizations and dashboards to effectively communicate performance insights to engineers.
  • Build models, benchmarks for workload analysis and help to drive architectural decisions.
  • Collaborate with cross-functional teams to improve the workload analysis flows, including debuggability and tracing.
View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

San Francisco, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Warsaw, Masovian Voivodeship, Poland (On-Site)

San Bruno, California, United States (On-Site)

Mexico City, Mexico City, Mexico (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Hyderabad, Telangana, India (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Similar Jobs

Neostella - Machine Learning Engineer

Neostella, Colombia (On-Site)

Madison Logic - Sr. Data Scientist

Madison Logic, India (On-Site)

Scale AI - Software Engineer, GenAI Model Evaluation

Scale AI, United States (Hybrid)

Wargaming - Solutions Architect (AI Technology)

Wargaming, Czechia (Hybrid)

Scale AI - Army Acquisitions Writing Expert

Scale AI, United States (Remote)

Google - Staff Software Engineer, Platform-Aware AutoML

Google, United States (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

eBay - ML Staff Software Engineer - Risk

eBay, United States (Hybrid)

Frost & Sullivan - AI Engineer

Frost & Sullivan, India (On-Site)

ByteDance - Research Scientist in Large Model System

ByteDance, United States (On-Site)

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Get notifed when new similar jobs are uploaded