Senior Software Engineer, GenAI Model Evaluation

4 Months ago • 5 Years + • Artificial Intelligence

Job Summary

Job Description

Scale is seeking a Senior Software Engineer to build a world-class model evaluation platform for their GenAI Safety & Evaluation product team. This role involves owning large new areas within the product, working across backend, frontend, and LLMs, and collaborating with cross-functional teams. Must-have skills include proficiency in Python, Node, React, and MongoDB, along with experience in algorithms, data structures, and object-oriented programming.
Must have:
  • Python, Node, React
  • MongoDB experience
  • Algorithms, data structures
  • Object-oriented programming
Good to have:
  • AI platforms & technologies
  • Generative models & LLMs
  • ML infrastructure building
  • AI-powered solutions
Perks:
  • Hyper-growth startup
  • Work with AI technologies

Job Details

About Job

Software is eating the world, but AI is eating software. We live in unprecedented times – AI has the potential to exponentially augment human intelligence. As the world adjusts to this new reality, leading tech companies are racing to build LLMs at billion dollar scale, while large enterprises figure out how to add it to their products. To ensure that these models are safe, aligned, and highly useful, they require extremely high quality human-generated data and evaluation. Since before the launch of ChatGPT, through to the latest generation of frontier models coming out today, Scale has been at the forefront of providing the post-training, fine-tuning, and human preference alignment (RLHF) data needed to ensure these models are capable, aligned, and useful via our Generative AI Data Engine. The data we are producing is some of the most important work for how humanity will interact with AI.

As customers train their models on this data, and constantly aim to improve them, a critical need is having trustworthy evaluations of model performance, and an ability to identify weaknesses and potential vulnerabilities. Conducting these evaluations with our human experts constitutes a significant and growing portion of Scale’s work—thus assisting model developers in iteratively understanding where to focus their technical investments.

The GenAI Safety & Evaluation product team at Scale is at the heart of this work, building a world-class customer-facing model evaluation platform. This platform enables customers to easily launch new evaluation workflows, deep dive into evaluation results down to the test case level to understand weaknesses and benchmark performance, and use these insights to drive model development roadmaps. In building this product, you will have a chance to shape the way that models across the industry are evaluated, impacting billions of people around the world. And as a newer product at Scale, you will have the opportunity to build something impactful from the ground up.

As part of the Safety & Evaluation product team, you will partner closely with researchers from Scale’s Safety, Evaluations, and Alignment Lab (SEAL) on productization of novel research, as well as Scale’s expert red team, which supports AI safety via rigorous model testing trusted by the White House, major enterprises, and leading model developers.

 

We’re looking for entrepreneurial Software Engineers to join our team. In this role, you'll be given the opportunity to build these products and drive millions of dollars in revenue. You’ll also get widespread exposure to the forefront of the AI race as Scale sees it in enterprises, startups, governments, and large tech companies.

The ideal person is a natural entrepreneurial engineer who can take an ambiguous scope and lead the execution of outcomes, doing what it takes to hit them including both backend and frontend coding, defining requirements, coordinating with other eng and operations teams at Scale, etc. We strongly believe the best engineers own outcomes and deeply understand customer problems.

 

You will:

  • Own large new areas within our product, delivering customer-ready features with engineering excellence that stands up to rigorous quality standards
  • Work across backend, frontend, and interacting with LLMs and/or other ML models
  • Work across the entire product lifecycle from conceptualization through production
  • Be able, and willing, to multi-task and learn new technologies quickly
  • Collaborating with cross-functional teams to define, design, and ship new product features and experiences.
  • Be ready to jump in on fast-turnaround product requests for high value customers

 

Ideally you'd have:

  • 5+ years of full-time engineering experience, post-graduation
  • Proficiencies in one or more of Python, Node, React, Next.js and MongoDB
  • Solid background in algorithms, data structures, and object-oriented programming
  • Experience scaling products at hyper-growth startups
  • Excitement to work with AI technologies
  • Strong written and verbal communication skills, to be able to thrive in a writing-first culture
  • Strong problem-solving skills, and be able to work both independently and as part of a team

Nice to haves:

  • Strong knowledge of software engineering best practices
  • Have experience with AI platforms and technologies, including generative models and LLMs
  • Experience building ML infrastructure and AI-powered solutions
  • Experience growing new products from 0 to 1

Similar Jobs

Abnormal Security - Software Engineer II - Fullstack

Abnormal Security

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
undefined - Senior Manager, Customer Support - West Coast

United States (Remote)
4 Months ago
XBorg - Front-End Software Engineer

XBorg

(Remote)
2 Months ago
Axiom Zen - Frontend Software Engineering Intern

Axiom Zen

United States (Remote)
1 Day ago
Logitech - Senior Frontend Developer (React/Svelte)

Logitech

Chennai, Tamil Nadu, India (Hybrid)
1 Month ago
Level AI - Software Engineer - Machine Learning

Level AI

Noida, Uttar Pradesh, India (Hybrid)
4 Months ago
Trend Micro - Sr. AI Engineer

Trend Micro

Taipei City, Taiwan (On-Site)
4 Months ago
Codeway - Prompt Engineer

Codeway

İstanbul, Türkiye (On-Site)
1 Month ago
The Artarium - AI Digital Artist

The Artarium

Gurugram, Haryana, India (On-Site)
6 Months ago
Terralogic - SOFTWARE ENGINEER – AIML QA

Terralogic

Bengaluru, Karnataka, India (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Highspot - Sr. Backend Engineer, Meeting Intelligence

Highspot

Vancouver, British Columbia, Canada (Hybrid)
4 Months ago
undefined - Senior Customer Success Engineer, West

United States (Remote)
4 Months ago
Xsolla - Full Stack Developer

Xsolla

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
7 Months ago
undefined - Technical Consultant, West

United States (Remote)
4 Months ago
Highspot - Sr. Backend Engineer, Meeting Intelligence

Highspot

Vancouver, British Columbia, Canada (Hybrid)
4 Months ago
Flow - Senior/Staff Web Engineer

Flow

Miami, Florida, United States (Hybrid)
4 Months ago
The Sleep Company - Shopify Developer - Front End

The Sleep Company

Maharashtra, India (On-Site)
4 Months ago
GrowthX® - Tech Lead

GrowthX®

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Patterned Learning Career - Senior Full Stack Java Developer

Patterned Learning Career

(Remote)
1 Day ago
Urbint - Senior Full Stack Developer

Urbint

Bengaluru, Karnataka, India (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in San Francisco, California, United States

ION - Debtwire Latin America Primary Bond Market Reporter

ION

New York, New York, United States (On-Site)
3 Weeks ago
Backbone - Electrical Engineer

Backbone

Atherton, California, United States (Hybrid)
6 Months ago
AliveCor - Healthcare Economics & Outcomes Research (HEOR) Lead

AliveCor

United States (On-Site)
3 Months ago
Onward Search - Graphic Production Designer

Onward Search

New York, New York, United States (Remote)
5 Days ago
PlayStation Global - Senior People Tech & Services Analyst

PlayStation Global

Aliso Viejo, California, United States (On-Site)
3 Months ago
Trek - Future Store Manager - Washington State

Trek

Tacoma, Washington, United States (On-Site)
4 Months ago
Funko - Marketing Manager, Amazon - US & LATAM

Funko

Washington, United States (On-Site)
1 Month ago
ByteDance - Research Scientist Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
PTW - Character Concept Artist - Talent Pool

PTW

United States (Remote)
3 Weeks ago
Flow - Senior/Staff Web Engineer

Flow

Miami, Florida, United States (Hybrid)
4 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Microsoft - Research Intern - Generative AI

Microsoft

Redmond, Washington, United States (On-Site)
2 Weeks ago
Nagarro - Principal Engineer, AI / ML

Nagarro

Sri Lanka (Remote)
3 Months ago
Microsoft - Member of Technical Staff, Platform Engineer

Microsoft

Mountain View, California, United States (Hybrid)
1 Month ago
Rackspace Technology - Practice Manager, Data Science, AI and ML

Rackspace Technology

(Remote)
2 Months ago
Coursera - Machine Learning Scientist

Coursera

India (Remote)
1 Month ago
Quizizz - ML Engineer

Quizizz

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Stylumia - Senior Machine Learning Engineer - Time Series & Computer Vision

Stylumia

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
ByteDance - Research Scientist Graduate (Foundation Model, Video Generation) - 2025 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Microsoft - Economics Program Manager

Microsoft

London, England, United Kingdom (On-Site)
2 Weeks ago
AI Fund - Machine Learning Engineer

AI Fund

(Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

Mexico City, Mexico City, Mexico (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

St. Louis, Missouri, United States (Remote)

San Francisco, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Scale AI

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug