Machine Learning Engineer - Model Evaluations, Public Sector

Scale AI, Inc. • San Francisco, CA, United States

16 hours ago

Job type

Full-time

Job description

The Public Sector ML team at Scale deploys advanced AI systems—including LLMs, agentic models, and multimodal pipelines—into mission‑critical government environments. We build evaluation frameworks that ensure these models operate reliably, safely, and effectively under real‑world constraints. As an ML Engineer, you will design, implement, and scale automated evaluation pipelines that help customers trust and operationalize advanced AI systems across defense, intelligence, and federal missions.

You will:

Develop and maintain automated evaluation pipelines for ML models across functional, performance, robustness, and safety metrics, including LLM‑judge‑based evaluations.
Design test datasets and benchmarks to measure generalization, bias, explainability, and failure modes.
Build evaluation frameworks for LLM agents, including infrastructure for scenario‑based and environment‑based testing.
Conduct comparative analyses of model architectures, training procedures, and evaluation outcomes.
Implement tools for continuous monitoring, regression testing, and quality assurance for ML systems.
Design and execute stress tests and red‑teaming workflows to uncover vulnerabilities and edge cases.
Collaborate with operations teams and subject‑matter experts to produce high‑quality evaluation datasets.

Security clearance requirement

This role requires an active security clearance or the ability to obtain one.

Qualifications

Experience in computer vision, deep learning, reinforcement learning, or NLP in production settings.

Strong programming skills in Python; experience with TensorFlow or PyTorch.

Background in algorithms, data structures, and object‑oriented programming.

Experience with LLM pipelines, simulation environments, or automated evaluation systems.

Ability to convert research insights into measurable evaluation criteria.

Nice to have

Graduate degree in CS, ML, or AI.

Cloud experience (AWS, GCP) and model deployment experience.

Experience with LLM evaluation, CV robustness, or RL validation.

Knowledge of interpretability, adversarial robustness, or AI safety frameworks.

Familiarity with ML evaluation frameworks and agentic model design.

Experience in regulated, classified, or mission‑critical ML domains.

We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or veteran status.

We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at accommodations@scale.com.

We comply with the United States Department of Labor's Pay Transparency provision.
