Talent.com
Machine Learning Engineer - Model Evaluations, Public Sector
Scale AI, Inc. • San Francisco, CA, United States
16 hours ago
Job type
  • Full-time
Job description

The Public Sector ML team at Scale deploys advanced AI systems—including LLMs, agentic models, and multimodal pipelines—into mission‑critical government environments. We build evaluation frameworks that ensure these models operate reliably, safely, and effectively under real‑world constraints. As an ML Engineer, you will design, implement, and scale automated evaluation pipelines that help customers trust and operationalize advanced AI systems across defense, intelligence, and federal missions.

You will:

  • Develop and maintain automated evaluation pipelines for ML models across functional, performance, robustness, and safety metrics, including LLM‑judge‑based evaluations.
  • Design test datasets and benchmarks to measure generalization, bias, explainability, and failure modes.
  • Build evaluation frameworks for LLM agents, including infrastructure for scenario‑based and environment‑based testing.
  • Conduct comparative analyses of model architectures, training procedures, and evaluation outcomes.
  • Implement tools for continuous monitoring, regression testing, and quality assurance for ML systems.
  • Design and execute stress tests and red‑teaming workflows to uncover vulnerabilities and edge cases.
  • Collaborate with operations teams and subject‑matter experts to produce high‑quality evaluation datasets.

Security clearance requirement

  • This role will require an active security clearance or the ability to obtain a security clearance.

Qualifications

  • Experience in computer vision, deep learning, reinforcement learning, or NLP in production settings.
  • Strong programming skills in Python; experience with TensorFlow or PyTorch.
  • Background in algorithms, data structures, and object‑oriented programming.
  • Experience with LLM pipelines, simulation environments, or automated evaluation systems.
  • Ability to convert research insights into measurable evaluation criteria.

Nice to haves

  • Graduate degree in CS, ML, or AI.
  • Cloud experience (AWS, GCP) and model deployment experience.
  • Experience with LLM evaluation, CV robustness, or RL validation.
  • Knowledge of interpretability, adversarial robustness, or AI safety frameworks.
  • Familiarity with ML evaluation frameworks and agentic model design.
  • Experience in regulated, classified, or mission‑critical ML domains.

We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or veteran status.

We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at accommodations@scale.com.

We comply with the United States Department of Labor's Pay Transparency provision.
