Talent.com
Research Engineer, Model Performance & Quality
No longer accepting applications
Anthropic • Seattle, WA, US
30+ days ago
Job type
  • Full-time
Job description

About Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the role

As a Research Engineer on the Model Performance team, you will help solve one of our greatest challenges: systematically understanding and monitoring model quality in real time. This role blends research and engineering responsibilities, requiring you to train production models, develop robust monitoring systems, and create novel evaluation methodologies.

Note: For this role, we conduct all interviews in Python.

Representative Projects

  • Build comprehensive training observability systems - Design and implement monitoring infrastructure to keep an eye on how model behaviors evolve throughout training.
  • Develop next-generation evaluation frameworks - Move beyond traditional benchmarks to create evaluations that capture real-world utility.
  • Create automated quality assessment pipelines - Build custom classifiers to continuously monitor RL transcripts for complex issues.
  • Bridge research and production - Partner with research teams to translate cutting-edge evaluation techniques into production-ready systems, and work with engineering teams to ensure our monitoring infrastructure scales with increasingly complex training workflows.

You may be a good fit if you:

  • Are proficient in Python and have experience building production ML systems
  • Have experience with training, evaluating, or monitoring large language models
  • Are energized by the high stakes and intensity of production training, and are ready to jump in wherever needed
  • Are naturally curious about debugging complex, distributed systems and thinking about failure modes
  • Enjoy collaborative problem-solving and working across diverse teams - you'll work on virtually all stages of our model training pipeline
  • Can balance research exploration with engineering rigor
  • Have strong analytical skills for interpreting training metrics and model behavior
  • Want to directly impact the quality and safety of deployed AI systems

Strong candidates may have:

  • Experience with reinforcement learning and language model training pipelines
  • Experience designing and implementing evaluation frameworks or benchmarks
  • Background in production monitoring, observability, and incident response
  • Experience with statistical analysis and experimental design
  • Knowledge of AI safety and alignment research

Strong candidates need not have:

  • Formal certifications or education credentials
  • Academic research experience or publication history
  • Prior experience in AI safety or evaluation specifically

We're looking for thoughtful engineers who are excited about the challenge of measuring and monitoring capabilities we're still discovering. This role offers the opportunity to shape how the field approaches model quality assessment while working on systems that will be critical as AI capabilities continue to advance.

The expected salary range for this position is:

$315,000 - $340,000 USD

Logistics

Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience.

Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.

Visa sponsorship: We do sponsor visas. However, we aren't able to successfully sponsor visas for every role and every candidate. If we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. We think AI systems like the ones we're building have enormous social and ethical implications. We strive to include a range of diverse perspectives on our team.

How we're different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. We value impact (advancing our long-term goals of steerable, trustworthy AI) rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We are highly collaborative and host frequent research discussions to ensure we are pursuing the highest-impact work. We strongly value communication skills.

The easiest way to understand our research directions is to read our recent research. This research continues many directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Come work with us!

Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.

