Talent.com
Research Engineer, Machine Learning (Horizons) San Francisco, CA | New York City, NY

Research Engineer, Machine Learning (Horizons) San Francisco, CA | New York City, NY

Alcides FonsecaSan Francisco, CA, United States
1 day ago
Job type
  • Full-time
Job description

Overview

Research Engineer, Machine Learning (Horizons) at Anthropic. Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. Our Horizons team leads reinforcement learning research and development, with impact on Claude models such as Claude 3.5 and 3.7 Sonnet. We collaborate across alignment, red teams, applied production training, and RL engineering to build safe, scalable systems.

The Horizons team sits at the intersection of cutting-edge research and engineering excellence, focusing on building high-quality, scalable systems that push the boundaries of what AI can accomplish.

About the Role

As a Research Engineer on the Horizons team, you will collaborate with researchers and engineers to advance the capabilities and safety of large language models. The role blends research and engineering responsibilities, requiring you to implement novel approaches and contribute to the research direction. You will work on fundamental reinforcement learning, create agentic models via tool use for open-ended tasks (e.g., computer use and autonomous software generation), enhance reasoning in areas such as mathematics, and develop prototypes for internal use, productivity, and evaluation.

Representative projects

  • Architect and optimize core reinforcement learning infrastructure, from clean training abstractions to distributed experiment management across GPU clusters; scale research workflows.
  • Design, implement, and test novel training environments, evaluations, and methodologies for RL agents that push the state of the art for the next generation of models.
  • Drive performance improvements across the stack through profiling, optimization, benchmarking; implement efficient caching and debug distributed systems to accelerate training and evaluation.
  • Collaborate across research and engineering to develop automated testing frameworks, design clean APIs, and build scalable infrastructure that accelerates AI research.

You may be a good fit if you

  • Are proficient in Python and async / concurrent programming with frameworks like Trio
  • Have experience with machine learning frameworks (PyTorch, TensorFlow, JAX)
  • Have industry experience in machine learning research
  • Can balance research exploration with engineering implementation
  • Enjoy pair programming
  • Care about code quality, testing, and performance
  • Have strong systems design and communication skills
  • Are passionate about the potential impact of AI and are committed to developing safe and beneficial systems
  • Strong candidates may have

  • Familiarity with LLM architectures and training methodologies
  • Experience with reinforcement learning techniques and environments
  • Experience with virtualization and sandboxed code execution environments
  • Experience with Kubernetes
  • Experience with distributed systems or high-performance computing
  • Experience with Rust and / or C++
  • Strong candidates need not have

  • Formal certifications or education credentials
  • Academic research experience or publication history
  • Logistics

    Education requirements : At least a Bachelor's degree in a related field or equivalent experience.

    Location : Location-based hybrid policy : staff are expected to be in one of our offices at least 25% of the time, with some roles requiring more time in offices.

    Visa sponsorship : We sponsor visas where possible. If we make you an offer, we will make reasonable efforts to assist with visa processes.

    We encourage you to apply even if you do not meet every qualification. We value diverse perspectives and believe AI can benefit from inclusive teams.

    Compensation

    The expected base compensation for this position is listed below; total compensation may include equity, benefits, and incentive components.

    $280,000 - $425,000 USD

    Equal Opportunity

    Anthropic is an equal opportunity employer. We do not discriminate on the basis of protected status. We collect voluntary self-identification information for government reporting purposes, and participation is entirely voluntary.

    #J-18808-Ljbffr

    Create a job alert for this search

    Machine Learning Engineer • San Francisco, CA, United States