Talent.com
No longer accepting applications
Reinforcement Learning Engineer

Reinforcement Learning Engineer

Code MetalSan Francisco, CA, United States
3 days ago
Job type
  • Full-time
Job description

At Code Metal AI, you’ll be part of a world class team with talent from MIT, OpenAI and other top companies, focused on pioneering work in large language models (LLMs) and code generation. Our projects directly involve leading chip manufacturers, applying advanced AI to solve meaningful, practical challenges with real world impact.

This role bridges two critical areas :

Production

  • Build and maintain robust distributed training systems using PyTorch (2+ years experience required).
  • Design and implement scalable data curation and quality assurance pipelines to ensure top-tier training datasets.
  • Develop orchestration tools that manage complex workflows across large-scale AI model training and evaluation.

Research

  • Drive innovation by developing evaluation frameworks and reinforcement learning solutions, including recent advancements in Reinforcement Learning with Human Feedback (RLHF).
  • Engage with frontier research through open-source projects and potential publications, applying RLHF to Large Language Models (LLMs), ideally focusing on code generation tasks.
  • Qualifications

  • 2+ years experience in distributed training, preferably with PyTorch.
  • Strong background in reinforcement learning, with recent RLHF experience highly preferred.
  • Proven ability to build data curation and quality assurance pipelines.
  • Experience with evaluation framework development.
  • Ideally, experience across both data pipeline and orchestration sides.
  • Eligible for TS / SCI clearance.
  • Nice to have

  • Contributions to open-source AI or ML projects.
  • Published work or demonstrable research experience in related fields.
  • Hands‑on experience applying RLHF to LLMs, especially for code generation.
  • Experience with large‑scale synthetic data generation.
  • Benefits

  • Health care plan with 100% premium coverage, including medical, dental, and vision.
  • 401k with 5% matching.
  • Paid Time Off (Uncapped Vacation, plus Sick & Public Holidays).
  • Flexible hybrid work arrangement.
  • Relocation assistance for qualifying employees.
  • #J-18808-Ljbffr

    Create a job alert for this search

    Engineer Reinforcement Learning • San Francisco, CA, United States

    Related jobs
    • Promoted
    Reinforcement Learning Engineer, Policy, Optimus

    Reinforcement Learning Engineer, Policy, Optimus

    TeslaPalo Alto, CA, United States
    Full-time
    Tesla AI is solving robust embodied intelligence through humanoid robots.The goal ofourreinforcement learningteamis to build anddemonstratea general robot learning system that canleverageAI to perf...Show moreLast updated: 2 days ago
    • Promoted
    Reinforcement Learning Engineer, Self-Driving

    Reinforcement Learning Engineer, Self-Driving

    TeslaPalo Alto, CA, United States
    Full-time
    Tesla is looking for strong Machine Learning Engineers to help build foundation models for robotics to drive the future of autonomy across all current and future generations of vehicles.You will wo...Show moreLast updated: 2 days ago
    • Promoted
    Machine Learning Engineer, Reinforcement Learning, Self-Driving

    Machine Learning Engineer, Reinforcement Learning, Self-Driving

    Tesla Motors, Inc.Palo Alto, CA, United States
    Full-time
    Tesla is looking for strong Machine Learning Engineers to help build foundation models for robotics to drive the future of autonomy across all current and future generations of vehicles.You will wo...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer, Distributed Training, Optimus

    Machine Learning Engineer, Distributed Training, Optimus

    Tesla Motors, Inc.Palo Alto, CA, United States
    Full-time
    As a Software Engineer for the Optimus team, you will build the tools and infrastructure to make and measure improvements to neural network architecture, visualize data, assist with exporting and d...Show moreLast updated: 30+ days ago
    • Promoted
    Reinforcement Learning Research Engineer

    Reinforcement Learning Research Engineer

    Strativ GroupSan Francisco, CA, United States
    Full-time
    This range is provided by Strativ Group.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. A scaling, SOTA Generative AI Startup operating with a w...Show moreLast updated: 7 days ago
    • Promoted
    LLM Training Frameworks and Optimization Engineer

    LLM Training Frameworks and Optimization Engineer

    Together AISan Francisco, CA, United States
    Full-time
    LLM Training Frameworks and Optimization Engineer.LLM Training Frameworks and Optimization Engineer.LLM Training Frameworks and Optimization Engineer. LLM Training Frameworks and Optimization Engine...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer, Planning

    Machine Learning Engineer, Planning

    WaymoSan Francisco, CA, United States
    Full-time
    Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...Show moreLast updated: 30+ days ago
    • Promoted
    AI Engineer, Reinforcement Learning & Distillation, Tesla AI

    AI Engineer, Reinforcement Learning & Distillation, Tesla AI

    TeslaPalo Alto, CA, United States
    Full-time
    Scaling transformers, as well as more recent advances in Reinforcement Learning with Verifiable Rewards (RLVR), has created models with PhD level intelligence in a wide variety of subject areas - f...Show moreLast updated: 1 day ago
    • Promoted
    Sr. Machine Learning Engineer (Recommendation Systems)

    Sr. Machine Learning Engineer (Recommendation Systems)

    PhiloSan Francisco, CA, United States
    Full-time
    At Philo, we’re a group of technology and product people who set out to build the future of television, marrying the best in modern technology with the most compelling medium ever invented.We lever...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer, 2+ Years Experience

    Machine Learning Engineer, 2+ Years Experience

    TwelveLabsSan Francisco, CA, United States
    Full-time
    Machine Learning Engineer, 2+ Years Experience.Machine Learning Engineer, 2+ Years Experience.This range is provided by TwelveLabs. Your actual pay will be based on your skills and experience — talk...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer, Integration, AI Platforms

    Machine Learning Engineer, Integration, AI Platforms

    TeslaPalo Alto, CA, United States
    Full-time
    Machine Learning Engineer, Integration, AI Platforms.Tesla is a leader in innovative technology, pioneering advancements in autonomous vehicles and humanoid robotics. Our cutting-edge AI platform po...Show moreLast updated: 30+ days ago
    • Promoted
    MACHINE LEARNING ENGINEER

    MACHINE LEARNING ENGINEER

    IeeerusbSan Francisco, CA, United States
    Full-time
    Autodesk’s San Francisco, CA office has multiple openings for the following positions (various types / levels); Some telecommuting permitted : . MACHINE LEARNING ENGINEER [Job Code : RV068] Conduct resea...Show moreLast updated: 10 days ago
    • Promoted
    Machine Learning Systems Engineer, RL Engineering

    Machine Learning Systems Engineer, RL Engineering

    AnthropicSan Francisco, CA, United States
    Full-time
    Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer, Training Infrastructure

    Machine Learning Engineer, Training Infrastructure

    Hedra, IncSan Francisco, CA, United States
    Full-time
    Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures.We're building Hedra Studio, a multimodal creation platform capable of control, emotion,...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer, Reinforcement Learning, Self-Driving

    Machine Learning Engineer, Reinforcement Learning, Self-Driving

    TeslaPalo Alto, CA, United States
    Full-time
    Machine Learning Engineer, Motion Planning, Self-Driving.Machine Learning Engineer, Motion Planning, Self-Driving.Machine Learning Engineer, Motion Planning, Self-Driving.Be among the first 25 appl...Show moreLast updated: 30+ days ago
    • Promoted
    Humanoid Robot Learning Engineer — RL & Imitation

    Humanoid Robot Learning Engineer — RL & Imitation

    Agility RoboticsSan Francisco, CA, United States
    Full-time
    A robotics company in San Francisco seeks a skilled individual to join their AI innovation team.This role involves developing and testing advanced methods for humanoid robots, focusing on reinforce...Show moreLast updated: 2 days ago
    • Promoted
    Machine Learning Engineer - Post Training

    Machine Learning Engineer - Post Training

    EPM ScientificSan Francisco County, CA, United States
    Full-time
    Machine Learning Engineer - Post Training.A stealth-stage venture backed by Lux Capital (investors in DeepMind and OpenAI) is developing frontier-scale AI systems for high-impact applications in hu...Show moreLast updated: 18 days ago
    • Promoted
    Machine Learning Engineer, Training Infrastructure

    Machine Learning Engineer, Training Infrastructure

    Ipro Networks Pte. Ltd.San Francisco, CA, United States
    Full-time
    Job Title : Machine Learning Engineer, Training Infrastructure | Position Type : Full time | Location : San Francisco, CA, USA | Salary Range : $150,000 - $250,000 (USD) | Job ID# : 158135.Design, imple...Show moreLast updated: 30+ days ago