Responsibilities
Want to push the boundaries of what reinforcement learning can achieve with frontier models?
In this role you will be advancing reinforcement learning methods for large-scale AI systems. You’ll be applying RL techniques to enhance reasoning, planning, and decision‑making in models that directly impact fields from biology to climate and materials science.
Your work will combine RL with large language models, experimenting with RLHF, PPO, and DPO, designing evaluation frameworks, and fine‑tuning models at scale. The aim is to go beyond benchmarks and deliver models that researchers can use to accelerate discovery.
You will be a driving force in a team that is building towards a broader superintelligence platform : models that don’t just generate text or data, but drive breakthroughs across multiple domains. As part of this, you’ll collaborate with domain experts to ensure your research translates into real‑world scientific progress.
Qualifications
If you have experience with multi‑agent RL, hierarchical / offline RL, or domain‑specific work with scientific datasets you will be an ideal candidate for this position.
Package : $250k - $400k base + bonus + stock
Location : SF Bay area or potential for remote with travel to office when needed.
If you want to see your RL research power the next generation of superintelligence, this is the role for you!
#J-18808-Ljbffr
Senior Research Scientist • San Francisco, CA, United States