Talent.com
Research Engineer - Distributed Training
Research Engineer - Distributed TrainingPrime Intellect • San Francisco, CA, United States
Research Engineer - Distributed Training

Research Engineer - Distributed Training

Prime Intellect • San Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

Building Open Superintelligence Infrastructure

Prime Intellect is building the open superintelligence stack - from frontier agentic models to the infra that enables anyone to create, train, and deploy them. We aggregate and orchestrate global compute into a single control plane and pair it with the full rl post-training stack : environments, secure sandboxes, verifiable evals, and our async RL trainer. We enable researchers, startups and enterprises to run end-to-end reinforcement learning at frontier scale, adapting models to real tools, workflows, and deployment contexts.

As a Research Engineer working on Distributed Training, you'll play a crucial role in shaping our technological direction, focusing on our decentralizing AI training stack. If you love scaling things and maximizing training efficiency, this role is for you.

Responsibilities

Lead and participate in novel research to build a massive scale, highly reliable and secure decentralized training orchestration solution

Optimize the performance, cost, and resource utilization of AI workloads by leveraging the most recent advances for compute & memory optimization techniques.

Contribute to the development of our open-source libraries and frameworks for distributed model training.

Publish research in top-tier AI conferences such as ICML & NeurIPS.

Distill highly technical project outcomes in layman approachable technical blogs to our customers and developers.

Stay up-to-date with the latest advancements in AI / ML infrastructure and tools, decentralized training research and proactively identify opportunities to enhance our platform's capabilities and user experience.

Requirements

Strong background in AI / ML engineering, with extensive experience in designing and implementing end-to-end pipelines for training and deploying large-scale AI models.

Deep expertise in distributed training techniques, frameworks (e.g., PyTorch Distributed, DeepSpeed, MosaicML’s LLM Foundry), and tools (e.g. Ray) for optimizing the performance and scalability of AI workloads.

Experience in large-scale model training incl. distributed training techniques such as data, tensor & pipeline parallelism

Solid understanding of MLOps best practices, including model versioning, experiment tracking, and continuous integration / deployment (CI / CD) pipelines.

Passion for advancing the state-of-the-art in decentralized AI model training and democratizing access to AI capabilities for researchers, developers, and businesses worldwide.

If you're not familiar with these, but feel like that you can contribute to our mission and you're a high-energy person, get familiar with these resources (here, here and here) and please reach out!

Benefits & Perks

Competitive compensation, including equity incentives, aligning your success with the growth and impact of Prime Intellect.

Flexible work arrangements, with the option to work remotely or in-person at our offices in San Francisco.

Visa sponsorship and relocation assistance for international candidates.

Quarterly team off-sites, hackathons, conferences and learning opportunities.

Opportunity to work with a talented, hard-working and mission-driven team, united by a shared passion for leveraging technology to accelerate science and AI.

We recently raised $15mm in funding (total of $20mm raised) led by Founders Fund, with participation from Menlo Ventures and prominent angels including Andrej Karpathy (Eureka AI, Tesla, OpenAI), Tri Dao (Chief Scientific Officer of Together AI), Dylan Patel (SemiAnalysis), Clem Delangue (Huggingface), Emad Mostaque (Stability AI) and many others.

If you're excited about the opportunity to build the foundation for the future of decentralized AI and create a platform that empowers developers and researchers to push the boundaries of what's possible, we'd love to hear from you.

#J-18808-Ljbffr

Create a job alert for this search

Engineer Distributed • San Francisco, CA, United States

Related jobs
Research Engineer, Pre-training

Research Engineer, Pre-training

Anthropic • San Francisco, CA, United States
Full-time
Research Engineer, Pre-training.Get AI-powered advice on this job and more exclusive features.Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be saf...Show more
Last updated: 13 days ago • Promoted
Research Development Specialist - STEM

Research Development Specialist - STEM

Another Source • Redwood City, CA, United States
Full-time
Stanford University is one of the world's premier academic and research institutions, devoting tremendous intellectual and physical resources toward the betterment of humanity.As a major Silicon Va...Show more
Last updated: 30+ days ago • Promoted
UX Researcher, Mixed Methods

UX Researcher, Mixed Methods

META • Menlo Park, CA, United States
Full-time
Our UX Research team is designing for the broad spectrum of human needs, which requires us to understand the behaviors of the people behind them. Our researchers tackle some of the most complex chal...Show more
Last updated: 19 days ago • Promoted
Research Software Engineer

Research Software Engineer

Schlumberger • Menlo Park, CA, United States
Full-time
SLB's Software Technology Innovation Center (STIC) is looking for an experienced software engineer with enthusiasm to explore new technologies and drive innovation projects in the Foundations Lab.D...Show more
Last updated: 6 days ago • Promoted
Research Software Engineer

Research Software Engineer

SLB • Menlo Park, CA, United States
Full-time
SLB's Software Technology Innovation Center (STIC) is looking for an experienced software engineer with enthusiasm to explore new technologies and drive innovation projects in the Foundations Lab.D...Show more
Last updated: 5 days ago • Promoted
Research Engineer - Distributed Training

Research Engineer - Distributed Training

Menlo Ventures • San Francisco, CA, United States
Full-time
At Prime Intellect, we are on a mission to accelerate open and decentralized AI progress by enabling anyone to contribute compute, code or capital to train powerful, open models.Our ultimate goal? ...Show more
Last updated: 5 days ago • Promoted
ML Research Engineer - Training

ML Research Engineer - Training

Achira • San Francisco, CA, United States
Full-time
Join a world‑class team of scientists, ML researchers, and engineers working together to make the physical microcosm predictable and reshape the future of drug discovery. Move beyond the beaten path...Show more
Last updated: 30+ days ago • Promoted
Research Engineer, ML Systems (All Industry Levels)

Research Engineer, ML Systems (All Industry Levels)

Character.AI • San Francisco, CA, United States
Full-time
Research Engineer, ML Systems (All Industry Levels).Research Engineer, ML Systems (All Industry Levels).Research Engineer, ML Systems (All Industry Levels). Research Engineer, ML Systems (All Indust...Show more
Last updated: 30+ days ago • Promoted
Applied AI Engineer

Applied AI Engineer

Catalyst Labs, LLC • Menlo Park, CA, United States
Full-time
About the job Applied AI Engineer.Catalyst Labs is a leading talent agency with a specialized vertical in Applied AI, Machine Learning, and Data Science. We stand out as an agency that's deeply embe...Show more
Last updated: 19 days ago • Promoted
Scientific Engineering Associate (Micro-Scale Applications Group)

Scientific Engineering Associate (Micro-Scale Applications Group)

Lawrence Berkeley Lab • Berkeley, CA, United States
Full-time
Berkeley Lab's (LBNL) Joint Genome Institute (JGI) has an opening for a Scientific Engineering Associate to join the Micro-Scale Applications Group for cutting-edge developments in microbiome resea...Show more
Last updated: 8 days ago • Promoted
Software Engineer, Machine Learning - Engineering Systems and AI Research

Software Engineer, Machine Learning - Engineering Systems and AI Research

Snowflake Computing • Menlo Park, CA, United States
Full-time
Snowflake is about empowering enterprises to achieve their full potential - and people too.With a culture that's all in on impact, innovation, and collaboration, Snowflake is the sweet spot for bui...Show more
Last updated: 30+ days ago • Promoted
Distributed Training Engineer

Distributed Training Engineer

Periodic Labs • Menlo Park, CA, United States
Full-time
We are an AI + physical sciences lab building state of the art models to make novel scientific discoveries.We are well funded and growing rapidly. Team members are owners who identity and solve prob...Show more
Last updated: 18 days ago • Promoted
Data Engineer - Robotics & Foundation Models

Data Engineer - Robotics & Foundation Models

Approach Venture LLC • Berkeley, CA, United States
Full-time
Data Engineer - Help Build the Future of Intelligent Robotics Systems!.Join a fast-growing robotics startup on a mission to redefine how machines learn, adapt, and collaborate with humans.As a Data...Show more
Last updated: 19 days ago • Promoted
Research Engineer

Research Engineer

Solcoa • San Francisco, CA, United States
Full-time
Making the Metals Powering the World.Solcoa exists to stabilize the western rare-earth metal supply chain—powering every fighter jet, EV, wind turbine, phone, and generator.We’re among the very few...Show more
Last updated: 30+ days ago • Promoted
Research Engineer

Research Engineer

gamma.app • San Francisco, CA, United States
Full-time
We're building the creative layer for modern communication.Every month, over a billion people make presentations — but the tools they use to make them haven't evolved in decades.We're changing that...Show more
Last updated: 5 days ago • Promoted
Senior / Staff Machine Learning Engineer

Senior / Staff Machine Learning Engineer

Dexterity • Redwood City, CA, United States
Full-time
At Dexterity, we believe robots can positively transform the world.Our breakthrough technology frees people to do the creative, inspiring, problem-solving jobs that humans do best by enabling robot...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

Scale AI • San Francisco, CA, United States
Full-time
AI is becoming vitally important in every function of our society.At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, ...Show more
Last updated: 19 days ago • Promoted
UX Researcher

UX Researcher

INSPYR Solutions • Menlo Park, CA, United States
Full-time
Burlingame, Seattle, or NY - This role will hybird requiring 1-2 office visits a week as needed.US Citizen, GC Holders or Authorized to Work in the U. You'll specifically lead strategic and evaluati...Show more
Last updated: 19 days ago • Promoted