Talent.com
Research Engineer - Distributed Training

Research Engineer - Distributed Training

Prime IntellectSan Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

At Prime Intellect, we are on a mission to accelerate open and decentralized AI progress by enabling anyone to contribute compute, code or capital to train powerful, open models. Our ultimate goal? Openly accessible AGI that benefits everyone. But we can't do it alone and we want to do this together with you.

We are building the infrastructure for decentralized AI development at scale. We aggregate global compute and enable researchers to collaboratively train state-of-the-art models through distributed training across clusters.

As a Research Engineer working on Distributed Training, you'll play a crucial role in shaping our technological direction, focusing on our decentralizing AI training stack. If you love scaling things and maximizing training efficiency, this role is for you.

Responsibilities

Lead and participate in novel research to build a massive scale, highly reliable and secure decentralized training orchestration solution

Optimize the performance, cost, and resource utilization of AI workloads by leveraging the most recent advances for compute & memory optimization techniques.

Contribute to the development of our open-source libraries and frameworks for distributed model training.

Publish research in top-tier AI conferences such as ICML & NeurIPS.

Distill highly technical project outcomes in layman approachable technical blogs to our customers and developers.

Stay up-to-date with the latest advancements in AI / ML infrastructure and tools, decentralized training research and proactively identify opportunities to enhance our platform's capabilities and user experience.

Requirements

Strong background in AI / ML engineering, with extensive experience in designing and implementing end-to-end pipelines for training and deploying large-scale AI models.

Deep expertise in distributed training techniques, frameworks (e.g., PyTorch Distributed, DeepSpeed, MosaicML’s LLM Foundry), and tools (e.g. Ray) for optimizing the performance and scalability of AI workloads.

Experience in large-scale model training incl. distributed training techniques such as data, tensor & pipeline parallelism

Solid understanding of MLOps best practices, including model versioning, experiment tracking, and continuous integration / deployment (CI / CD) pipelines.

Passion for advancing the state-of-the-art in decentralized AI model training and democratizing access to AI capabilities for researchers, developers, and businesses worldwide.

If you're not familiar with these, but feel like that you can contribute to our mission and you're a high-energy person, get familiar with these resources ( here , here and here ) and please reach out!

Benefits & Perks

Competitive compensation, including equity incentives, aligning your success with the growth and impact of Prime Intellect.

Flexible work arrangements, with the option to work remotely or in-person at our offices in San Francisco.

Visa sponsorship and relocation assistance for international candidates.

Quarterly team off-sites, hackathons, conferences and learning opportunities.

Opportunity to work with a talented, hard-working and mission-driven team, united by a shared passion for leveraging technology to accelerate science and AI.

We recently raised $15mm in funding (total of $20mm raised) led by Founders Fund, with participation from Menlo Ventures and prominent angels including Andrej Karpathy (Eureka AI, Tesla, OpenAI), Tri Dao (Chief Scientific Officer of Together AI), Dylan Patel (SemiAnalysis), Clem Delangue (Huggingface), Emad Mostaque (Stability AI) and many others.

If you're excited about the opportunity to build the foundation for the future of decentralized AI and create a platform that empowers developers and researchers to push the boundaries of what's possible, we'd love to hear from you.

#J-18808-Ljbffr

Create a job alert for this search

Research Engineer • San Francisco, CA, United States

Related jobs
  • Promoted
R&D Engineer, Detector Design, Model, and Analysis

R&D Engineer, Detector Design, Model, and Analysis

PsiQuantumPalo Alto, CA, United States
Full-time
Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show moreLast updated: 30+ days ago
  • Promoted
Staff Research Engineer

Staff Research Engineer

Ambience Healthcare, Inc.San Francisco, CA, United States
Full-time
Ambience Healthcare is the leading AI platform for documentation, coding, and clinical workflow, built to reduce administrative burden and protect revenue integrity at the point of care.Trusted by ...Show moreLast updated: 30+ days ago
  • Promoted
Senior AI Research Engineer, Model Inference (Remote)

Senior AI Research Engineer, Model Inference (Remote)

Tether Operations LimitedSan Francisco, CA, United States
Remote
Full-time
Join Tether and Shape the Future of Digital Finance.At Tether, we’re building solutions that empower businesses to integrate reserve-backed tokens across blockchains with transparency and trust in ...Show moreLast updated: 7 days ago
  • Promoted
  • New!
AI Research Engineer

AI Research Engineer

VirtualVocationsHayward, California, United States
Full-time
A company is looking for an AI Research Engineer specializing in LLM orchestration and prompting.Key Responsibilities Build LLM-powered software by designing prompt flows and orchestrations for o...Show moreLast updated: 22 hours ago
  • Promoted
Research Engineer

Research Engineer

ZyphraPalo Alto, CA, US
Full-time
This will involve designing and rigorously testing novel model architectures and training methodologies, focusing on improving core modelling capabilities (in loss per flop or loss per parameter) a...Show moreLast updated: 30+ days ago
  • Promoted
AI Research Engineer, Lead

AI Research Engineer, Lead

Menlo VenturesSan Francisco, CA, United States
Full-time
The Technical Lead will drive AI research in one or more of the following areas : structure prediction, protein design, and lead optimization. PhD in AI, Machine Learning, Bioinformatics, or related ...Show moreLast updated: 25 days ago
  • Promoted
LLM Research Engineer

LLM Research Engineer

Cypress HCMMountain View, CA, US
Full-time
Design, train, and fine-tune large language models (e.GPT, LLaMA, PaLM) for various applications.Conduct research on cutting-edge techniques in natural language processing (NLP) and machine learnin...Show moreLast updated: 2 days ago
Research Scientist / Engineer – Training Infrastructure

Research Scientist / Engineer – Training Infrastructure

IntelliPro Group Inc.Palo Alto, CA, US
Full-time
Quick Apply
Research Scientist / Engineer – Training Infrastructure Position Type : Full time Location : Palo Alto, CA • Remote - US • Remote - International Salary Range : $220,000 - $300...Show moreLast updated: 11 days ago
  • Promoted
  • New!
Research Engineer, Machine Learning (Horizons) San Francisco, CA | New York City, NY

Research Engineer, Machine Learning (Horizons) San Francisco, CA | New York City, NY

Alcides FonsecaSan Francisco, CA, United States
Full-time
Research Engineer, Machine Learning (Horizons).Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. Our Horizons team leads reinforcement learning research and develop...Show moreLast updated: 4 hours ago
Senior AI Research Engineer, Model Inference (100% Remote)

Senior AI Research Engineer, Model Inference (100% Remote)

Tether Operations LimitedSan Francisco, CA, US
Remote
Full-time
Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...Show moreLast updated: 1 day ago
  • Promoted
Research Engineer, Focused Bets

Research Engineer, Focused Bets

OpenAISan Francisco, CA, United States
Full-time
The Strategic Deployment team makes frontier models more capable, reliable, and aligned to transform high-impact domains. On one hand, this involves deploying models in real-world, high-stakes setti...Show moreLast updated: 30+ days ago
  • Promoted
Senior Applied ML Research Engineer, Early Stage Project

Senior Applied ML Research Engineer, Early Stage Project

X Development, LLCMountain View, CA, United States
Full-time
Senior Applied ML Research Engineer, Early Stage Project — Mountain View, CA (HQ).X is Alphabet’s moonshot factory with a mission of inventing and launching moonshot technologies that could someday...Show moreLast updated: 23 hours ago
  • Promoted
ML Research Engineer San Francisco, California

ML Research Engineer San Francisco, California

ExaSan Francisco, CA, United States
Full-time
We're looking for an ML research engineer to train embedding models for perfect search over the web.The role involves dreaming up novel transformer-based search architectures, creating datasets, cr...Show moreLast updated: 30+ days ago
  • Promoted
Senior Research Engineer, On-Device Personal Intelligence

Senior Research Engineer, On-Device Personal Intelligence

Samsung Electronics GmbHMountain View, CA, United States
Full-time
Artificial Intelligence Center.Samsung AI Research Center (AIC) located in Mountain View, California, is currently recruiting outstanding scientists for the Language Intelligence lab.Our goal is to...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
Senior Research And Development Engineer

Senior Research And Development Engineer

Cambridge RecruitersPleasanton, CA, US
Full-time
Commercial-Stage, 200-Person Minimally-Invasive Interventional Technology medical device company.Very well-funded with $50MM financing recently secured. We are looking for a Senior R&D Sustainin...Show moreLast updated: 19 hours ago
  • Promoted
Research Scientist / Engineer - Training Infrastructure

Research Scientist / Engineer - Training Infrastructure

IntelliPro Group Inc.Palo Alto, CA, US
Full-time
Research Scientist / Engineer – Training Infrastructure.Palo Alto, CA • Remote - US • Remote - International.We believe that multimodality is critical for intelligence.To go beyond ...Show moreLast updated: 2 days ago
  • Promoted
Research Engineer / Research Scientist - Foundations Retrieval Lead

Research Engineer / Research Scientist - Foundations Retrieval Lead

OpenAISan Francisco, CA, United States
Full-time
Research Engineer / Research Scientist - Foundations Retrieval Lead.The Foundations Research team works on high-risk, high-reward ideas that could shape the next decade of AI.Our goal is to advance...Show moreLast updated: 18 days ago
  • Promoted
Research Engineer - Machine Learning & Systems

Research Engineer - Machine Learning & Systems

World LabsSan Francisco, CA, United States
Full-time
We are looking for a versatile Research Engineer with a strong background in machine learning or 3D, software development, and systems design. This role is ideal for someone excited about bridging c...Show moreLast updated: 13 days ago