Talent.com
Research Scientist / Engineer - Training Infrastructure

Research Scientist / Engineer - Training Infrastructure

IntelliPro Group Inc.Palo Alto, CA, US
11 days ago
Job type
  • Full-time
Job description

Job Description

Job Description

Job Title :   Research Scientist / Engineer – Training Infrastructure

Position Type : Full time

Location : Palo Alto, CA

  • Remote - US
  • Remote - International

Salary Range :  $220,000 - $300, 000 (USD)

Job ID# : 154559

Job Description :

We believe that multimodality is critical for intelligence. To go beyond language models and build more aware, capable and useful systems, the next step function change will come from vision. So, we are working on training and scaling up multimodal foundation models for systems that can see and understand, show and explain, and eventually interact with our world to effect change. We are looking for engineers with significant experience solving hard problems in PyTorch, CUDA and distributed systems. You will work alongside the rest of the research team to build & train cutting edge foundation models on thousands of GPUs that are built to scale from the ground up.

Responsibilities

  • Design, implement, and optimize efficient distributed training systems for models with thousands of GPUs
  • Research and implement advanced parallelization techniques (FSDP, Tensor Parallel, Pipeline Parallel, Expert Parallel)
  • Build monitoring, visualization, and debugging tools for large-scale training runs
  • Optimize training stability, convergence, and resource utilization across massive clusters
  • Requirements :

  • Extensive experience with distributed PyTorch training and parallelisms in foundation model training
  • Deep understanding of GPU clusters, networking, and storage systems
  • Familiarity with communication libraries (NCCL, MPI) and distributed system optimization
  • (Preferred) Strong Linux systems administration and scripting capabilities
  • (Preferred) Experience managing training runs across >
  • 100 GPUs

  • (Preferred) Experience with containerization, orchestration, and cloud infrastructure
  • About Us :

    Founded in 2009, IntelliPro is a global leader in talent acquisition and HR solutions. Our commitment to delivering unparalleled service to clients, fostering employee growth, and building enduring partnerships sets us apart. We continue leading global talent solutions with a dynamic presence in over 160 countries, including the USA, China, Canada, Singapore, Japan, Philippines, UK, India, Netherlands, and the EU.

    IntelliPro, a global leader connecting individuals with rewarding employment opportunities, is dedicated to understanding your career aspirations. As an Equal Opportunity Employer, IntelliPro values diversity and does not discriminate based on race, color, religion, sex, sexual orientation, gender identity, national origin, age, genetic information, disability, or any other legally protected group status. Moreover, our Inclusivity Commitment emphasizes embracing candidates of all abilities and ensures that our hiring and interview processes accommodate the needs of all applicants. Learn more about our commitment to diversity and inclusivity at https : / / intelliprogroup.com / .

    Compensation : The pay offered to a successful candidate will be determined by various factors, including education, work experience, location, job responsibilities, certifications, and more. Additionally, IntelliPro provides a comprehensive benefits package, all subject to eligibility.

    Powered by JazzHR

    UYmlewWb2Y

    Create a job alert for this search

    Research Scientist • Palo Alto, CA, US

    Related jobs
    • Promoted
    Machine Learning Research Scientist / Engineer, Reasoning

    Machine Learning Research Scientist / Engineer, Reasoning

    Scale AI, Inc.San Francisco, CA, United States
    Full-time
    At Scale AI, our mission is to accelerate the development of AI applications.For 8 years, Scale has been the leading AI data foundry, fueling the most exciting advancements in AI, including generat...Show moreLast updated: 30+ days ago
    • Promoted
    Research Scientist, Computational Design

    Research Scientist, Computational Design

    SRI InternationalMenlo Park, CA, United States
    Full-time
    Research Scientist, Computational Design.The Computational Design (CD) group of the AI Center at SRI International is seeking a strong and enthusiastic researcher with a Ph.Founded in 1966, SRI's A...Show moreLast updated: 30+ days ago
    • Promoted
    Travel Certified Surgical Technologist

    Travel Certified Surgical Technologist

    MedSource LLCGilroy, CA, US
    Full-time
    MedSource LLC is seeking a travel Certified Surgical Technologist for a travel job in Gilroy, California.Job Description & Requirements. Certified Surgical Technologist.MedSource Travelers provi...Show moreLast updated: 3 days ago
    • Promoted
    Remote Financial Planner - AI Trainer

    Remote Financial Planner - AI Trainer

    Data AnnotationWatsonville, California
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...Show moreLast updated: 30+ days ago
    • Promoted
    UC Cooperative Extension Specialist- Coastal Hydrology, Agriculture, and Water Resilience Located at the University of California, Santa Cruz (25-16)

    UC Cooperative Extension Specialist- Coastal Hydrology, Agriculture, and Water Resilience Located at the University of California, Santa Cruz (25-16)

    University of California Agriculture and Natural ResourcesSanta Cruz, CA, United States
    Full-time
    UC Cooperative Extension Specialist- Coastal Hydrology, Agriculture, and Water Resilience Located at the University of California, Santa Cruz (25-16). University of California Agriculture and Natur...Show moreLast updated: 30+ days ago
    Research Scientist / Engineer – Training Infrastructure

    Research Scientist / Engineer – Training Infrastructure

    IntelliPro Group Inc.Palo Alto, CA, US
    Full-time
    Quick Apply
    Research Scientist / Engineer – Training Infrastructure Position Type : Full time Location : Palo Alto, CA • Remote - US • Remote - International Salary Range : $220,000 - $300...Show moreLast updated: 20 days ago
    • Promoted
    Remote Financial Analyst - AI Trainer

    Remote Financial Analyst - AI Trainer

    Data AnnotationLivermore, California
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...Show moreLast updated: 30+ days ago
    • Promoted
    Travel Radiology Technologist

    Travel Radiology Technologist

    GQR Healthcare-AlliedWatsonville, CA, US
    Full-time
    GQR Healthcare-Allied is seeking a travel Radiology Technologist for a travel job in Watsonville, California.Job Description & Requirements. Job Title : CT / X •Ray Technician.Seeking a CT / X&bu...Show moreLast updated: 2 days ago
    • Promoted
    Research Engineer / Scientist, Trustworthy AI

    Research Engineer / Scientist, Trustworthy AI

    OpenAISan Francisco, CA, United States
    Full-time
    The Safety Systems team is responsible for various safety work to ensure our best models can be safely deployed to the real world to benefit society and is at the forefront of OpenAI's mission to b...Show moreLast updated: 1 day ago
    • Promoted
    Senior Research Scientist, Foundation Model for Simulation

    Senior Research Scientist, Foundation Model for Simulation

    WaymoMountain View, CA, United States
    Full-time
    Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...Show moreLast updated: 30+ days ago
    • Promoted
    ML Research Engineer, ML Systems

    ML Research Engineer, ML Systems

    Scale AI, Inc.San Francisco, CA, United States
    Full-time
    Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering MLEs, researchers, data scientists and opera...Show moreLast updated: 30+ days ago
    • Promoted
    Applied & Computational Research Scientists

    Applied & Computational Research Scientists

    SRI InternationalMenlo Park, CA, United States
    Full-time
    Applied & Computational Research Scientists.The Applied Sciences Laboratory at SRI International is seeking multiple.Applied and Computational Scientists. AI models, reinforcement learning, and algo...Show moreLast updated: 30+ days ago
    • Promoted
    Postdoctoral Fellow - Applied & Computational Scientists

    Postdoctoral Fellow - Applied & Computational Scientists

    SRI InternationalMenlo Park, CA, United States
    Full-time
    Postdoctoral Fellow - Applied & Computational Scientists.The Applied Sciences Laboratory at SRI International is seeking multiple. Postdoctoral Fellows - Applied and Computational Scientists.AI mode...Show moreLast updated: 30+ days ago
    • Promoted
    Remote Senior Financial Analyst - AI Trainer

    Remote Senior Financial Analyst - AI Trainer

    Data AnnotationLivermore, California
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Research Scientist / Research Engineer, Post-Training

    Machine Learning Research Scientist / Research Engineer, Post-Training

    Scale AI, Inc.San Francisco, CA, United States
    Full-time
    Scale works with the industry's leading AI labs to provide high quality data and accelerate progress in GenAI research.We are looking for Research Scientists and Research Engineers with expertise i...Show moreLast updated: 30+ days ago
    • Promoted
    Travel Certified Surgical Technologist - $1,730 per week

    Travel Certified Surgical Technologist - $1,730 per week

    MedSource LLCGilroy, CA, United States
    Full-time
    MedSource LLC is seeking a travel Certified Surgical Technologist for a travel job in Gilroy, California.Job Description & Requirements. Certified Surgical Technologist.MedSource Travelers provides ...Show moreLast updated: 20 days ago
    • Promoted
    Remote Finance Advisor - AI Trainer

    Remote Finance Advisor - AI Trainer

    Data AnnotationMorgan Hill, California
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...Show moreLast updated: 30+ days ago
    • Promoted
    Remote Finance Director - AI Trainer

    Remote Finance Director - AI Trainer

    Data AnnotationMorgan Hill, California
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...Show moreLast updated: 30+ days ago
    • Promoted
    Travel CT Technologist

    Travel CT Technologist

    Genie HealthcareWatsonville, CA, US
    Full-time
    Genie Healthcare is seeking a travel CT Technologist for a travel job in Watsonville, California.Job Description & Requirements. Genie Healthcare Job ID #17229795.Pay package is based on 8 hour ...Show moreLast updated: 2 days ago
    • Promoted
    Field Technical Support Scientist II

    Field Technical Support Scientist II

    Shimadzu Scientific InstrumentsPleasanton, CA, United States
    Full-time
    Field Technical Support Scientist II.Based on your location, you may be eligible for a Cost of Living Adjustment (COLA) of up to. Established in 1975, Shimadzu Scientific Instruments is one of the l...Show moreLast updated: 23 days ago