Deep Learning Scientist, LLM Training Datasets

NVIDIASanta Clara, CA, United States

30+ days ago

Job type

Full-time

Job description

NVIDIA is looking for a dedicated Deep Learning Scientist specializing in LLM training datasets engineering. This is a highly technical role requiring deep expertise in machine learning, data science and data engineering to develop innovative solutions that address the unique challenges of training foundation models. This role involves addressing innovative machine learning challenges through building and improving our data ecosystem.

What you'll be doing :

Develop datasets for LLM pre-training and post training (fine-tuning and reinforcement learning), optimize models and evaluate performance.

Design and implement data strategies for model training and evaluation that includes data collection, cleaning, labeling, augmentation, RL verifier datasets to improve model performance. Actively identify and manage data issues such as outliers, noise, and biases.

Generate high-quality synthetic data to augment existing datasets, especially for domain-specific or safety-critical use cases and multi-modal use cases (text, image, video, etc)

Define data annotation guidelines and curate high-quality labeled datasets for model alignment, including reinforcement learning with human feedback (RLHF).

Conduct experiments to optimize Large Language Models with SFT and RL techniques.

Design and develop systems for reasoning, tool calling, multi-modal processing, RL verifiers.

Implement post-training tasks for LLMs, including fine-tuning, RL, distillation, and performance evaluation, and adjust hyperparameters to improve model quality.

Partner with ML researchers, data scientists, and infrastructure teams to understand data needs, integrate datasets, and deploy efficient ML workflows.

What we need to see :

You have a Master’s or PhD in Computer Science, Electrical Engineering or related field - or equivalent experience.

3+ years of work experience in developing datasets and training large language models or other generative AI models.

Hands-on programming expertise in python.

Solid understanding of machine learning concepts and algorithms for managing data and experiments, including multi-modal datasets.

Experience with synthetic data generation techniques, and evaluation strategies.

Background with high-performance data processing libraries and machine learning frameworks like PyTorch, Data Loader, TensorFlow Data.

Experience with alignment / fine-tuning of LLMs, VLMs (img-to-text, vid-to-text)or any-to-text large models.

Familiarity with distributed training paradigms and optimization techniques.

Good at problem solving and analytical ability as well as excellent collaboration and communication skills.

Demonstrates behaviors that build trust : humility, transparency, respect, intellectual integrity.

Ways to stand out from the crowd :

Strong track record of contributions to open-source data tools or research publications.

Experience with cloud platforms (e.g., AWS, GCP, Azure) and data storage systems (e.g., S3, Google Cloud Storage).

Stay ahead of research : Continuously evaluate new tools, techniques, and methodologies in data engineering and generative AI to improve training data infrastructure.

Passion for AI and a demonstrated commitment to advancing the field through innovative research, prior scientific research and publication experience.

With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology industry's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working with us and our engineering teams are growing fast in some of the hottest state of the art fields : Deep Learning, Artificial Intelligence, and Large Language Models. If you're a creative engineer with a real passion for robust and enjoyable user experiences, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits () .

Applications for this job will be accepted at least until October 21, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Create a job alert for this search

Learning Scientist • Santa Clara, CA, United States

Related jobs

Promoted
New!

Deep Learning Scientist, AI Safety

NVIDIASanta Clara, CA, United States

Full-time

Deep Learning Scientist, AI Safety page is loaded## Deep Learning Scientist, AI Safetylocations : US, CA, Santa Clara : US, CA, Remotetime type : Full timeposted on : Posted Todayjob requisition id : JR...Show moreLast updated: 14 hours ago

Promoted
New!

Senior ML Scientist (Optimization & Reinforcement Learning)

Macpower Digital Assets EdgeSan Jose, CA, United States

Full-time

AI ML-based dynamic pricing algorithms and personalized offer experiences.This role will focus on designing and implementing advanced machine learning models, including reinforcement learning techn...Show moreLast updated: 12 hours ago

Promoted
New!

Deep Learning Research Scientist in San Francisco

Energy Jobline ZRSan Francisco, CA, United States

Full-time

Energy Jobline is the largest and fastest growing global Energy Job Board and Energy Hub.We have an audience reach of over 7 million energy professionals, 400,000+ monthly advertised global energy ...Show moreLast updated: 14 hours ago

Promoted
New!

Staff Machine Learning Research Scientist, LLM Evals

Scale AISan Francisco, CA, United States

Full-time

As the leading data and evaluation partner for frontier AI companies, Scale is dedicated to advancing the evaluation and benchmarking of large language models (LLMs). We are building industry-leadin...Show moreLast updated: 14 hours ago

Promoted

Machine Learning Engineer, Distributed Training, Optimus

Tesla Motors, Inc.Palo Alto, CA, United States

Full-time

As a Software Engineer for the Optimus team, you will build the tools and infrastructure to make and measure improvements to neural network architecture, visualize data, assist with exporting and d...Show moreLast updated: 24 days ago

Promoted
New!

Machine Learning Engineer, Training Infrastructure

Intellipro GroupSan Francisco, CA, United States

Full-time

Machine Learning Engineer, Training Infrastructure.We are looking for an ML Engineer with 3+ YOE in high-performance computing systems to manage and optimize our computational infrastructure for tr...Show moreLast updated: 14 hours ago

Promoted

Tech Lead / Manager, Machine Learning Research Scientist- LLM Evals

Scale AI, Inc.San Francisco, CA, United States

Full-time

Promoted
New!

Machine Learning Engineer - Training & Infrastructure

P-1 AISan Francisco, CA, United States

Full-time

We are building an engineering AGI.We founded P-1 AI with the conviction that the greatest impact of artificial intelligence will be on the built worldhelping mankind conquer nature and bend it to ...Show moreLast updated: 14 hours ago

Promoted
New!

Senior Machine Learning Engineer, LLM / VLM Continual Pre-training

WaymoSan Francisco, CA, United States

Full-time

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...Show moreLast updated: 14 hours ago

Promoted

Autonomy Engineer - Deep Learning Model Acceleration

SkydioSan Mateo, CA, United States

Full-time

Skydio is the leading US drone company and the world leader in autonomous flight, the key technology for the future of drones and aerial mobility. The Skydio team combines deep expertise in artifici...Show moreLast updated: 30+ days ago

Promoted
New!

Machine Learning Scientist- LLM / AI, San Jose

Tik TokSan Jose, CA, United States

Full-time

About the team PDPO(Privacy and Data Protection Office) is the organization to lead, supervise, and empower all TikTok's privacy work in an accountable and industry leading way.This team is the exp...Show moreLast updated: 14 hours ago

Promoted

Autonomy Engineer - Deep Learning

SkydioSan Mateo, CA, United States

Full-time

Promoted
New!

Machine Learning Scientist, NLP (All Levels)

AbridgeSan Francisco, CA, United States

Full-time

Abridge was founded in 2018 with the mission of powering deeper understanding in healthcare.Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation eff...Show moreLast updated: 14 hours ago

Promoted

Machine Learning Engineer, Training Infrastructure

HEDRA INCSan Francisco, CA, United States

Full-time

Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures.We're building Hedra Studio, a multimodal creation platform capable of control, emotion,...Show moreLast updated: 30+ days ago

Promoted
New!

ML Research Engineer - Training

AchiraSan Francisco, CA, United States

Full-time

Join a world?class team of scientists, ML researchers, and engineers working together to make the physical microcosm predictable and reshape the future of drug discovery. Move beyond the beaten path...Show moreLast updated: 14 hours ago

Promoted

Machine Learning Engineer, Training Infrastructure

Hedra, IncSan Francisco, CA, United States

Full-time

Promoted

Staff Machine Learning Research Scientist, LLM Evals

Scale AI, Inc.San Francisco, CA, United States

Full-time

Promoted
New!

Machine Learning Engineer - Post Training

EPM ScientificSan Francisco, CA, United States

Full-time

Machine Learning Engineer - Post Training.A stealth-stage venture backed by Lux Capital (investors in DeepMind and OpenAI) is developing frontier-scale AI systems for high-impact applications in hu...Show moreLast updated: 14 hours ago