Senior Machine Learning Engineer, LLM

NVIDIA
Redmond, WA, US
Full-time

We are now looking for a Senior Machine Learning Engineer (LLM)! NVIDIA is seeking machine learning engineers who make LLMs faster and slimmer while minimizing training friction.

You can (1) distill LLM research literature into its core, (2) run experiments at scale, (3) generate insights to support or refute the efficacy of a technique, and (4) generate reproducible training recipes.

Our mission is to create new value for NVIDIA's largest customers. We support this mission by building robust, reproducible, and friction-free training recipes that make compression-aware training and inference seamless.

We own the top of the stack, and our largest customers directly interact with the artifacts that you will build.

What you'll be doing:

Keep abreast of research in LLM compression algorithms

Design, implement, and evaluate LLM compression algorithms that minimize regressions, reduce training friction, and improve performance

Run ablation experiments to understand the Pareto frontier

Build robust, reproducible, and portable training recipes

Work with production software teams to realize recipes in production workflows

What we need to see:

M.S. degree or equivalent experience in Computer Science or a related field, and/or a proven track record. PhD preferred

6+ years of relevant work experience

Strong programming skills and ability to debug ML systems

Experience with PyTorch

Proficient in the math of machine learning

Strong written and oral communication skills

Ways to stand out from the crowd:

Background with mixed-precision training flows

Experience working on efficient LLMs

Experience with performance optimization of training and inference systems

GPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them.

We integrate and optimize every deep learning framework. We work with the major systems companies and every major cloud service provider to make GPUs available in data centers and in the cloud.

We craft computers and software to bring AI to edge devices, such as self-driving cars and autonomous robots. AI has the potential to spur a wave of social progress unmatched since the industrial revolution.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most experienced and hard-working people in the world working for us.

Are you creative and collaborative? Do you love the challenge of defining and leading an industry? If so, we want to hear from you!

The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

26 days ago