Talent.com
Senior Software Engineer - AI Research Clusters
Senior Software Engineer - AI Research ClustersNVIDIA • Santa Clara, CA, United States
Senior Software Engineer - AI Research Clusters

Senior Software Engineer - AI Research Clusters

NVIDIA • Santa Clara, CA, United States
6 days ago
Job type
  • Full-time
Job description

NVIDIA is at the forefront of innovations in Artificial Intelligence, High-Performance Computing, and Visualization. Our invention—the GPU—functions as the visual cortex of modern computing and is central to groundbreaking applications from generative AI to autonomous vehicles. We are now looking for a Senior Software Engineer to help accelerate the next era of machine learning innovation.

In this role, you will propose and implement engineering solutions to ensure delivery of functional, reliable, secure, and performance-optimal GPU clusters to internal researchers, enable them to focus on training and development by reducing operational disruption and overhead, empower them for self-service continuous improvement on reliability, operational excellence & performance. Your work will empower scientists and engineers to train, fine-tune, and deploy the most advanced ML models on some of the world’s most powerful GPU systems.

What You'll Be Doing :

In this position, you will work with coworkers across the AI Platform organization to understand the pain points of validating, monitoring and operating GPU clusters at scale. Then you will d esign, develop and maintain engineering solutions to solve those pain points systematically.

You will also research in traditional AIOps and the emerging Agentic AI, and leverage it to further reduce the operation toil.

You will participate in on-call support for systems, platforms built and owned by the team.

What We Need To See :

BS / MS in Computer Science, Engineering, or equivalent experience.

8+ years in software / platform engineering, including 3+ years in ML infrastructure or distributed systems.

Experience in software development lifecycle on Linux-based platforms.

Strong coding skills in languages such as Python, C++ or Rust.

Experience with Docker, Kubernetes, GitLab CI, automated deployments.

Experience with AIOps or Agentic AI and apply it successfully in production environment.

Ways To Stand Out From The Crowd :

Proficiency with full-stack development : Relational Data Modeling, DB optimization, REST API Semantics, Javascript, CSS, providing API as a service.

Passion for building developer-centric platforms with great UX and strong operational reliability.

Experience running Slurm or custom scheduling frameworks in production ML environments.

Familiarity with GPU computing, Linux systems internals, and performance tuning at scale.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits () .

Applications for this job will be accepted at least until November 10, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Create a job alert for this search

Senior Software Engineer Ai • Santa Clara, CA, United States

Related jobs
Senior, Software Engineer - AI

Senior, Software Engineer - AI

Walmart • Sunnyvale, CA, United States
Full-time +1
Develop next generation Last Mile Delivery software applications, including very high capacity, guaranteed availability, and mass market usability without compromising quality.Engage in hands on en...Show more
Last updated: 5 days ago • Promoted
Senior Software Engineer, AI for Quantum

Senior Software Engineer, AI for Quantum

PsiQuantum • Palo Alto, CA, United States
Full-time
Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show more
Last updated: 26 days ago • Promoted
Senior Software Engineer - Generative AI Research & Development - Technology R&D

Senior Software Engineer - Generative AI Research & Development - Technology R&D

American Express • Palo Alto, CA, United States
Full-time
At American Express, our culture is built on a 175-year history of innovation, shared values and Leadership Behaviors, and an unwavering commitment to back our customers, communities, and colleague...Show more
Last updated: 6 days ago • Promoted
Senior Software Engineer, AI Inference Platform

Senior Software Engineer, AI Inference Platform

Cerebras Systems • Sunnyvale, CA, United States
Full-time
Senior Software Engineer, AI Inference Platform.Sunnyvale, CA or Toronto, Canada.Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑scale architecture de...Show more
Last updated: 13 days ago • Promoted
Senior Software Engineer II, Agentic AI Platform

Senior Software Engineer II, Agentic AI Platform

Moveworks.ai • Mountain View, CA, United States
Full-time
Are you up for an exciting challenge? Picture yourself scaling and optimizing a cutting-edge Generative AI product that offers instant assistance to enterprise users. Ever wondered how to apply abst...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer AI Platforms

Senior Software Engineer AI Platforms

Cisco • San Jose, CA, United States
Full-time
The Cisco Security AI team delivers AI products and platform for all Cisco secure products and portfolios so businesses around the world defend against threats and safeguard the most vital aspects ...Show more
Last updated: 30+ days ago • Promoted
Senior Generative AI Software Engineer

Senior Generative AI Software Engineer

NVIDIA Corporation • Santa Clara, CA, United States
Full-time
Senior Generative AI Software Engineer page is loaded## Senior Generative AI Software Engineerlocations : US, CA, Santa Clara : US, CA, Remotetime type : Full timeposted on : Posted Todayjob re...Show more
Last updated: 9 days ago • Promoted
Senior Software Engineer, AI Systems

Senior Software Engineer, AI Systems

HP IQ • Palo Alto, CA, United States
Full-time
HP IQ is HP's new AI innovation lab.Combining startup agility with HP's global scale, we're building intelligent technologies that redefine how the world works, creates, and collaborates.We're asse...Show more
Last updated: 6 days ago • Promoted
Senior Software Engineer, AI Platform

Senior Software Engineer, AI Platform

LinkedIn • Mountain View, CA, United States
Full-time
Get AI-powered advice on this job and more exclusive features.Direct message the job poster from.Senior Technical Recruiter @ | Systems & Infrastructure, Machine Learning.Our vision is to create e...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer AI Platforms

Senior Software Engineer AI Platforms

Cisco Systems, Inc. • San Jose, CA, United States
Full-time
The Cisco Security AI team delivers AI products and platform for all Cisco secure products and portfolios so businesses around the world defend against threats and safeguard the most vital aspects ...Show more
Last updated: 30+ days ago • Promoted
Senior AI Software Engineer

Senior AI Software Engineer

Salesforce • Palo Alto, CA, United States
Full-time
To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Salesforce is the #1 AI CRM, where humans with age...Show more
Last updated: 6 days ago • Promoted
Senior Software Engineer, Agentic AI

Senior Software Engineer, Agentic AI

PayPal • San Jose, CA, United States
Full-time
PayPal has been revolutionizing commerce globally for more than 25 years.Creating innovative experiences that make moving money, selling, and shopping simple, personalized, and secure, PayPal empow...Show more
Last updated: 5 days ago • Promoted
Senior Software Engineer, AI for Quantum

Senior Software Engineer, AI for Quantum

PSI Quantum • Palo Alto, CA, United States
Full-time
Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show more
Last updated: 5 days ago • Promoted
AI Senior Software Engineer

AI Senior Software Engineer

General Motors • Mountain View, CA, United States
Full-time
As an AI Software Engineer on the Enterprise AI team, you will play a critical role in shaping GM's future through high-impact AI systems. You will be responsible for developing AI technologies and ...Show more
Last updated: 6 days ago • Promoted
Senior AI Software Engineer : GenAI & Geospatial

Senior AI Software Engineer : GenAI & Geospatial

Google Inc. • Mountain View, CA, United States
Full-time
A leading tech company in Mountain View is seeking a Senior Software Engineer to develop next-generation technologies related to AI. The role involves working on critical projects, writing and testi...Show more
Last updated: 2 days ago • Promoted
Senior AI Software Engineer

Senior AI Software Engineer

Salesforce.Com Inc • Palo Alto, CA, United States
Full-time
To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Salesforce is the #1 AI CRM, where humans with age...Show more
Last updated: 5 days ago • Promoted
Senior Software Engineer, AI / ML

Senior Software Engineer, AI / ML

Striim • Palo Alto, CA, United States
Full-time
Striim, (pronounced stream with two is for integration and intelligence), is a unified data integration and streaming platform that connects clouds, data, and applications with unprecedented speed ...Show more
Last updated: 5 days ago • Promoted
Senior AI Software Engineer

Senior AI Software Engineer

Thomson Reuters • San Jose, CA, United States
Full-time
Are you driven by the prospect of creating AI-powered software that revolutionizes professional workflows? Join the innovative team at Thomson Reuters, a global leader deeply invested in AI technol...Show more
Last updated: 2 days ago • Promoted