Talent.com
Senior Software Engineer - AI Research Clusters
NVIDIA • Santa Clara, CA, United States
1 day ago
Job type
  • Full-time
Job description

NVIDIA is at the forefront of innovations in Artificial Intelligence, High-Performance Computing, and Visualization. Our invention—the GPU—functions as the visual cortex of modern computing and is central to groundbreaking applications from generative AI to autonomous vehicles. We are now looking for a Senior Software Engineer to help accelerate the next era of machine learning innovation.

In this role, you will propose and implement engineering solutions that deliver functional, reliable, secure, and performance-optimized GPU clusters to internal researchers. These solutions should let researchers focus on training and development by reducing operational disruption and overhead, and should empower them to continuously improve reliability, operational excellence, and performance through self-service. Your work will empower scientists and engineers to train, fine-tune, and deploy the most advanced ML models on some of the world’s most powerful GPU systems.

What You'll Be Doing:

In this position, you will work with coworkers across the AI Platform organization to understand the pain points of validating, monitoring, and operating GPU clusters at scale. You will then design, develop, and maintain engineering solutions that address those pain points systematically.

You will also research traditional AIOps and emerging agentic AI, and apply both to further reduce operational toil.

You will participate in on-call support for the systems and platforms built and owned by the team.

What We Need To See:

BS / MS in Computer Science, Engineering, or equivalent experience.

8+ years in software / platform engineering, including 3+ years in ML infrastructure or distributed systems.

Experience with the software development lifecycle on Linux-based platforms.

Strong coding skills in languages such as Python, C++ or Rust.

Experience with Docker, Kubernetes, GitLab CI, and automated deployments.

Experience with AIOps or agentic AI, including applying it successfully in a production environment.

Ways To Stand Out From The Crowd:

Proficiency with full-stack development: relational data modeling, database optimization, REST API semantics, JavaScript, CSS, and providing APIs as a service.

Passion for building developer-centric platforms with great UX and strong operational reliability.

Experience running Slurm or custom scheduling frameworks in production ML environments.

Familiarity with GPU computing, Linux systems internals, and performance tuning at scale.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until November 10, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

