Talent.com
Senior Software Engineer, AI Infrastructure

NVIDIA • Santa Clara, CA, United States
30+ days ago
Job type
  • Full-time
Job description

NVIDIA DGX Cloud is a managed, multi-cloud AI supercomputing service that provides enterprises with instant access to NVIDIA's high-performance AI infrastructure and software, including dedicated DGX AI supercomputing clusters, optimized software stacks, and expertise. The platform enables users to rapidly build, train, and deploy large-scale AI models across leading cloud providers like Oracle, Azure, and Google Cloud, eliminating the complexity of managing their own infrastructure. Key features include pre-trained and fine-tunable models, serverless GPU inference, and a unified interface for multi-cloud management.

NVIDIA is looking for a passionate engineer to join our DGX Engineering Team as a Senior Software Engineer. In this role, you will play a significant part in helping to craft and guide the future of AI & GPUs in the Cloud. Are you passionate about cloud software development, and do you strive for quality? Do you pride yourself on building cloud-scale software systems? If so, join our team at NVIDIA, where we are dedicated to delivering GPU-powered services around the world!

What you'll be doing:

You will be building RESTful cloud services and virtualization frameworks that come together to form our NVIDIA DGX Cloud Reference Architecture. These services must meet requirements for high security and maximum performance to support extensive AI workloads.

Design, build, and implement scalable cloud-based systems for PaaS / IaaS.

Work closely with other teams on new products or features / improvements of existing products.

Drive performance tuning and automation.

Support, maintain, and document software functionality.

What we need to see:

Expertise in Kubernetes (K8s) & KubeVirt.

Expertise in virtualization technologies such as Firecracker, KVM, OpenStack, Nutanix AHV & Red Hat OpenShift.

Extensive experience with Golang and building RESTful web services.

Demonstrated understanding of cloud design in the areas of virtualization and global infrastructure, distributed systems, and security.

Experience with Docker and Containers.

Background with Infrastructure as Code.

Experience with AWS (Fargate, EC2, IAM, ECR, EKS, Route 53, etc.).

Experience with Continuous Integration and Continuous Delivery.

BS or MS in Computer Science or equivalent experience, with 12+ years of hands-on software engineering.

Excellent interpersonal and written communication skills required.

Ways to stand out from the crowd:

Experience with Postgres.

Exposure to Helm Charts & Terraform.

A track record of solving complex problems with elegant solutions.

Prior experience with Rust & Python, as well as demonstrated delivery of complex projects in previous roles.

Experience with load testing frameworks and secrets management.

With competitive salaries and a generous benefits package, NVIDIA is widely considered to be one of the technology industry's most desirable employers. We have some of the most forward-thinking and versatile people in the world working with us, and our engineering teams are growing fast in some of the most impactful fields of our generation: Cloud Engineering and Cloud Functions.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD for Level 5, and 272,000 USD - 425,500 USD for Level 6.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until September 22, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
