Talent.com
Senior Software Engineer, Cloud Functions
Senior Software Engineer, Cloud FunctionsNVIDIA • Santa Clara, CA, United States
Senior Software Engineer, Cloud Functions

Senior Software Engineer, Cloud Functions

NVIDIA • Santa Clara, CA, United States
1 day ago
Job type
  • Full-time
Job description

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence.

The team delivers NVIDIA Mission Control Software that runs on superpods. The software we develop is shipped as an autonomous hardware recovery engine and is responsible for baseline validation tests, taking remedial actions (break / fix workflows), and periodic health checks for hardware components. We are looking for a Senior Software Engineer with experience in building highly scalable and robust enterprise software to join us. We are building and improving a powerful platform that will automate the diagnosis and repair of a cluster of GPUs or CPUs across public clouds, private clouds, and virtual and physical hardware.

What you'll be doing :

Designing and implementing scalable and reliable software components to enable the core platform to maintain an inventory of resources, including hosts, GPUs, and switches; to automate actions to diagnose failures, and to repair

Enabling Agentic AI within the core platform to create remedial workflows

Influencing the product roadmap in collaboration with teams across various departments with the goal of reducing SRE toil and improving hardware utilization

Collaborating with various organizations across Nvidia to drive adoption of the platform in order to improve GPU utilization

Defining and running benchmarks for various subsystems

Leading and delivering high-impact projects with high quality, performance, and stability with the lowest resource consumption

Developing a robust feedback control system that analyzes signals about system health and automatically runs commands to fix discovered issues

Programming in modern languages like Go and Rust

What we need to see :

Bachelor's or Master's degree in Computer Science, Engineering, or a related field (or equivalent experience)

Keen interest in driving Agent AI projects

10 years of equivalent experience

Demonstrated ability in building scalable and robust distributed systems

Proven record of product rollouts and collaborating with early adopters

Proficiency in programming in C / C++, Java, Rust or Go.

Technical stewardship of projects across the organization

Ways to stand out from the crowd :

Deep understanding of multi-threading and distributed systems concepts

Excellent track record of delivering projects

Expertise in optimizing SQL queries

Expert-level knowledge of Go / Rust programming

With competitive salaries and a generous benefits package, NVIDIA is widely considered to be one of the technology industry's most desirable employers. We have some of the most forward-thinking and versatile people in the world working with us, and our engineering teams are growing fast in some of the most impactful fields of our generation : Cloud Engineering and Cloud Functions. If you're a creative engineer who enjoys autonomy and shares our passion for technology, we want to hear from you.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and dedicated people in the world working for us. If you're creative and passionate about developing cloud services we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until November 3, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Create a job alert for this search

Software Engineer Cloud • Santa Clara, CA, United States

Related jobs
Senior Software Engineer - Cloud Infrastructure

Senior Software Engineer - Cloud Infrastructure

General Motors • Sunnyvale, CA, United States
Full-time
At General Motors, our product teams are redefining mobility.Through a human-centered design process, we create vehicles and experiences that are designed not just to be seen, but to be felt.We're ...Show more
Last updated: 2 days ago • Promoted
Senior Software Engineer - Cloud Logistics

Senior Software Engineer - Cloud Logistics

Nimble • San Francisco, CA, United States
Full-time
Senior Software Engineer - Cloud Logistics.Join to apply for the Senior Software Engineer - Cloud Logistics role at Nimble. Nimble is a robotics and AI company inventing and scaling autonomous logis...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer - Cloud Infrastructure

Senior Software Engineer - Cloud Infrastructure

WeRide.ai • San Jose, CA, United States
Full-time
Established in 2017, WeRide (NASDAQ : WRD) is a leading global commercial-stage company that develops autonomous driving technologies from Level 2 to Level 4. WeRide is the only tech company in the w...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer, Edge & Cloud Integration

Senior Software Engineer, Edge & Cloud Integration

Auterion • San Francisco, CA, United States
Full-time
Senior Software Engineer – Edge & Cloud Integration is responsible for designing, implementing, and optimizing software that runs on the edge (onboard companion computers and embedded systems) and ...Show more
Last updated: 9 days ago • Promoted
Senior Software Engineer, Cloud Platform

Senior Software Engineer, Cloud Platform

Chef Robotics, Inc. • San Francisco, CA, United States
Full-time
Chef Robotics is on a mission to accelerate the advent of intelligent machines in the physical world.As the rise of LLMs like ChatGPT has shown, AI has the potential to drive immense change.However...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer, Managed Cloud Services

Senior Software Engineer, Managed Cloud Services

Crusoe Energy Systems LLC • San Francisco, CA, United States
Full-time
A leading technology firm in San Francisco is seeking a Software Engineer to design, build, and scale customer-facing platforms. Responsibilities include creating scalable infrastructure, collaborat...Show more
Last updated: 3 hours ago • Promoted • New!
Senior Software Engineer - Cloud Logistics

Senior Software Engineer - Cloud Logistics

Nimble Robotics • San Francisco, CA, United States
Full-time
Nimble is a robotics and AI company inventing and scaling autonomous logistics with intelligent robots to enable fast, efficient, and sustainable commerce. We're developing generalized robot intelli...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer, Cloud Infrastructure

Senior Software Engineer, Cloud Infrastructure

Nuro • Mountain View, CA, United States
Full-time
Senior Software Engineer, Cloud Infrastructure.Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world's most scalable ...Show more
Last updated: 30+ days ago • Promoted
Senior Engineer Cloud Architecture

Senior Engineer Cloud Architecture

Tata Consultancy Services • Fremont, CA, United States
Full-time
Must Have Technical / Functional Skills.Net, C#, Java, AWS Roles & Responsibilities.Software Development : Design, develop, test, and deploy robust, scalable, and secure applications using C#,.Cloud A...Show more
Last updated: 5 days ago • Promoted
Senior Cloud Engineer

Senior Cloud Engineer

University of California, San Francisco • San Francisco, CA, United States
Full-time
University of California, San Francisco.Job Summary : The Senior Cloud Engineer will be accountable for driving the configuration and operation of University of California, San Francisco (UCSF) clou...Show more
Last updated: 9 days ago • Promoted
Senior Software Engineer, Cloud Platform

Senior Software Engineer, Cloud Platform

Verily Life Sciences • Mountain View, CA, United States
Full-time
Verily is a subsidiary of Alphabet that is using a data-driven approach to change the way people manage their health and the way healthcare is delivered. Launched from Google X in 2015, our purpose ...Show more
Last updated: 30+ days ago • Promoted
Senior Cloud Platform Engineer

Senior Cloud Platform Engineer

SambaNova Systems • Palo Alto, CA, United States
Full-time
The era of pervasive AI has arrived.In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, drive efficiency and innovation to fu...Show more
Last updated: 2 days ago • Promoted
Senior Software Engineer, Cloud Services

Senior Software Engineer, Cloud Services

Roku • San Jose, CA, United States
Full-time
Teamwork makes the stream work.Roku is changing how the world watches TV.Roku is the #1 TV streaming platform in the U.Canada, and Mexico, and we've set our sights on powering every television in t...Show more
Last updated: 2 days ago • Promoted
Senior Software Engineer, Cloud Platform

Senior Software Engineer, Cloud Platform

Verily • Mountain View, CA, United States
Full-time
Senior Software Engineer, Cloud Platform page is loaded## Senior Software Engineer, Cloud Platformremote type : Hybridlocations : Mountain View, Californiatime type : Full timeposted on : Posted Yester...Show more
Last updated: 1 day ago • Promoted
Senior+ Software Engineer - Cloud Availability Platform Engineering (Observability)

Senior+ Software Engineer - Cloud Availability Platform Engineering (Observability)

Epoch Biodesign • San Francisco, CA, United States
Full-time
We are looking for a highly skilled engineer with deep expertise in building and operating observability platforms at scale. You will design, develop, and run Crusoe’s next-generation observability ...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer - Apple Cloud Products - iCloud Drive

Senior Software Engineer - Apple Cloud Products - iCloud Drive

Apple • Cupertino, CA, United States
Full-time
There are job opportunities, and then there are career-defining moments.You will join an extraordinary team within the AppleCloud organization. We are a fast-paced , but a high-growth team, where yo...Show more
Last updated: 2 days ago • Promoted
Senior Cloud Architect Engineer

Senior Cloud Architect Engineer

Advanced Micro Devices, Inc. • San Jose, CA, United States
Full-time
WHAT YOU DO AT AMD CHANGES EVERYTHING.At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded syst...Show more
Last updated: 14 days ago • Promoted
Senior+ Software Engineer - Cloud Availability Platform Engineering (Observability)

Senior+ Software Engineer - Cloud Availability Platform Engineering (Observability)

Crusoe Energy Systems LLC • San Francisco, CA, United States
Full-time
We are looking for a highly skilled engineer with deep expertise in building and operating observability platforms at scale. You will design, develop, and run Crusoe’s next-generation observability ...Show more
Last updated: 30+ days ago • Promoted