Talent.com
No longer accepting applications
Distinguished Engineer - Data Center System SoftwareArchitect

Distinguished Engineer - Data Center System SoftwareArchitect

NVIDIASanta Clara, CA, Santa Clara County, CA; California, United States
8 days ago
Job type
  • Full-time
Job description

NVIDIA data center systems, such as DGX and

HGX, have become core to NVIDIA's rapidly growing

enterprise and cloud provider businesses. These platforms bring

together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA

InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized

NVIDIA AI and HPC software stack. We’re looking for a strong

technical architect to own the end-to-end architecture of these

products, at the system software level.

Including firmware, kernel drivers, operating systems, and user

mode drivers. You will work with component leads internally and

engage with industry leading cloud service providers on taking

these products to market.

What you’ll be

doing : Serve as

the primary technical point of contact for major customers, leading

technological discussions, defining KPIs, gathering requirements,

and addressing complex technical queries.

As a system software

architect, lead technical innovation and strategic collaborations

with major hyperscalers to architect next-generation data center

products.

Align

NVIDIA's roadmap with major customers' requirements

through direct engagement.

Develop and drive adoption of new technologies and protocols.

Make critical technical

decisions in ambiguous situations, mitigating risks through

left-shift strategies.

What we need to see :

Deep expertise in

scalable and performant server system architecture, focusing on

SW / HW interfaces.

Extensive experience with complex system software for accelerators

(GPUs, DPUs, FPGAs).

Mastery of system firmware (SBIOS, OpenBMC), embedded systems, and

Linux kernel internals.

Proficiency in Out-of-Band and In-Band management architectures,

device management protocols (e.g., MCTP, PLDM, SPDM, RDE) and

system management protocols (Redfish, IPMI).

Extensive knowledge of

networking technologies and protocols, including TCP / IP, Ethernet,

InfiniBand, as well as advanced switching and routing concepts

Experience collaborating

with platform security experts to define tradeoffs between security

and ease of use.

Demonstrated success in leading complex, cross-functional projects

to completion, showcasing the ability to influence and achieve

results without direct authority in large-scale, collaborative

environments. Demonstrable experience in implementing left shift

strategy to de-risk program execution.

BS or MS degree in

Computer Science, Electrical Engineering or related field (or

equivalent experience).

20+

years in the area of System architecture and design.

Ways to stand out from the crowd :

Knowledge of

cloud and cluster level deployment and management systems.

Participation and contributions in standards bodies such as OCP and

DMTF.

Familiarity with

NVIDIA HPC programming models and libraries (CUDA, cuDNN,

DOCA)

Knowledge of

enterprise storage architectures and distributed parallel

processing paradigms

NVIDIA

is leading the way in groundbreaking developments in Artificial

Intelligence, High-Performance Computing and Visualization. The

GPU, our invention, serves as the visual cortex of modern computers

and is at the heart of our products and services. We have some of

the most forward-thinking and hardworking people on the planet

working for us. If you're creative, passionate and

self-motivated, we want to hear from

you!

NVIDIA’s invention of

the GPU in 1999 fueled the growth of the PC gaming market,

redefined modern computer graphics, and revolutionized parallel

computing. More recently, GPU deep learning ignited modern deep

learning — the next era of computing — with the GPU acting as the

brain of computers, robots, and self-driving cars that can perceive

and understand the world. Today, we are increasingly known as “the

AI computing company.” We're looking to grow our company

and establish teams with the most thoughtful people in the world.

Are you ready to change the next generation of computing? Join us

at the forefront of technological advancement.

Your base salary will be determined

based on your location, experience, and the pay of employees in

similar positions. The base salary range is 308,000 USD - 471,500

USD.

You will also be

eligible for equity and benefits .

Applications for this job will be accepted at least until September

30, 2025.

NVIDIA is committed to fostering a

diverse work environment and proud to be an equal opportunity

employer. As we highly value diversity in our current and future

employees, we do not discriminate (including in our hiring and

promotion practices) on the basis of race, religion, color,

national origin, gender, gender expression, sexual orientation,

age, marital status, veteran status, disability status or any other

characteristic protected by law.

Create a job alert for this search

Data Center Engineer • Santa Clara, CA, Santa Clara County, CA; California, United States

Related jobs
  • Promoted
Sr. Director, Data Center Solution

Sr. Director, Data Center Solution

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 30+ days ago
  • Promoted
Backend Infrastructure Engineer

Backend Infrastructure Engineer

Strategic Employment Partners (SEP)San Francisco, CA, US
Full-time
Join a stealth-mode startup on a mission to redefine how people shop online.Our client is building a hyper-personalized, AI-powered shopping experience backed by some of the most successful names i...Show moreLast updated: 8 days ago
  • Promoted
System Engineer

System Engineer

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 30+ days ago
  • Promoted
IT Engineer

IT Engineer

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
IT Service Desk Engineer

IT Service Desk Engineer

VirtualVocationsHayward, California, United States
Full-time
A company is looking for an IT Service Desk Engineer to provide technical support for a remote team.Key Responsibilities Serve as the primary point of contact for internal employee technical supp...Show moreLast updated: 9 hours ago
  • Promoted
Senior Software Engineer, Data Center Systems Design Engineer

Senior Software Engineer, Data Center Systems Design Engineer

Apple Inc.Cupertino, CA, United States
Full-time
Senior Software Engineer, Data Center Systems Design Engineer.Cupertino, California, United States Software and Services. The cloudOS team is responsible for all facets of delivering OS and system s...Show moreLast updated: 11 days ago
  • Promoted
Sr. System Engineer

Sr. System Engineer

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a top-tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC, and IoT / Embedded customers...Show moreLast updated: 30+ days ago
  • Promoted
System Architect, Quantum Networking

System Architect, Quantum Networking

PsiQuantumPalo Alto, CA, United States
Full-time
Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
Senior Citrix Engineer

Senior Citrix Engineer

VirtualVocationsFremont, California, United States
Full-time
A company is looking for a Senior Systems Engineer (Citrix).Key Responsibilities Design, deploy, upgrade, and support the enterprise Citrix environment Lead system enhancements and ensure compli...Show moreLast updated: 15 hours ago
  • Promoted
Software Engineer 4

Software Engineer 4

Russell TobinFremont, CA, US
Full-time
Job Title : Software Engineer 4.Job Location : Fremont, CA (Remote).Job Duration : Contract (15Months).We are seeking a full-stack software engineer with expertise in the Dash Enterprise platform to d...Show moreLast updated: 2 days ago
  • Promoted
Software Engineer, Distributed Data Systems

Software Engineer, Distributed Data Systems

OpenAISan Francisco, CA, United States
Full-time
Software Engineer, Distributed Data Systems.The Sora team is pioneering multimodal capabilities for OpenAI’s foundation models. We’re a hybrid research and product team focused on integrating multim...Show moreLast updated: 30+ days ago
  • Promoted
Data Center Architect

Data Center Architect

VirtualVocationsHayward, California, United States
Full-time
A company is looking for a Data Center Architect to lead the design and evolution of software for modern data center operations. Key Responsibilities Architect end-to-end software solutions for da...Show moreLast updated: 30+ days ago
  • Promoted
Systems Engineer - Networks and Architecture

Systems Engineer - Networks and Architecture

WaymoMountain View, CA, United States
Full-time
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...Show moreLast updated: 30+ days ago
  • Promoted
Data Engineer II

Data Engineer II

VirtualVocationsFremont, California, United States
Full-time
A company is looking for a Data Engineer II to enhance data solutions for their insurance data platform.Key Responsibilities Design, create, and maintain data solutions, including data pipelines ...Show moreLast updated: 30+ days ago
  • Promoted
Service Engineer

Service Engineer

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 30+ days ago
  • Promoted
Cloud Infrastructure Engineer

Cloud Infrastructure Engineer

VirtualVocationsFremont, California, United States
Full-time
A company is looking for a Software Engineer, Infrastructure.Key Responsibilities Own and manage core infrastructure across multiple data centers and regions Innovate infrastructure capabilities...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
Senior z / OS Systems Engineer

Senior z / OS Systems Engineer

VirtualVocationsHayward, California, United States
Full-time
A company is looking for a Senior z / OS Systems Engineer.Key Responsibilities Design and implement automated solutions in the z / OS environment Support existing in-house solutions and provide cust...Show moreLast updated: 3 hours ago
  • Promoted
System Debug Engineer

System Debug Engineer

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 30+ days ago
  • Promoted
Platform & Infrastructure Engineer

Platform & Infrastructure Engineer

MindsdbSan Francisco, CA, US
Full-time
Job description ABOUT USMindsDB is a fast-growing AI startup headquartered in San Francisco, California.MindsDB is an AI Analytics solution that connects to diverse data sources and applications th...Show moreLast updated: 4 days ago
  • Promoted
  • New!
Software Engineer (Infrastructure)

Software Engineer (Infrastructure)

VirtualVocationsFremont, California, United States
Full-time
A company is looking for a Software Engineer (Infrastructure).Key Responsibilities Set infrastructure standards to support SaaS products in collaboration with product engineering teams Contribut...Show moreLast updated: 9 hours ago