Talent.com
Distinguished Engineer – Data Center System Software Architect

Distinguished Engineer – Data Center System Software Architect

NVIDIASanta Clara, CA, US
9 hours ago
Job type
  • Full-time
Job description

NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We’re looking for a strong technical architect to own the end-to-end architecture of these products, at the system software level. Including firmware, kernel drivers, operating systems, and user mode drivers. You will work with component leads internally and engage with industry leading cloud service providers on taking these products to market. What you’ll be doing : Serve as the primary technical point of contact for major customers, leading technological discussions, defining KPIs, gathering requirements, and addressing complex technical queries. As a system software architect, lead technical innovation and strategic collaborations with major hyperscalers to architect next-generation data center products. Align NVIDIA's roadmap with major customers' requirements through direct engagement. Develop and drive adoption of new technologies and protocols. Make critical technical decisions in ambiguous situations, mitigating risks through left-shift strategies. What we need to see : Deep expertise in scalable and performant server system architecture, focusing on SW / HW interfaces. Extensive experience with complex system software for accelerators (GPUs, DPUs, FPGAs). Mastery of system firmware (SBIOS, OpenBMC), embedded systems, and Linux kernel internals. Proficiency in Out-of-Band and In-Band management architectures, device management protocols (e.g., MCTP, PLDM, SPDM, RDE) and system management protocols (Redfish, IPMI). Extensive knowledge of networking technologies and protocols, including TCP / IP, Ethernet, InfiniBand, as well as advanced switching and routing concepts Experience collaborating with platform security experts to define tradeoffs between security and ease of use. Demonstrated success in leading complex, cross-functional projects to completion, showcasing the ability to influence and achieve results without direct authority in large-scale, collaborative environments. Demonstrable experience in implementing left shift strategy to de-risk program execution. BS or MS degree in Computer Science, Electrical Engineering or related field (or equivalent experience). 20 years in the area of System architecture and design. Ways to stand out from the crowd : Knowledge of cloud and cluster level deployment and management systems. Participation and contributions in standards bodies such as OCP and DMTF. Familiarity with NVIDIA HPC programming models and libraries (CUDA, cuDNN, DOCA) Knowledge of enterprise storage architectures and distributed parallel processing paradigms NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative, passionate and self-motivated, we want to hear from you! NVIDIA’s invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company and establish teams with the most thoughtful people in the world. Are you ready to change the next generation of computing? Join us at the forefront of technological advancement. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 308,000 USD - 471,500 USD. You will also be eligible for equity and benefits . Applications for this job will be accepted at least until September 30, 2025. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Create a job alert for this search

Data Center Engineer • Santa Clara, CA, US

Related jobs
  • Promoted
  • New!
System Architect

System Architect

NVIDIASanta Clara, CA, Santa Clara County, CA; California, United States
Full-time
The product development team is seeking an experienced Systems Architect to drive and support our engineering efforts.Our team takes pride in building a wide range of products — GPU PCIe cards, SHI...Show moreLast updated: less than 1 hour ago
  • Promoted
Senior System Design Engineer

Senior System Design Engineer

NVIDIA CorporationSanta Clara, CA, United States
Full-time
Senior System Design Engineer (Finance).NVIDIA has continuously reinvented itself over two decades.Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern comp...Show moreLast updated: 30+ days ago
  • Promoted
Enterprise Account Manager (AI / Data Center Infrastructure)

Enterprise Account Manager (AI / Data Center Infrastructure)

WESCO InternationalFremont, CA, United States
Full-time
We're building a high-impact sales team at the forefront of emerging technologies and we are looking for a true hunter to join us! As an Enterprise Account Manager, you will establish, develop, and...Show moreLast updated: 4 days ago
  • Promoted
Distinguished Engineer – Data Center System Software Architect

Distinguished Engineer – Data Center System Software Architect

NVIDIASanta Clara, CA, United States
Full-time
NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, ...Show moreLast updated: 30+ days ago
  • Promoted
Staff System Architect, Next-Gen Experiences, Google Pixel

Staff System Architect, Next-Gen Experiences, Google Pixel

Google Inc.Mountain View, CA, United States
Full-time
Staff System Architect, Next-Gen Experiences, Google Pixel.Bachelor's degree in Electrical Engineering, Computer Engineering, Physics, a related field, or equivalent practical experience.PhD in Ele...Show moreLast updated: 30+ days ago
  • Promoted
Distinguished Engineer - Data Center System Software Architect

Distinguished Engineer - Data Center System Software Architect

NvidiaSanta Clara, California, US
Full-time
NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, ...Show moreLast updated: 30+ days ago
  • Promoted
Senior Software Engineer, Data Center Systems Design Engineer

Senior Software Engineer, Data Center Systems Design Engineer

Apple Inc.Cupertino, CA, United States
Full-time
Senior Software Engineer, Data Center Systems Design Engineer.Cupertino, California, United States Software and Services. The cloudOS team is responsible for all facets of delivering OS and system s...Show moreLast updated: 3 days ago
  • Promoted
Field Applications Engineer (FAE) - Data Center Interconnect Solutions

Field Applications Engineer (FAE) - Data Center Interconnect Solutions

Macpower Digital Assets EdgeSanta Clara, CA, United States
Full-time
Electrical / Mechanical Engineering or Computer Science.Creo, SolidWorks, Microsoft Office 365.Commercial Products (power, memory, storage, IO).Show moreLast updated: 30+ days ago
  • Promoted
Software Engineer - Distributed Data Systems

Software Engineer - Distributed Data Systems

xAIPalo Alto, CA, US
Full-time
AI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering exc...Show moreLast updated: 2 days ago
  • Promoted
Principal Data Center Solutions Architect

Principal Data Center Solutions Architect

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 4 days ago
  • Promoted
Software Engineer, Distributed Data Systems

Software Engineer, Distributed Data Systems

OpenAISan Francisco, CA, United States
Full-time
Software Engineer, Distributed Data Systems.The Sora team is pioneering multimodal capabilities for OpenAI’s foundation models. We’re a hybrid research and product team focused on integrating multim...Show moreLast updated: 27 days ago
  • Promoted
System Performance Engineer -Nor Cal

System Performance Engineer -Nor Cal

Communication Technology Services, LLCLivermore, CA, United States
Full-time
System Performance Engineer -Nor Cal.Communication Technology Services (CTS).Enterprises, Public Sector and Mobile Network Operators, solving and managing the most complex networking challenges.We ...Show moreLast updated: 4 days ago
  • Promoted
  • New!
Distinguished Engineer - Data Center System SoftwareArchitect

Distinguished Engineer - Data Center System SoftwareArchitect

NVIDIASanta Clara, CA, Santa Clara County, CA; California, United States
Full-time
NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, ...Show moreLast updated: less than 1 hour ago
  • Promoted
DSP System Architect

DSP System Architect

Credo Semiconductor, Inc.San Jose, CA, United States
Full-time
We are seeking a DSP System Architect to optimize digital signal processing (DSP) architectures for high-speed SerDes (Serializer / Deserializer) transceivers. This role defines system-level architect...Show moreLast updated: 9 days ago
  • Promoted
Software Engineer, Distributed Systems

Software Engineer, Distributed Systems

OpenAISan Francisco, CA, United States
Full-time
The Compute Runtime team builds the low level framework components to power our ML training systems.We work on building robust, scalable, high performance components to support our distributed trai...Show moreLast updated: 30+ days ago
  • Promoted
Distinguished Engineer - Data Center System Software Architect

Distinguished Engineer - Data Center System Software Architect

NVIDIASanta Clara, CA, US
Full-time
NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, ...Show moreLast updated: 30+ days ago
  • Promoted
System Architect

System Architect

NvidiaSanta Clara, California, US
Full-time
The product development team is seeking an experienced Systems Architect to drive and support our engineering efforts.Our team takes pride in building a wide range of products - GPU PCIe cards, SHI...Show moreLast updated: 30+ days ago
  • Promoted
Contact Center - Consulting Solution Engineer

Contact Center - Consulting Solution Engineer

ZoomSan Jose, CA, United States
Full-time
What you can expectZoom is looking to grow its global Contact Center Solution Engineering team.Do youhave a passion for working with customers? Do you have experience working in theContact Center i...Show moreLast updated: 3 days ago
  • Promoted
Data Center Controls Engineer

Data Center Controls Engineer

OpenAISan Francisco, CA, United States
Full-time
OpenAI is on a bold mission to build the world’s most advanced AI infrastructure.We are seeking a Data Center Controls Engineer with 10+ years of experience designing and implementing advanced SCAD...Show moreLast updated: 14 days ago
  • Promoted
Structural Engineer - Data Center Design

Structural Engineer - Data Center Design

OpenAISan Francisco, CA, United States
Full-time
OpenAI is seeking a Data Center Structural Engineer with over 10 years of experience to join our team.This role is crucial for our mission to build advanced AI infrastructure, focusing on designing...Show moreLast updated: 13 days ago