Talent.com
Senior Solutions Architect, HPC and AI

Senior Solutions Architect, HPC and AI

NVIDIASanta Clara, CA, United States
3 days ago
Job type
  • Full-time
Job description

NVIDIA is looking for a Field Escalation Solution Architect with experience in validation and debugging of large-scale GPU clusters focused on performance. As part of the Solution Architecture organization, we work with the most sophisticated computing hardware and software, driving the latest deep learning and machine learning breakthroughs with NVIDIA’s enterprise customers. This role offers an excellent opportunity to build your career in the rapidly growing field of deep learning while enabling the world's most successful technology companies. Primary responsibilities will be to validate and debug customer cluster performance issues, functional bottlenecks and drive customer technical engagements around NVIDIA products and technologies. Join us in this exciting endeavor!

What you’ll be doing :

A considerable part of the day-to-day job is staying up to date on pioneering High Performance Computing, Deep Learning and Machine Learning ecosystems. You'll be called on to help architect and scale high-performance, distributed AI infrastructure on-prem or in the cloud built with the latest NVIDIA GPU supercomputers for new and existing customers.

Address and resolve problems starting from the bare metal level, all the way up to the operating system, software stack, and application level.

Share knowledge with different teams by delivering demos, assisting with proof-of-concepts, and writing papers and developer blogs. By collaborating with executives and engineering, address sophisticated problems and help bring NVIDIA's premiere technologies to life in the cloud and in the datacenter.

Work directly with developers and hardware architects to debug cluster performance issues, identify new requirements, and improve workflows.

Will be engaged by the account team when extra analysis is required in debugging customer issues.

Provide additional expertise to enable the account team to be more adaptable to the customer and product engineering to get more actionable data at speed of light making them more efficient.

Building custom product demonstrations and POCs for solutions that address critical business needs of our customers.

What we need to see :

BS, MS, or PhD in Computer Science, Electrical / Computer Engineering, Physics, Mathematics, or other Engineering fields or equivalent experience.

8+ years of work-related experience in NVIDIA and / or accelerated computing technologies.

Platform level understanding of server architecture, PCIe topology, GPUs, NICs, Linux OS and kernel drivers.

Networking experience, including knowledge of Ethernet, InfiniBand or other networking protocols.

Experience working with DevOps on-prem or in cloud environments, including but not limited to Docker / Containers, cloud APIs, IaaS and Data Center deployments.

SLURM, Kubernetes, and / or other job scheduler use, deployment, and debugging skills.

Deep understanding of dense data center design, including computing, storage, networking, cloud APIs, and IaaS.

Effective time management and capable of balancing multiple tasks.

Strong analytical and problem-solving skills.

Strong communication skills, both written and verbal, with the ability to collaborate and coordinate efficiently across multi-functional teams in engineering, sales, marketing, product, and program management.

Ways to stand out from the crowd :

Demonstrated Communication Collectives (NCCL) experience.

Excellent customer-facing skills and background.

Platform design engineering, coding and proficient debugging skills including experience in C / C++, Linux kernel, virtualization and drivers, profilers / performance analysis tools (NSys).

Familiarity with NVIDIA systems / SDKs (e.g. CUDA), NVIDIA Networking technologies (e.g., RoCE, InfiniBand), Switch interconnects and / or ARM CPU solutions through hands-on experience.

Understanding of Deep Learning and Machine Learning frameworks (TensorFlow or PyTorch), LLM, MLOps, DevOps, and workflows applying cloud technologies, using Docker / containers, Kubernetes, cloud APIs, and data center deployments, among others.

We make extensive use of conferencing tools, but occasional travel (25%) is required for a local on-site visit to customers and data science conferences.

With highly competitive salaries, a comprehensive benefits package, and an excellent engineering culture, NVIDIA is widely considered to be one of the technology industry's most desirable employers. NVIDIA has some of the most innovative people working on significant problems that define the field of ML / DL, data science, and graphics.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits () .

Applications for this job will be accepted at least until September 2, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Create a job alert for this search

Senior Solution Architect • Santa Clara, CA, United States

Related jobs
  • Promoted
Sr. Solution Architect

Sr. Solution Architect

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 30+ days ago
  • Promoted
Senior Solution Architect, HPC and AI - NVIS

Senior Solution Architect, HPC and AI - NVIS

NVIDIASanta Clara, CA, United States
Full-time
Do you want to be part of the team that brings Artificial Intelligence (AI) emerging technology to the field? We are looking for a hardworking Solution Architect (SA) to join the NVIDIA AI Enterpri...Show moreLast updated: 3 days ago
  • Promoted
Senior Solutions Architect - Startups

Senior Solutions Architect - Startups

AmazonSan Francisco, CA, United States
Full-time
Do you like startups? Are you interested in cloud computing? Come join us.Startups are the enterprises of the future.These young companies are founded by ambitious people who have a desire to build...Show moreLast updated: 3 days ago
  • Promoted
Solutions Architect, Applied AI

Solutions Architect, Applied AI

NVIDIASanta Clara, CA, United States
Full-time
Do you want to be part of the team that brings Artificial Intelligence (AI) technology to the field? We are looking for a Solution Architect (SA) or Data Scientist to join the Applied AI SA Segment...Show moreLast updated: 3 days ago
  • Promoted
Expert AI Solutions Architect

Expert AI Solutions Architect

Delta Dental Plans AssociationSan Francisco, CA, United States
Full-time
The Expert Solutions Architect will primarily be responsible for architecting solutions for strategic initiatives and projects that combine new and existing applications, tools and technology to de...Show moreLast updated: 3 days ago
  • Promoted
Senior Solutions Architect, Agentic AI

Senior Solutions Architect, Agentic AI

NVIDIASanta Clara, CA, United States
Full-time
NVIDIA is seeking an outstanding Senior AI Engineer or Solutions Architect to join our growing team focused on partner enablement for Agentic AI. In this role, you will lead by example, acting as bo...Show moreLast updated: 3 days ago
  • Promoted
Senior Solutions Architect - Enterprise AI

Senior Solutions Architect - Enterprise AI

NVIDIASanta Clara, CA, United States
Full-time
We are now looking for a Senior Solution Network Architect, Enterprise Products! Join the NVIDIA Solution Architect to support Enterprise Products team as a Senior Network Architect, where your pas...Show moreLast updated: 3 days ago
  • Promoted
AI Solutions Architect

AI Solutions Architect

Your Personal AISan Francisco, CA, United States
Full-time
We are seeking a talented AI Solutions Architect to join our AI Solutions department at Your Personal AI.As an AI Solutions Architect, you will be responsible for designing and implementing AI solu...Show moreLast updated: 3 days ago
  • Promoted
AI Solution Architect / Senior AI Solution Architect (Post-Sales)

AI Solution Architect / Senior AI Solution Architect (Post-Sales)

C3 AIRedwood City, CA, United States
Full-time
C3 AI (NYSE : AI), is the Enterprise AI application software company.C3 AI delivers a family of fully integrated products including the C3 Agentic AI Platform, an end-to-end platform for developing,...Show moreLast updated: 3 days ago
  • Promoted
Solutions Architect, Generative AI Deployment

Solutions Architect, Generative AI Deployment

OpenAISan Francisco, CA, United States
Full-time
The Solutions Architecture team ensures the safe and effective deployment of Generative AI applications for developers and enterprises. We act as trusted advisors and technical partners to our custo...Show moreLast updated: 3 days ago
  • Promoted
Solutions Architect, Startups

Solutions Architect, Startups

OpenAISan Francisco, CA, United States
Full-time
The Solutions Architecture team is responsible for ensuring the safe and effective deployment of Generative AI applications for developers and enterprises. We act as a trusted advisor and thought pa...Show moreLast updated: 3 days ago
  • Promoted
Solutions Architect, Applied AI

Solutions Architect, Applied AI

AnthropicSan Francisco, CA, United States
Full-time
Anthropic's mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...Show moreLast updated: 3 days ago
  • Promoted
Senior Solution Architect

Senior Solution Architect

Compunnel, Inc.San Francisco, CA, United States
Full-time
We are seeking a Senior Solution Architect to design and implement comprehensive data and analytics solutions using modern technologies such as Databricks, Immuta, Starburst, Collibra, and AWS.This...Show moreLast updated: 1 day ago
  • Promoted
AI / ML Solutions Architect

AI / ML Solutions Architect

JobotSan Jose, CA, US
Full-time
Join one of the fasted growing AI / ML services companies!.This Jobot Job is hosted by : Adam Bennett.Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume.Salary...Show moreLast updated: 2 days ago
  • Promoted
Senior Solutions Architect, Generative AI

Senior Solutions Architect, Generative AI

NVIDIASanta Clara, CA, United States
Full-time
NVIDIA is looking for an AI Solutions Architect with hands-on experience in efficient AI model training and / or deployment for a customer facing role. Primary responsibilities will be to help acceler...Show moreLast updated: 3 days ago
  • Promoted
Senior AI Solutions Architect

Senior AI Solutions Architect

VariteSanta Clara, CA, United States
Full-time
AI technology with enterprise-scale business value.AI workloads across hybrid and multi-cloud environments.You'll partner directly with C-suite executives, engineering teams, and developers at some...Show moreLast updated: 3 days ago
  • Promoted
Solutions Architect, Applied AI

Solutions Architect, Applied AI

Menlo VenturesSan Francisco, CA, United States
Full-time
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...Show moreLast updated: 30+ days ago
  • Promoted
Senior Solutions Architect

Senior Solutions Architect

NVIDIASanta Clara, CA, United States
Full-time
NVIDIA is looking for an experienced network infrastructure Solutions Architect.Do you want to be part of a team that brings Artificial Intelligence (AI) hardware and software technologies to produ...Show moreLast updated: 3 days ago