Talent.com
Senior Solution Architect, HPC and AI - NVIS
NVIDIA Corporation • Santa Clara, CA, United States
30+ days ago
Job type
  • Full-time
Job description

NVIDIA is the world leader in computer graphics, artificial intelligence, and accelerated computing. For over 25 years, we have been at the forefront of research and engineering around the greatest advances in technology. Our history of innovation drives us to solve the world's hardest problems.

NVIDIA is looking for a Senior HPC / AI Solutions Architect to join its NVIDIA Infrastructure Specialists Team. Academic and commercial groups around the world are using NVIDIA products to revolutionize deep learning and data analytics, and to power data centers. Join the team building many of the largest and fastest AI / HPC systems in the world! We are looking for someone who can work on a dynamic, customer-focused team that requires excellent interpersonal skills. In this role you will interact with customers, partners and internal teams to analyze, define and implement large-scale AI / HPC projects. The scope of these efforts includes a combination of Networking, System Design and Automation, as well as being the face to the customer!

What You’ll Be Doing:
  • Primary responsibilities will include building robust AI / HPC infrastructure for new and existing customers.
  • Support operational and reliability aspects of large-scale AI clusters, focusing on performance at scale, training stability, real-time monitoring, logging, and alerting.
  • Engage in and improve the whole lifecycle of services from inception and design through deployment, operation, and refinement.
  • Your primary focus would be on understanding the AI workload and how it interacts with other parts of the system like networking, storage, deep learning frameworks, data cleaning tools, etc.
  • Help maintain services once they are live by measuring and monitoring progress of AI jobs and helping engineering design solutions for more robust training at scale.
  • Provide feedback to internal teams such as opening bugs, documenting workarounds, and suggesting improvements.
What We Need to See:
  • BS / MS / PhD or equivalent experience in Computer Science, Data Science, Electrical / Computer Engineering, Physics, Mathematics, or other Engineering fields, with at least 8 years of work or research experience with Python / C++ / other software development.
  • Track record of medium to large scale AI training and understanding of key libraries used for NLP / LLM / VLA training (NeMo Framework, DeepSpeed etc.)
  • Experience with integration and deployment of software products in production enterprise environments, and microservices software architecture.
  • You are excited to work with multiple levels and teams across organisations (Engineering, Product, Sales and Marketing teams). Capable of working in a constantly evolving environment without losing focus. Ability to multitask in a fast-paced environment.
  • Driven with strong analytical and problem-solving skills. Strong time-management and organization skills for coordinating multiple initiatives, priorities and implementations of new technology and products into very sophisticated projects.
  • You are a self-starter with a growth mindset and a passion for continuous learning and sharing findings across the team.
  • Technical leadership and strong understanding of NVIDIA technologies, and success in working with customers.
  • Excellent verbal and written communication and technical presentation skills in English.
Ways to Stand Out from The Crowd:
  • Experience working with large transformer-based architectures for NLP, CV, ASR or other domains. Experience running large-scale distributed DL training.
  • Understanding of HPC systems: data center design, high-speed interconnects (InfiniBand), cluster storage, and scheduling-related design and / or management experience.
  • Proven experience with one or more Tier-1 Clouds (AWS, Azure, GCP or OCI) and cloud-native architectures and software.
  • Expertise with parallel filesystems (e.g. Lustre, GPFS, BeeGFS, WekaIO) and high-speed interconnects (InfiniBand, Omni-Path, and GigE).
  • Strong coding and debugging skills, and demonstrated expertise in one or more of the following areas: Machine Learning, Deep Learning, Slurm, Docker / Kubernetes, Singularity, MPI, MLOps, LLMOps, Ansible, Terraform, and other high-performance AI cluster solutions.
  • Technical leadership and strong understanding of NVIDIA technologies including DGX Cloud, NVIDIA AI Enterprise software, Base Command Manager, NeMo and NVIDIA Inference Microservices. Success in working with customers using NVIDIA technologies.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking individuals in the world working for us. If you're creative and autonomous, we want to hear from you.

The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity.

NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
