Talent.com
GPU Inference Architect for Kubernetes & LLM Deployments
No longer accepting applications
NVIDIA Corporation • Santa Clara, CA, United States
9 days ago
Job type
  • Full-time
Job description

A leading AI technology company in California is seeking a Solutions Architect with expertise in AI inference and Kubernetes. You will collaborate with engineering and customer success teams to deploy scalable GPU-accelerated pipelines for generative AI workloads. The ideal candidate has at least 5 years of experience and proficiency in GPU orchestration. This position offers a competitive salary ranging from $148,000 to $235,750, along with equity options.

Related jobs
AI Inference Architect - Scalable GPU on Kubernetes

NVIDIA • Santa Clara, CA, United States
Full-time
A leading technology company is seeking a Solutions Architect focusing on inference deployments to enhance AI solutions at scale. This role involves collaborating with engineering and DevOps teams, ...
Last updated: 5 hours ago • Promoted • New!
Senior AI GPU Cloud Architect – Enterprise Solutions

GMI Cloud • Mountain View, CA, United States
Full-time
A leading AI infrastructure company is seeking a Senior Solution Architect to design innovative GPU cloud and AI solutions. In this role, you'll engage with enterprise and hyperscaler customers, lea...
Last updated: 5 hours ago • Promoted • New!
GPU / AI Application Platform Architect - San Jose

TikTok • San Jose, CA, United States
Full-time
The Server platform team is responsible for architecting, designing, and building the best server and storage system to meet...
Last updated: 18 days ago • Promoted
Cloud IAM & Cyber Architect — Distinguished Engineer

Capital One National Association • San Jose, CA, United States
Full-time
A leading financial institution in San Jose is seeking a Distinguished Engineer to lead solutions in Cyber and Identity Access Management. This role involves defining architectures, providing mentor...
Last updated: 4 hours ago • Promoted • New!
AI Infrastructure Architect for Large Kubernetes Clusters

CareerArc • San Jose, CA, United States
Full-time
A technology leader in San Jose, CA is seeking a System Solutions Architect to enable large AI inference workloads. The ideal candidate will possess deep knowledge of Kubernetes-based infrastructure...
Last updated: 2 hours ago • Promoted • New!
Lead ASIC Verification Architect for IP Core Development

Synopsys, Inc. • Sunnyvale, CA, United States
Full-time
A leading semiconductor company in Sunnyvale is seeking an experienced ASIC Digital Verification Principal Engineer to architect and implement robust verification environments. The role demands expe...
Last updated: 8 days ago • Promoted
PKI Architect

Inficare • Sunnyvale, CA, United States
Full-time
We are seeking a highly skilled PKI Architect to lead the design, implementation, and management of enterprise Public Key Infrastructure solutions. The ideal candidate will have deep expertise in cr...
Last updated: 22 days ago • Promoted
Android AI ML Engineer - Infrastructure

Focuskpi • Mountain View, California, United States
Temporary
The client is seeking an experienced Android AI / ML Engineer - Infrastructure to develop advanced on-device machine learning systems that enable secure, adapt...
Last updated: 30+ days ago • Promoted
Cloud Architect – AIML

Purple Drive • San Jose, California, USA
Full-time
Design and optimize cloud-based AI / ML data systems. Lead architecture strategy and implementation across full ML pipelines in Azure. Ensure scalability, automation, security, and cross-team collaboratio...
Last updated: 10 days ago • Promoted
MLOps Architect for Scalable, Production-Ready AI

Microsoft • Mountain View, CA, United States
Full-time
A leading tech company in Mountain View is seeking a Machine Learning Operations Engineer to develop and architect robust infrastructure for AI products. This high-impact role involves designing sca...
Last updated: 10 days ago • Promoted
Software Architect - GPU AI Kernel Library

Intel Corporation • Santa Clara, CA, United States
Full-time
Locations: US, Oregon, Hillsboro; US, California, Santa Clara. Time type: Full time. Posted o...
Last updated: 21 days ago • Promoted
Global Solutions Architect — AI & GPU Enablement

NVIDIA Corporation • Santa Clara, CA, United States
Full-time
A leading technology company in California is seeking Solutions Architects to support customers in adopting GPU technologies. You will engage with developers and data scientists, facilitating the in...
Last updated: 6 days ago • Promoted
AWS Cloud & AI Pipeline Architect (Kubernetes, Terraform)

TechDigital Group • San Jose, CA, United States
Full-time
An innovative firm is seeking a skilled engineer with deep knowledge of Terraform and Kubernetes to enhance build processes and develop robust cloud-based solutions. In this role, you will leverage ...
Last updated: 11 days ago • Promoted
GPU / AI Application Platform Engineer Intern (Server Platform) - 2026 Fall (PhD)

TikTok • San Jose, CA, United States
Full-time
TikTok's Server platform team is responsible for architecting, designing, and building the best server and storage system to meet the requirements of high-performance, low cost and easy to operate. ...
Last updated: 8 days ago • Promoted
Cloud Architect

TEKsystems • San Jose, CA, United States
Full-time
TEKsystems is seeking a Cloud Architect for an on-site role in San Jose, CA or Lehi, UT. This is a long-term contract position. Must be authorized to work in the US for any employer. Bachelor's degre...
Last updated: 23 days ago • Promoted
Director, GPU Solutions Architect for AI & HPC Clusters

AMD • San Jose, CA, US
Full-time
A leading technology company is seeking an experienced Director Solutions Architect to enable large clusters for AI & HPC workloads. This role requires a technical expert in datacenter infrastru...
Last updated: 8 hours ago • Promoted • New!
Elite PCIe Verification Architect - Gen4 / 5, On-Site

SiMa.ai • San Jose, CA, United States
Full-time
A leading tech company in San Jose is seeking a Principal Engineer for PCIe Verification. You will be involved in PCIe controller verification and silicon debug. The ideal candidate has over 12 years...
Last updated: 10 days ago • Promoted
Principal DRAM Architect - GPU Memory Solutions

NVIDIA Corporation • Santa Clara, CA, United States
Full-time
NVIDIA is seeking a world-class Principal DRAM Architect to define, drive, and deliver the architecture, roadmap, and implementation of next-generation AI and graphics memory solutions. This role si...
Last updated: 14 days ago • Promoted