Talent.com
Software Architect, NIM Factory

NVIDIA Corporation, Santa Clara, CA, United States
28 days ago
Job type
  • Full-time
Job description

Locations: US, CA, Santa Clara; US, SC, Remote
Time type: Full time
Posted on: Posted Yesterday
Job requisition ID: JR2003908

NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a Software Architect to define and own the technical vision for the NVIDIA Inference Microservices (NIM) Factory. You will set the architectural direction for how we build, deploy, and scale enterprise-grade AI services to delight customers, while staying hands-on to guide our most critical implementations.

The scope spans day-0 launches and the follow-through to harden them into enterprise-grade software, ensuring reliability, performance, and security across thousands of GPUs. You will shape our strategy for emerging challenges like disaggregated LLM inference and safeguard the long-term technical health of the platform.

What you'll be doing:
  • Define the end-to-end technical architecture for the NIM Factory, from container build systems and CI/CD to Kubernetes deployment patterns and runtime optimization.
  • Drive technical strategy and roadmap, making high-impact decisions on frameworks, technologies, and standards that empower dozens of engineering teams.
  • Architect and influence the design of workflow orchestration systems that underpin the NIM Factory.
  • Guide and support senior engineers throughout the organization in building a culture centered on technical excellence and innovation.
  • Advocate for guidelines in software development, encompassing API composition, automation, observability, and secure supply chain management.
  • Collaborate with leadership across research, backend, SRE, and product to align technical vision with product goals and influence technical roadmaps.
What we need to see:
  • 15+ years of experience building large-scale, production distributed systems.
  • Consistent track record in a technical leadership or architect role, both setting technical direction and implementing it.
  • Deep architectural expertise in cloud-native technologies, including Kubernetes, containers, and microservices.
  • Exceptional ability to mentor and grow senior engineers, with a passion for raising the technical bar of the entire organization.
  • Proficiency in languages like Python for building tooling and services.
  • Experience architecting solutions for GPU-accelerated or other high-performance computing workloads.
  • Excellent communication and collaboration skills, with the ability to articulate complex technical concepts to diverse audiences and drive consensus.
  • A degree in Computer Science, Computer Engineering, or a related field (BS or MS) or equivalent experience.
Ways to stand out from the crowd:
  • Hands-on with LLM inference stacks (Triton Inference Server, TensorRT-LLM, vLLM).
  • Experience optimizing large-model serving (KV cache sharding/paging, tensor/sequence parallelism, speculative decoding, dynamic batching).
  • Experience architecting next-generation container build systems or CI/CD platforms at scale.
  • Background with workflow orchestration engines (e.g., Temporal, Airflow) for complex, distributed processes.
  • Expertise in designing multi-tenant, multi-cluster, or edge/air-gapped deployment architectures.

With competitive salaries and a generous benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 425,500 USD. You will also be eligible for equity.

Applications for this job will be accepted at least until September 18, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Related jobs
  • Promoted
Technical Architect
Hayes Group Architects Inc., Redwood City, CA, United States
Full-time
You’ll play a pivotal role in turning design concepts into fully resolved, constructible solutions, balancing technical excellence with thoughtful design execution. As a leader in both documentation ...
Last updated: 22 days ago
  • Promoted
Technical Architect (Redwood City)
Hayes Group Architects Inc., Redwood City, CA, US
Part-time
You'll play a pivotal role in turning design concepts into fully resolved, constructible solutions, balancing technical excellence with thoughtful design execution. As a leader in both documentation an...
Last updated: 22 days ago
  • Promoted
Software Build & Release Engineer
Redolent Infotech Pvt. Ltd., San Bruno, CA, United States
Full-time
One of our direct clients is urgently looking for a Software Build & Release Engineer. Title: Software Build & Release Engineer. Bachelor’s in Computer Science / Software Engineering with 10+ years of ex...
Last updated: 30+ days ago
  • Promoted
Software Engineer (AI Performance)
Gimlet Labs, Inc, San Francisco, CA, United States
Full-time
Gimlet Labs is building the foundation for the next generation of AI applications. As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck. Gimlet is redefi...
Last updated: 13 days ago
  • Promoted
  • New!
Principal AI Infrastructure Abstraction Engineer
Cisco Systems, Inc., San Jose, CA, United States
Full-time
This position requires a hybrid working schedule in the San Jose or Milpitas office. We are an innovation team on a mission to transform how enterprises harness AI. Operating with the agility of a s...
Last updated: 2 hours ago
  • Promoted
Senior Technical Architect
Skidmore, Owings & Merrill LLP (SOM), San Francisco, CA, United States
Full-time
At SOM, we are a collective committed to shaping a better future for our clients, communities and planet. We aspire to create the most sustainable, impactful work through creative, interdisciplinary...
Last updated: 30+ days ago
  • Promoted
  • New!
Senior DevOps Engineer (Redwood City) (2025)
Anomali, Redwood City, CA, United States
Full-time
Anomali is headquartered in Silicon Valley and is the Leading AI-Powered Security Operations Platform that is modernizing security operations. At the center of it is an omnipresent, intelligent, and...
Last updated: 2 hours ago
  • Promoted
Principal Implementation Architect
Atlassian, San Francisco, CA, United States
Full-time
Atlassians can choose where they work – in an office, from home, or a combination. Interviews and onboarding are conducted virtually as part of being a distributed-first company. We hire people in...
Last updated: 26 days ago
  • Promoted
Senior Firmware Engineer
Gridware, San Francisco, CA, United States
Full-time
Gridware is a San Francisco-based technology company dedicated to protecting and enhancing the electrical grid. We pioneered a groundbreaking new class of grid management called active grid response...
Last updated: 27 days ago
  • Promoted
Senior Firmware Engineer
Foundation Robotics Labs Inc., San Francisco, CA, United States
Full-time
Our mission is to create advanced robots that can operate in complex environments, reducing human risk in conflict zones and enhancing efficiency in labor-intensive industries. We are on the lookout...
Last updated: 5 days ago
  • Promoted
Senior Firmware Engineer
Roman Health Pharmacy LLC, Los Altos, CA, United States
Full-time
Join Afero and change the world, a gazillion connections at a time! Our vision is to make all the world's devices really smart and truly secure, through innovation and scale. Afero is the leading Pa...
Last updated: 30+ days ago
  • Promoted
Senior Firmware Engineer
Gridware Technologies Inc., San Francisco, CA, United States
Full-time
Gridware is a San Francisco-based technology company dedicated to protecting and enhancing the electrical grid. We pioneered a groundbreaking new class of grid management called active grid response...
Last updated: 24 days ago
  • Promoted
Firmware Engineer
Albin Engineering Services, Inc., San Francisco, CA, United States
Full-time
Albin Engineering Services, Inc. Firmware Engineer for one of our premier clients. This will be an 8-week contract position working a hybrid model reporting onsite in the San Francisco, CA area. This ...
Last updated: 30+ days ago
  • Promoted
Firmware Engineer
Nudge Real Estate, San Francisco, CA, United States
Full-time
Nudge’s goal is to help the brain work better by creating a generalized product that can precisely stimulate and image the brain, entirely non-invasively. We aim to achieve this by developing cuttin...
Last updated: 28 days ago
  • Promoted
Senior Firmware Engineer
NVIDIA Corporation, Santa Clara, CA, United States
Full-time
Locations: US, CA, Santa Clara. Time type: Full time. Posted on: Posted 2 Days Ago. Job requisition ID: JR2005431. We are now looki...
Last updated: 9 days ago
  • Promoted
Firmware Engineer II
El Camino Health, San Francisco, CA, United States
Full-time
Remote type: Hybrid. Locations: San Francisco, CA. Time type: Full time. Posted on: Posted Yesterday. Job requisition ID: JR603. Career-d...
Last updated: 28 days ago
  • Promoted
Senior Firmware Engineer – GPU Networking
NVIDIA Corporation, Santa Clara, CA, United States
Full-time
Locations: US, CA, Santa Clara. Time type: Full time. Posted on: Posted Today. Job requisition id...
Last updated: 1 day ago
  • Promoted
MEP Systems Engineer (Redwood City)
Samara, Redwood City, CA, US
Part-time
Ready to play a key role in building the future of living? Join Samara in tackling California's housing shortage and enabling people to attain sustainable housing without compromising design or qual...
Last updated: 9 days ago
  • Promoted
Senior Firmware Engineer
Phizenix, Santa Clara, CA, US
Full-time
As Senior Firmware Engineer, you will be a key player in the architecture and the full lifecycle development of an AI platform system, including requirements, design, code, and test. In this role, y...
Last updated: 1 day ago
  • Promoted
  • New!
Senior UI / UX Engineer (Redwood City) (2025)
Anomali, Redwood City, CA, United States
Full-time
Anomali is headquartered in Silicon Valley and is the Leading AI-Powered Security Operations Platform that is modernizing security operations. At the center of it is an omnipresent, intelligent, and...
Last updated: 2 hours ago