Talent.com
Staff ML Engineer - Infrastructure

Staff ML Engineer - Infrastructure

ChipStackSan Jose, California, United States
30+ days ago
Job type
  • Full-time
Job description

About Us

Chips are at the center of today's tech-driven world. But how we design them has not changed in decades, while their complexity and specialization have skyrocketed due to increasing performance demands from applications like AI. We want to change that.

Our team is small, technical, and fast-moving. We’ve built and shipped at the intersection of AI, EDA, and systems software, with deep roots at companies like Qualcomm, Nvidia, Google, Meta, and the Allen Institute for AI. We’re backed by top investors including Khosla Ventures, Cerberus, and Clear Ventures, and already deployed with 10+ innovative customers—from Fortune 100s to cutting-edge AI silicon startups.

About This Role

This role offers a unique opportunity to be part of the founding team at ChipStack, where we are reinventing how modern silicon chips are designed. You will work alongside highly experienced chip designers who have built complex chips, ML scientists who have trained LLMs at scale, and top-notch infrastructure and software engineers. You will get to leverage your experience building ML and data infrastructure and apply it to some of the hardest problems in chip design.

About You

You want to be at a startup because you love to be at the center of all the dynamism that a startup offers.

You are willing to put in the hours and go the extra mile to ensure every customer has an exceptional experience.

You are self-motivated with a sense of urgency and can operate independently without much guidance.

You are not afraid of difficult problems and enjoy venturing into areas you have not explored before.

This Role

We’re looking for a strong, experienced ML Infrastructure Engineer to join our founding team. We are seeking someone with experience designing and scaling ML infrastructure and training pipelines. You’ll be responsible for building the core infrastructure that enables training, fine-tuning, evaluation, and deployment of LLMs across cloud and on-premise environments. Your work will directly impact product capabilities and speed of iteration.

What's needed

5+ years of experience in ML infrastructure or adjacent roles

Deep expertise in Python and experience with training frameworks like PyTorch or TensorFlow

Strong systems engineering skills and experience with distributed training, data pipelines, and performance optimization

Experience deploying ML models to production (REST APIs, batch jobs, streaming pipelines)

Proficiency with cloud platforms (e.g., GCP, AWS) and containerized systems (Docker, Kubernetes)

Experience managing GPU / TPU workloads efficiently

Good communication skills and the ability to work directly with engineers and customers

Prior experience training or fine-tuning LLMs

Experience setting up observability, monitoring, and evaluation pipelines for ML models

What's good to have

Exposure to chip design fundamentals (via coursework or elsewhere)

Experience at an early-stage startup

Our Culture

Challenge status quo : We are innovators who can challenge the status quo and push forward our vision of the world.

Strong opinions, loosely held : We are low on ego, but high on collaboration. We are okay to be wrong and are always open to learning.

Ship fast, ship quality : We ruthlessly prioritize what matters. We build a few things, but at lightning speed with high quality.

Proud of our craft : Attention to detail is in our DNA. We take pride in what we build and ensure they exceed the high standards of the semiconductor industry.

Create a job alert for this search

Staff Infrastructure Engineer • San Jose, California, United States

Related jobs
  • Promoted
Sr. Staff ML Platform Engineer (TLM)

Sr. Staff ML Platform Engineer (TLM)

EarninMountain View, California, United States
Full-time
As one of the first pioneers of earned wage access, our passion at EarnIn is building products that deliver real-time financial flexibility for those with the unique needs of living paycheck to pay...Show moreLast updated: 30+ days ago
  • Promoted
Staff Infrastructure Engineer

Staff Infrastructure Engineer

CrusoeSan Francisco, CA, United States
Full-time
Crusoe's mission is to accelerate the abundance of energy and intelligence.We're crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, spe...Show moreLast updated: 30+ days ago
  • Promoted
Staff Infrastructure / DevOps Engineer

Staff Infrastructure / DevOps Engineer

Gatik AiMountain View, California, United States
Full-time
Gatik, the leader in autonomous middle-mile logistics, is revolutionizing the B2B supply chain with its autonomous transportation-as-a-service (ATaaS) solution and prioritizing safe, consistent del...Show moreLast updated: 30+ days ago
  • Promoted
Staff Infrastructure Engineer

Staff Infrastructure Engineer

Tools for HumanitySan Francisco, CA, United States
Full-time
Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features.World is a network of real humans, built on privacy-preserving proof-of-human technology, and powered ...Show moreLast updated: 30+ days ago
  • Promoted
AI Infrastructure Engineer, Model Serving Platform

AI Infrastructure Engineer, Model Serving Platform

Scale AI, Inc.San Francisco, CA, United States
Full-time
As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting-edge research and product...Show moreLast updated: 30+ days ago
  • Promoted
Senior / Staff Infrastructure Engineer

Senior / Staff Infrastructure Engineer

ApiphanySan Francisco, CA, United States
Full-time
Apiphany is a pioneering foundational AI company for physical product development.We empower global innovators in automotive, aerospace, medtech, and energy to transform mountains of unstructured t...Show moreLast updated: 15 days ago
  • Promoted
Senior / Staff Infrastructure Engineer

Senior / Staff Infrastructure Engineer

Apiphany CorporationSan Francisco, CA, United States
Full-time
Apiphany is a pioneering foundational AI company for physical product development.We empower global innovators in automotive, aerospace, medtech, and energy to transform mountains of unstructured t...Show moreLast updated: 16 days ago
  • Promoted
ML Infrastructure Engineer

ML Infrastructure Engineer

OpenaiSan Francisco, California, United States
Full-time
The Runtime team builds the low level framework components to power our ML training systems.We work on building robust, scalable, high performance components to support our distributed training wor...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
Software Engineer - ML Infrastructure

Software Engineer - ML Infrastructure

SpecterSan Francisco, California, United States
Full-time
Specter is creating a software-defined "control plane" for the physical world.We are starting with protecting American businesses by granting them ubiquitous perception over their physical assets.T...Show moreLast updated: 9 hours ago
  • Promoted
  • New!
Staff ML Platform Engineer, Scalable Infra & Search

Staff ML Platform Engineer, Scalable Infra & Search

Apple Inc.San Francisco, CA, United States
Full-time
A leading technology company seeks a Staff ML Engineer for their Machine Learning Platform Technologies team in San Francisco. The role involves building scalable infrastructures for machine learnin...Show moreLast updated: 19 hours ago
  • Promoted
Staff ML Engineer

Staff ML Engineer

GrindrSan Francisco, CA, United States
Full-time
San Francisco or Palo Alto offices (Palo Alto preferred) and will require you to be in the office on Tuesdays and Thursdays. What’s So Interesting About This Role?.At Grindr, we’re at the dawn of an...Show moreLast updated: 30+ days ago
  • Promoted
ML Infrastructure Engineer

ML Infrastructure Engineer

PhizenixMenlo Park, California, United States
Full-time +1
Menlo Park, CA | On-Site | Full-Time / Direct Hire.Client Opportunity | Through Phizenix.Phizenix, a certified minority and women-led recruiting firm, is hiring on behalf of an AI startup pioneering ...Show moreLast updated: 30+ days ago
  • Promoted
Staff ML Infrastructure Engineer

Staff ML Infrastructure Engineer

Cubiq RecruitmentAlameda, CA, US
Full-time
Staff / Lead ML Infrastructure Engineer.Salary - Over market average + equity.We are building one of the world's leading generative video and multimodal AI platforms, and we're looking for a senior...Show moreLast updated: 2 days ago
  • Promoted
ML Infrastructure Engineer

ML Infrastructure Engineer

Virtue AISan Francisco, CA, United States
Full-time
Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features.Virtue AI is at the forefront of AI security. As enterprises increasingly adopt Large Language Models, ...Show moreLast updated: 30+ days ago
  • Promoted
Staff Infrastructure Engineer

Staff Infrastructure Engineer

WorldSan Francisco, CA, United States
Full-time
World is a network of real humans, built on privacy-preserving proof-of-human technology, and powered by a globally inclusive financial network that enables the free flow of digital assets for all....Show moreLast updated: 5 days ago
  • Promoted
  • New!
Staff Infrastructure Engineer

Staff Infrastructure Engineer

SlopeSan Francisco, CA, United States
Full-time
We're creating the shift to voice as humanity's default interface.Why voice? Because voice captures the nuance, the emotion, and the humanity of interactions in ways text alone can't : voice makes t...Show moreLast updated: 19 hours ago
  • Promoted
ML Infrastructure Engineer, Safeguards

ML Infrastructure Engineer, Safeguards

AnthropicSan Francisco, California, United States
Full-time
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...Show moreLast updated: 30+ days ago
  • Promoted
Software Engineer, ML Infrastructure, Optimization

Software Engineer, ML Infrastructure, Optimization

NuroMountain View, CA, United States
Full-time
Nuro is a self-driving technology company on a mission to make autonomy accessible to all.Founded in 2016, Nuro is building the world's most scalable driver, combining cutting-edge AI with automoti...Show moreLast updated: 1 day ago