Talent.com
Staff ML Infrastructure Engineer
Cubiq Recruitment • Santa Clara, CA, United States
21 hours ago
Job type
  • Full-time
Job description

Staff / Lead ML Infrastructure Engineer

San Francisco, CA — Onsite

Salary: Above market average + equity

We are building one of the world’s leading generative video and multimodal AI platforms, and we’re looking for a senior infrastructure engineer to drive the backbone that makes it possible. This role is ideal for an engineer from a top-tier tech company who has built cloud-scale systems, high-performance compute platforms, and battle-tested CI/CD pipelines that support complex ML workloads.

What You’ll Own

  • Core ML Platform Architecture: Design and evolve the infrastructure that supports large-scale generative video and multimodal model training, evaluation, and deployment.
  • High-Throughput Compute Systems: Build and optimize GPU/TPU clusters, distributed training systems, and orchestration layers tailored for video-heavy pipelines.
  • Production Reliability for Generative Models: Create the tooling and services needed to safely push frequent model updates while handling massive compute loads and long-running jobs.
  • End-to-End CI/CD for ML: Lead the development of automated pipelines for model training, validation, artifact management, and production rollout.
  • Multimodal Data Infrastructure: Build systems to ingest, version, transform, and serve large-scale video, audio, and text datasets with high reliability.
  • Internal Developer Experience: Partner with research, product, and applied ML teams to build intuitive internal tooling for experiment tracking, model lineage, and resource scheduling.
  • Technical Leadership: Mentor engineers, set platform standards, and influence long-term architectural direction.

What You’ve Done

  • Experience architecting and operating large-scale infrastructure at a cloud provider, hyperscaler, or leading AI company.
  • Built or owned mission-critical CI/CD systems, high-capacity compute platforms, or data infrastructure supporting ML teams.
  • Deep experience with distributed compute across GPUs/accelerators, Kubernetes, and cloud infrastructure (AWS/GCP/Azure).
  • Strong engineering fundamentals in Python, Go, or equivalent languages.
  • Previous exposure to ML training pipelines—especially systems that handle heavy video, multimodal, or high-dimensional data.
  • Demonstrated ability to lead complex cross-org initiatives and drive technical strategy.

Nice to Have

  • Experience with video processing systems, large-scale media pipelines, or streaming architectures.
  • Familiarity with modern multimodal or video-generation frameworks (PyTorch, JAX, diffusers, custom accelerators).
  • Experience with Ray, Triton, CUDA optimization, or specialized scheduling for ML workloads.
  • Background working in high-growth AI startups or research-focused environments.
  • Familiarity with security and compliance considerations for models that generate or process user content.

Why Join

  • Shape the underlying platform powering one of the most advanced generative video systems in the world.
  • Influence the future of multimodal AI by building infrastructure that directly accelerates research and product breakthroughs.
  • Work closely with experienced founding engineers, researchers, and platform builders from leading tech companies.
  • Highly competitive compensation, meaningful equity, and strong in-person engineering culture in San Francisco.