Talent.com
Staff ML Infrastructure Engineer
Staff ML Infrastructure EngineerCubiq Recruitment • Santa Clara, CA, United States
Staff ML Infrastructure Engineer

Staff ML Infrastructure Engineer

Cubiq Recruitment • Santa Clara, CA, United States
1 day ago
Job type
  • Full-time
Job description

Staff / Lead ML Infrastructure Engineer

San Francisco, CA — Onsite

Salary - Over market average + equity

We are building one of the world’s leading generative video and multimodal AI platforms, and we’re looking for a senior infrastructure engineer to drive the backbone that makes it possible. This role is ideal for an engineer from a top-tier tech company who has built cloud-scale systems, high-performance compute platforms, and battle-tested CI / CD pipelines that support complex ML workloads.

What You’ll Own

  • Core ML Platform Architecture : Design and evolve the infrastructure that supports large-scale generative video and multimodal model training, evaluation, and deployment.
  • High-Throughput Compute Systems : Build and optimize GPU / TPU clusters, distributed training systems, and orchestration layers tailored for video-heavy pipelines.
  • Production Reliability for Generative Models : Create the tooling and services needed to safely push frequent model updates while handling massive compute loads and long-running jobs.
  • End-to-End CI / CD for ML : Lead the development of automated pipelines for model training, validation, artifact management, and production rollout.
  • Multimodal Data Infrastructure : Build systems to ingest, version, transform, and serve large-scale video, audio, and text datasets with high reliability.
  • Internal Developer Experience : Partner with research, product, and applied ML teams to build intuitive internal tooling for experiment tracking, model lineage, and resource scheduling.
  • Technical Leadership : Mentor engineers, set platform standards, and influence long-term architectural direction.

What You’ve Done

  • Experience architecting and operating large-scale infrastructure at a cloud provider, hyperscaler, or leading AI company.
  • Built or owned mission-critical CI / CD systems, high-capacity compute platforms, or data infrastructure supporting ML teams.
  • Deep experience with distributed compute across GPUs / accelerators, Kubernetes, and cloud infrastructure (AWS / GCP / Azure).
  • Strong engineering fundamentals in Python, Go, or equivalent languages.
  • Previous exposure to ML training pipelines—especially systems that handle heavy video, multimodal, or high-dimensional data.
  • Demonstrated ability to lead complex cross-org initiatives and drive technical strategy.
  • Nice to Have

  • Experience with video processing systems, large-scale media pipelines, or streaming architectures.
  • Familiarity with modern multimodal or video-generation frameworks (PyTorch, JAX, diffusers, custom accelerators).
  • Experience with Ray, Triton, CUDA optimization, or specialized scheduling for ML workloads.
  • Background working in high-growth AI startups or research-focused environments.
  • Security and compliance considerations for models that generate or process user content.
  • Why Join

  • Shape the underlying platform powering one of the most advanced generative video systems in the world.
  • Influence the future of multimodal AI by building infrastructure that directly accelerates research and product breakthroughs.
  • Work closely with experienced founding engineers, researchers, and platform builders from leading tech companies.
  • Highly competitive compensation, meaningful equity, and strong in-person engineering culture in San Francisco.
  • Create a job alert for this search

    Staff Engineer Infrastructure • Santa Clara, CA, United States

    Related jobs
    Sr. Staff ML Platform Engineer (TLM)

    Sr. Staff ML Platform Engineer (TLM)

    Earnin • Mountain View, California, United States
    Full-time
    As one of the first pioneers of earned wage access, our passion at EarnIn is building products that deliver real-time financial flexibility for those with the unique needs of living paycheck to pay...Show more
    Last updated: 30+ days ago • Promoted
    Sr. Staff Software Engineer Network Infrastructure Observability

    Sr. Staff Software Engineer Network Infrastructure Observability

    LinkedIn • Mountain View, California, USA
    Full-time
    At LinkedIn our approach to flexible work is centered on trust and optimized for culture connection clarity and the evolving needs of our business. The work location of this role is hybrid meaning i...Show more
    Last updated: 6 days ago • Promoted
    Senior ML Platform Engineer : Scale LLM Infrastructure

    Senior ML Platform Engineer : Scale LLM Infrastructure

    GEICO • Palo Alto, CA, United States
    Full-time
    A leading insurance company in California is seeking a Senior ML Platform Engineer to enhance their machine learning infrastructure. This role involves designing scalable systems for Large Language ...Show more
    Last updated: 12 hours ago • Promoted • New!
    Software Infrastructure Engineer (Starlink)

    Software Infrastructure Engineer (Starlink)

    Spacex • Sunnyvale, California, United States
    Full-time +1
    SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technolo...Show more
    Last updated: 18 days ago • Promoted
    RAN Infrastructure Engineer

    RAN Infrastructure Engineer

    Skylo Technologies • Mountain View, California, United States
    Full-time
    Skylo is a global Non-Terrestrial Network service provider based in Mountain View, CA, offering a service that allows smartphone and IoT cellular devices to connect directly over existing satellite...Show more
    Last updated: 30+ days ago • Promoted
    Senior Kubernetes & Infrastructure Engineer

    Senior Kubernetes & Infrastructure Engineer

    Third Wave Automation • Union City, California, United States
    Full-time
    Third Wave Automation is a rapidly growing startup that has demonstrated its core technology components, proven its market fit, and just closed its Series C funding. If you are excited about cutting...Show more
    Last updated: 16 days ago • Promoted
    Staff Infrastructure / DevOps Engineer

    Staff Infrastructure / DevOps Engineer

    Gatik Ai • Mountain View, California, United States
    Full-time
    Gatik, the leader in autonomous middle-mile logistics, is revolutionizing the B2B supply chain with its autonomous transportation-as-a-service (ATaaS) solution and prioritizing safe, consistent del...Show more
    Last updated: 30+ days ago • Promoted
    Staff ML Infrastructure Engineer

    Staff ML Infrastructure Engineer

    Cubiq Recruitment • Hayward, CA, United States
    Full-time
    Staff / Lead ML Infrastructure Engineer.Salary - Over market average + equity.We are building one of the world’s leading generative video and multimodal AI platforms, and we’re looking for a senior...Show more
    Last updated: 7 days ago • Promoted
    Product Infrastructure Engineer - Site Reliability

    Product Infrastructure Engineer - Site Reliability

    Zyphra • Palo Alto, California, United States
    Full-time
    Infrastructure Engineer - Site Reliability.Your work will be essential to ensuring the reliability and reproducibility of ML workloads, the safety and control of deployments, and the long-term main...Show more
    Last updated: 30+ days ago • Promoted
    Staff Cloud Infrastructure Engineer

    Staff Cloud Infrastructure Engineer

    Zscaler • San Jose, California, USA
    Full-time
    Zscaler accelerates digital transformation so our customers can be more agile efficient resilient and secure.Our cloud native Zero Trust Exchange platform protects thousands of customers from cyber...Show more
    Last updated: 23 hours ago • Promoted
    Atlas Infrastructure Engineer

    Atlas Infrastructure Engineer

    Aurora Innovation • Mountain View, California, United States
    Full-time
    Aurora hires talented people with diverse backgrounds who are ready to help build a transportation ecosystem that will make our roads safer, get crucial goods where they need to go, and make mobili...Show more
    Last updated: 17 days ago • Promoted
    Staff Software Engineer, Integrations

    Staff Software Engineer, Integrations

    Ridgeline • San Ramon, California, United States
    Full-time
    Are you a seasoned software engineer with deep expertise in integration technologies? Do you enjoy crafting the building blocks that connect platforms? If you thrive in dynamic environments and lov...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer - ML Performance

    Software Engineer - ML Performance

    Baseten • San Ramon, California, United States
    Full-time
    We’re a growing team of builders backed by top-tier investors, including.ML teams at enterprises and category-defining AI-native companies like. Baseten to power their core production workloads with...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, Ads ML Infrastructure

    Software Engineer, Ads ML Infrastructure

    Tik Tok • San Jose, CA, United States
    Full-time
    About the team The ads system at TikTok operates on a massive scale and serves millions of advertisers, clients and influencers across the world. The quality of the ads system highly depends on the ...Show more
    Last updated: 30+ days ago • Promoted
    Staff ML Engineer : Build & Deploy Scalable ML Pipelines

    Staff ML Engineer : Build & Deploy Scalable ML Pipelines

    Intuit Inc. • Mountain View, CA, United States
    Full-time
    A leading company in financial software is seeking a Staff Machine Learning Engineer to join their innovative team.In this role, you'll collaborate with highly skilled AI scientists and engineers t...Show more
    Last updated: 3 days ago • Promoted
    Machine Learning Infrastructure Engineer

    Machine Learning Infrastructure Engineer

    Institute Of Foundation Models • Sunnyvale, California, United States
    Full-time
    About the Institute of Foundation Models.We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next...Show more
    Last updated: 30+ days ago • Promoted
    Staff ML Engineer - Infrastructure

    Staff ML Engineer - Infrastructure

    ChipStack • San Jose, California, United States
    Full-time
    Chips are at the center of today's tech-driven world.But how we design them has not changed in decades, while their complexity and specialization have skyrocketed due to increasing performance dema...Show more
    Last updated: 30+ days ago • Promoted
    ML Infrastructure Engineer — Scale Generative Models

    ML Infrastructure Engineer — Scale Generative Models

    Apple Inc. • Cupertino, CA, United States
    Full-time
    A leading technology company in Cupertino, California, is seeking a ML Infrastructure Engineer to design and optimize the systems that power large-scale model training. The ideal candidate will have...Show more
    Last updated: 2 days ago • Promoted