Talent.com
Staff Software Engineer - Machine Learning Platform (San Francisco)

Replicate, Inc.
San Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

Replicate makes it easy for software engineers to run and customize machine learning models in the cloud. With a library of thousands of open-source models, you can get started with one line of code—or fine-tune and deploy your own models when you need something custom. We handle the infrastructure, so you can focus on building. Our team comes from places like Docker, GitHub, and NVIDIA, and we’re obsessed with making AI as intuitive as deploying a web app. We build in public, ship fast, and care about getting the details right.

The Platform team at Replicate oversees the entire lifecycle of models, from packaging and deployment to serving, scaling, and monitoring. You’ll be developing the infrastructure that supports thousands of models and powers millions of predictions daily. This is a chance to build something truly innovative, where each decision you make has a tangible impact and allows your creativity to shine.

What you’ll be doing:

  • Designing and building our deployment and model-serving platform.
  • Building technology to operate the latest advancements in the ML and AI space.
  • Designing systems to maximize the utilization and reliability of our Kubernetes clusters and GPUs, including multi-regional traffic shifting and failover capabilities.
  • Owning and optimizing fair and reliable task allocation and queuing across a diverse set of customers with heterogeneous workloads.
  • Working with our Models team to speed up model inference through techniques like caching, weights management, machine configurations, and runtime optimizations in Python and PyTorch.

Technologies you’ll be working with:

  • Python, Go, and Node.js
  • Kubernetes and Terraform
  • Redis, Google BigQuery, and PostgreSQL

We’re looking for the right person, not just someone who checks boxes, but it’s likely you have…

  • Experience building platforms at scale.
  • Experience working in complex systems with many moving parts, and opinions on monoliths vs. services.
  • Experience designing and implementing developer-friendly APIs that enable scalable and reliable integration.
  • Hands-on experience setting up and operating Kubernetes.
  • A passion for building tools that empower developers.
  • Strong communication and collaboration skills, with the ability to understand customer needs and distill complex topics into clear, actionable insights. We believe that programming is about more than writing code; building a platform requires a collaborative approach.
  • At least 10 years of full-time software engineering experience.

These aren’t hard requirements, but we definitely want to talk with you if…

  • You have worked on machine learning platform teams in the past.
  • You have experience working with or on teams that have put ML / AI into production, even though this role does not entail building ML models directly.
  • You have some exposure to serving Generative AI features where GPUs are costly commodities and workloads can take significant time to finish.

You’ll be working from our beautiful office in the Mission, San Francisco, at least 3 days a week.
