Talent.com
Machine Learning Infrastructure Engineer

Character.AI
San Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

About The Role

We’re looking for seasoned ML Infrastructure engineers with experience designing, building and maintaining training and serving infrastructure for ML research.

Responsibilities:

  • Provide infrastructure support to our ML research and product
  • Build tooling to diagnose cluster issues and hardware failures
  • Monitor deployments, manage experiments, and generally support our research
  • Maximize GPU allocation and utilization for both serving and training

Requirements:

  • 4+ years of experience supporting the infrastructure within an ML environment
  • Experience in developing tools used to diagnose ML infrastructure problems and failures
  • Experience with cloud platforms (e.g., Compute Engine, Kubernetes, Cloud Storage)
  • Experience working with GPUs

Nice to have:

  • Experience with large GPU clusters and high-performance computing / networking
  • Experience with supporting large language model training
  • Experience with ML frameworks such as PyTorch, TensorFlow, or JAX
  • Experience with GPU kernel development

About Character.AI

    Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20 million people visit Character.AI every month, using our technology to supercharge their creativity and imagination. Our platform lets users engage with tens of millions of characters, enjoy unlimited conversations, and embark on infinite adventures.

    In just two years, we achieved unicorn status and were honored as Google Play's AI App of the Year—a testament to our innovative technology and visionary approach.

    Join us and be a part of establishing this new entertainment paradigm while shaping the future of Consumer AI!

    At Character, we value diversity and welcome applicants from all backgrounds. As an equal opportunity employer, we firmly uphold a non-discrimination policy based on race, religion, national origin, gender, sexual orientation, age, veteran status, or disability. Your unique perspectives are vital to our success.

    Seniority level: Not Applicable

    Employment type: Full-time

    Job function: Engineering and Information Technology

    Industries: Software Development
