Talent.com
Machine Learning Infrastructure Engineer
Machine Learning Infrastructure EngineerCharacter.AI • Menlo Park, CA, United States
Machine Learning Infrastructure Engineer

Machine Learning Infrastructure Engineer

Character.AI • Menlo Park, CA, United States
7 days ago
Job type
  • Full-time
Job description

About the role

We’re looking for seasoned ML Infrastructure engineers with experience designing, building and maintaining training and serving infrastructure for ML research.

Responsibilities

  • Provide infrastructure support to our ML research and product
  • Build tooling to diagnose cluster issues and hardware failures
  • Monitor deployments, manage experiments, and generally support our research
  • Maximize GPU allocation and utilization for both serving and training

Requirements :

  • 4+ years of experience supporting the infrastructure within an ML environment
  • Experience in developing tools used to diagnose ML infrastructure problems and failures
  • Experience with cloud platforms (e.g., Compute Engine, Kubernetes, Cloud Storage)
  • Experience working with GPUs
  • Nice to have

  • Experience with large GPU clusters and high-performance computing / networking
  • Experience with supporting large language model training
  • Experience with ML frameworks like Pytorch / TensorFlow / JAX
  • Experience with GPU kernel development
  • About Character.AI

    Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20 million people visit Character.AI every month, using our technology to supercharge their creativity and imagination. Our platform lets users engage with tens of millions of characters, enjoy unlimited conversations, and embark on infinite adventure.

    In just two years, we achieved unicorn status and were honored as Google Play's AI App of the Year—a testament to our innovative technology and visionary approach.

    Join us and be a part of establishing this new entertainment paradigm while shaping the future of Consumer AI!

    At Character, we value diversity and welcome applicants from all backgrounds. As an equal opportunity employer, we firmly uphold a non-discrimination policy based on race, religion, national origin, gender, sexual orientation, age, veteran status, or disability. Your unique perspectives are vital to our success.

    #J-18808-Ljbffr

    Create a job alert for this search

    Machine Learning Engineer • Menlo Park, CA, United States

    Related jobs
    Machine Learning Engineer

    Machine Learning Engineer

    Harnham • Fremont, CA, US
    Full-time
    STAFF AI / ML ENGINEER – HEALTHTECH.HYBRID – SUNNYVALE, CA (MON–THU ONSITE, FRI WFH).Are you passionate about applying AI to improve real-world health outcomes and quality of life f...Show more
    Last updated: 2 days ago • Promoted
    Defense Machine Learning Engineer - Remote

    Defense Machine Learning Engineer - Remote

    iO Associates • Fremont, California, United States
    Remote
    Full-time
    Job Title : Uncleared Machine Learning Engineer - Remote.Our Client is at the forefront of Intelligent Exploration and Enterprise AI with their cutting-edge AI Platform. Operating in a dynamic indust...Show more
    Last updated: 19 days ago • Promoted
    Machine Learning Infrastructure and Data Engineer

    Machine Learning Infrastructure and Data Engineer

    Apple Inc. • Sunnyvale, CA, United States
    Full-time
    Machine Learning Infrastructure and Data Engineer.Sunnyvale, California, United States Machine Learning and AI.The Video Computer Vision organization is working on exciting technologies for future ...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Infrastructure Engineer

    Machine Learning Infrastructure Engineer

    Abridge • San Francisco, CA, United States
    Full-time
    Abridge was founded in 2018 with the mission of powering deeper understanding in healthcare.Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation eff...Show more
    Last updated: 20 days ago • Promoted
    Machine Learning Infrastructure Engineer

    Machine Learning Infrastructure Engineer

    Greylock Partners • San Francisco, CA, United States
    Full-time
    Machine Learning Infrastructure Engineer — join early B2C investment to help build large-scale ML infrastructure for a cutting-edge AI-first mobile product. Founders have experience building iconic ...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, Machine Learning Infrastructure

    Software Engineer, Machine Learning Infrastructure

    David AI • San Francisco, CA, United States
    Full-time
    David AI is the first audio data research company.We bring an R&D approach to data–developing datasets with the same rigor AI labs bring to models. Our mission is to bring AI into the real world, an...Show more
    Last updated: 3 days ago • Promoted
    ML Infrastructure Engineer

    ML Infrastructure Engineer

    Phizenix • Menlo Park, CA, US
    Full-time +1
    Menlo Park, CA | On-Site | Full-Time / Direct Hire.Client Opportunity | Through Phizenix.Phizenix, a certified minority and women-led recruiting firm, is hiring on behalf of an AI startup pioneering ...Show more
    Last updated: 30+ days ago • Promoted
    AI Infrastructure Engineer, Model Serving Platform

    AI Infrastructure Engineer, Model Serving Platform

    Scale AI, Inc. • San Francisco, CA, United States
    Full-time
    As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting-edge research and product...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer - Training & Infrastructure

    Machine Learning Engineer - Training & Infrastructure

    P-1 AI • San Francisco, CA, United States
    Full-time
    We are building an engineering AGI.We founded P-1 AI with the conviction that the greatest impact of artificial intelligence will be on the built world—helping mankind conquer nature and bend it to...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Infrastructure Engineer

    Machine Learning Infrastructure Engineer

    Ambience Healthcare • San Francisco, CA, United States
    Full-time
    Ambience Healthcare is the leading AI platform for documentation, coding, and clinical workflow, built to reduce administrative burden and protect revenue integrity at the point of care.Trusted by ...Show more
    Last updated: 8 days ago • Promoted
    Machine Learning Engineer, Training Infrastructure

    Machine Learning Engineer, Training Infrastructure

    IntelliPro Group Inc. • San Francisco, CA, US
    Full-time
    Quick Apply
    Machine Learning Engineer, Training Infrastructure Position Type : Full time Location : San Francisco, CA, USA Salary Range : $150,000 - $250, 000 (USD) Job ID# : 158135 Job Description : We are l...Show more
    Last updated: 30+ days ago
    Machine Learning Engineer, Training Infrastructure

    Machine Learning Engineer, Training Infrastructure

    Hedra, Inc • San Francisco, CA, United States
    Full-time
    Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures.We're building Hedra Studio, a multimodal creation platform capable of control, emotion,...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer, Platform Architecture

    Machine Learning Engineer, Platform Architecture

    Apple Inc. • Cupertino, CA, United States
    Full-time
    Machine Learning Engineer, Platform Architecture.Cupertino, California, United States Hardware.At Apple, our Platform Architecture group is responsible for connecting our hardware and software into...Show more
    Last updated: 27 days ago • Promoted
    Machine Learning Engineer II - Engagement Optimization

    Machine Learning Engineer II - Engagement Optimization

    Uber • San Francisco, CA, United States
    Full-time
    Uber has evolved from a simple ride-hailing service to a comprehensive platform that connects users with various on-demand services in their cities. The Uber One Membership team is dedicated to enha...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer, Training Infrastructure

    Machine Learning Engineer, Training Infrastructure

    Ipro Networks Pte. Ltd. • San Francisco, CA, United States
    Full-time
    Job Title : Machine Learning Engineer, Training Infrastructure | Position Type : Full time | Location : San Francisco, CA, USA | Salary Range : $150,000 - $250,000 (USD) | Job ID# : 158135.Design, imple...Show more
    Last updated: 30+ days ago • Promoted
    Staff Machine Learning Infrastructure Engineer

    Staff Machine Learning Infrastructure Engineer

    Finoit Inc • Redwood City, CA, United States
    Full-time
    Quick Apply
    Staff Machine Learning Engineer Location : Redwood City, CA Hybrid 2 days onsite in a weeek Salary $150...Show more
    Last updated: 5 days ago
    Autonomy Engineer - Deep Learning Infrastructure

    Autonomy Engineer - Deep Learning Infrastructure

    Skydio • San Mateo, CA, United States
    Full-time
    Skydio is the leading US drone company and the world leader in autonomous flight, the key technology for the future of drones and aerial mobility. The Skydio team combines deep expertise in artifici...Show more
    Last updated: 2 days ago • Promoted
    Machine Learning Engineer, Training Infrastructure

    Machine Learning Engineer, Training Infrastructure

    Hedra • San Francisco, CA, United States
    Full-time
    Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures.We're building Hedra Studio, a multimodal creation platform capable of control, emotion,...Show more
    Last updated: 30+ days ago • Promoted