Talent.com
Machine Learning Infrastructure Engineer

Character.ai • Redwood City, California, United States
9 hours ago
Job type
  • Full-time
Job description

About the role

We’re looking for seasoned ML Infrastructure engineers with experience designing, building and maintaining training and serving infrastructure for ML research.

Responsibilities

Provide infrastructure support to our ML research and product

Build tooling to diagnose cluster issues and hardware failures

Monitor deployments, manage experiments, and generally support our research

Maximize GPU allocation and utilization for both serving and training

Requirements

4+ years of experience supporting the infrastructure within an ML environment

Experience in developing tools used to diagnose ML infrastructure problems and failures

Experience with cloud platforms (e.g., Compute Engine, Kubernetes, Cloud Storage)

Experience working with GPUs

Nice to have

Experience with large GPU clusters and high-performance computing / networking

Experience with supporting large language model training

Experience with ML frameworks like PyTorch / TensorFlow / JAX

Experience with GPU kernel development

About Character.AI

Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20 million people visit Character.AI every month, using our technology to supercharge their creativity and imagination. Our platform lets users engage with tens of millions of characters, enjoy unlimited conversations, and embark on infinite adventures.

In just two years, we achieved unicorn status and were honored as Google Play's AI App of the Year—a testament to our innovative technology and visionary approach.

Join us and be a part of establishing this new entertainment paradigm while shaping the future of Consumer AI!

At Character, we value diversity and welcome applicants from all backgrounds. As an equal opportunity employer, we do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, or disability. Your unique perspectives are vital to our success.
