Talent.com
No longer accepting applications
Machine Learning Engineer, Training Infrastructure

Machine Learning Engineer, Training Infrastructure

HedraSan Francisco, CA, US
1 day ago
Job type
  • Full-time
Job description

About Hedra

Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures. We're building Hedra Studio, a multimodal creation platform capable of control, emotion, and creative intelligence.

At the core of Hedra Studio is our Character-3 foundation model, the first omnimodal model in production. Character-3 jointly reasons across image, text, and audio for more intelligent video generation — it’s the next evolution of AI-driven content creation.

At Hedra, we’re a team of hard-working, passionate individuals seeking to fundamentally change content creation and build a generational company together. We value startup energy, initiative, and the ability to turn bold ideas into real products. Our team is fully in-person in SF / NY with a shared love for whiteboard problem-solving.

Overview

We are looking for an ML Engineer with 3+ YOE in high-performance computing systems to manage and optimize our computational infrastructure for training and deploying our machine learning models. The ideal candidate has diverse experience managing ML workloads at scale, supporting our 3DVAE and video diffusion models. We encourage you to apply even if you don't meet every requirement — we value curiosity, creativity, and the drive to solve hard problems.

Responsibilities

Design, implement, and maintain scalable computing solutions for training and deploying ML models, ensuring infrastructure can handle large video datasets.

Manage and optimize the performance of our computing clusters or cloud instances, such as AWS or Google Cloud, to support distributed training.

Ensure that our infrastructure can handle the resource-intensive tasks associated with training large generative models.

Monitor system performance and implement improvements to maximize efficiency and utilization , using tools like Airflow for orchestration.

Collaborate across research teams to understand their computational needs and provide appropriate solutions, facilitating seamless model deployment.

Qualifications

Bachelor’s degree in Computer Science, Information Technology, or a related field, with a focus on system administration.

Experience with cloud computing platforms such as Amazon Web Services, Google Cloud, or Microsoft Azure, essential for managing large-scale ML workloads.

Values engineering processes and version control (CI / CD).

Knowledge of containerization technologies like Docker and Kubernetes required for deployments at scale.

Understanding of distributed training techniques and how to scale models across multi-node clusters aligning with video generation needs.

Strong problem-solving and communication skills, given the need to collaborate with diverse teams.

This role is vital for ensuring the computational backbone supports the company’s ML efforts, focusing on deployment and scalability.

Benefits

Competitive compensation + equity

401k (no match)

Healthcare (Silver PPO Medical, Vision, Dental)

Lunch and snacks at the office

#J-18808-Ljbffr

Create a job alert for this search

Machine Learning Engineer • San Francisco, CA, US

Related jobs
  • Promoted
Machine Learning Engineer, Recommendation

Machine Learning Engineer, Recommendation

NewsBreakMountain View, CA, US
Full-time
Founded in 2015, NewsBreak is the Content Intelligence platform shaping the future content economy.With over 40 million monthly active users, our flagship platform delivers highly personalized loca...Show moreLast updated: 1 day ago
  • Promoted
Machine Learning Infrastructure Engineer

Machine Learning Infrastructure Engineer

Ambience Healthcare, Inc.San Francisco, CA, United States
Full-time
Ambience Healthcare is the leading AI platform for documentation, coding, and clinical workflow, built to reduce administrative burden and protect revenue integrity at the point of care.Trusted by ...Show moreLast updated: 30+ days ago
  • Promoted
Machine Learning Infrastructure and Data Engineer

Machine Learning Infrastructure and Data Engineer

Apple Inc.Sunnyvale, CA, United States
Full-time
Machine Learning Infrastructure and Data Engineer.Sunnyvale, California, United States.Want to ship amazing experiences in Apple products? Be part of the team in the Video Computer Vision (VCV) org...Show moreLast updated: 1 day ago
  • Promoted
Machine Learning Engineer - Infra

Machine Learning Engineer - Infra

GenentechSan Francisco, CA, United States
Full-time
It’s what drives us to innovate.To continuously advance science and ensure everyone has access to the healthcare they need today and for generations to come. Creating a world where we all have more ...Show moreLast updated: 21 days ago
  • Promoted
Machine Learning Engineer - Infrastructure

Machine Learning Engineer - Infrastructure

NextdoorSan Francisco, CA, US
Full-time
Nextdoor (NYSE : NXDR) is the essential neighborhood network.Neighbors, public agencies, and businesses use Nextdoor to connect around local information that matters in more than 340,000 neighborhoo...Show moreLast updated: 30+ days ago
  • Promoted
Software Engineer, ML Infrastructure - Training Platform

Software Engineer, ML Infrastructure - Training Platform

Scale AI, Inc.San Francisco, CA, United States
Full-time
Scale is looking for an AI / ML Infrastructure Engineer to join our Machine Learning Infrastructure team to build out our Training Platform. You will partner closely with Machine Learning researchers ...Show moreLast updated: 30+ days ago
  • Promoted
Machine Learning Engineer — Infrastructure

Machine Learning Engineer — Infrastructure

Fundamental Research LabsMenlo Park, CA, United States
Full-time
Machine Learning Infrastructure Engineer.AI : from high-performance inference engines to the underlying agent technologies and large-scale compute clusters that keep everything running.You’ll collab...Show moreLast updated: 12 days ago
  • Promoted
Machine Learning Data Engineer - Systems & Retrieval

Machine Learning Data Engineer - Systems & Retrieval

ZyphraPalo Alto, CA, US
Full-time
Machine Learning Data Engineer - Systems & Retrieval.This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets from ...Show moreLast updated: 30+ days ago
  • Promoted
Machine Learning Engineer

Machine Learning Engineer

JobotSan Francisco, CA, US
Full-time
Entry Level ML Engineer Needed for Growing AI Startup!.This Jobot Job is hosted by : Reed Kellick.Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume.Salary : ...Show moreLast updated: 2 days ago
  • Promoted
Machine Learning Infrastructure Engineer - Diffuse

Machine Learning Infrastructure Engineer - Diffuse

Astera InstituteEmeryville, CA, US
Temporary
The Diffuse Project is dedicated to advancing our understanding of protein motion through the use of diffuse scattering – a signal in X-ray crystallography that is currently under-utilized or...Show moreLast updated: 2 days ago
  • Promoted
Machine Learning Engineer, Relevance

Machine Learning Engineer, Relevance

PatreonSan Francisco, CA, United States
Full-time
Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans ...Show moreLast updated: 23 hours ago
  • Promoted
Machine Learning Engineer

Machine Learning Engineer

VirtualVocationsFremont, California, United States
Full-time
A company is looking for a Machine Learning Engineer specializing in Personalization for a remote position.Key Responsibilities Build and deploy robust ML systems / algorithms for personalized prod...Show moreLast updated: 30+ days ago
Machine Learning Engineer, Training Infrastructure

Machine Learning Engineer, Training Infrastructure

IntelliPro Group Inc.San Francisco, CA, US
Full-time
Quick Apply
Machine Learning Engineer, Training Infrastructure Position Type : Full time Location : San Francisco, CA, USA Salary Range : $150,000 - $250, 000 (USD) Job ID# : 158135 Job Description : We are l...Show moreLast updated: 11 days ago
  • Promoted
Senior Machine Learning Engineer

Senior Machine Learning Engineer

VirtualVocationsHayward, California, United States
Full-time
A company is looking for a Senior Machine Learning Engineer to join their Data Science team.Key Responsibilities Design data architecture for real-time model endpoints and batch solutions Engine...Show moreLast updated: 30+ days ago
  • Promoted
Machine Learning Engineer, Training Infrastructure

Machine Learning Engineer, Training Infrastructure

Hedra, IncSan Francisco, CA, United States
Full-time
Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures.We're building Hedra Studio, a multimodal creation platform capable of control, emotion,...Show moreLast updated: 30+ days ago
  • Promoted
Generative Machine Learning Engineer

Generative Machine Learning Engineer

VirtualVocationsFremont, California, United States
Full-time
A company is looking for a Generative Machine Learning Engineer.Key Responsibilities Develop and implement Deep Learning Pipelines using cutting-edge models Sustain the infrastructure for Deep L...Show moreLast updated: 30+ days ago
  • Promoted
Senior Deep Learning Engineer

Senior Deep Learning Engineer

VirtualVocationsConcord, California, United States
Full-time
A company is looking for a Senior Deep Learning Software Engineer - Autonomous Vehicles.Key Responsibilities Train, fine-tune, optimize, and customize perception DNNs in low precision (FP16 / INT8)...Show moreLast updated: 30+ days ago
  • Promoted
Senior MLOps Engineer

Senior MLOps Engineer

VirtualVocationsFremont, California, United States
Full-time
A company is looking for a Senior MLOps Engineer to design and scale infrastructure for AI research and product development. Key Responsibilities Identify and resolve infrastructure and software b...Show moreLast updated: 30+ days ago
  • Promoted
Machine Learning Engineer, Training Infrastructure

Machine Learning Engineer, Training Infrastructure

Ipro Networks Pte. Ltd.San Francisco, CA, United States
Full-time
Job Title : Machine Learning Engineer, Training Infrastructure | Position Type : Full time | Location : San Francisco, CA, USA | Salary Range : $150,000 - $250,000 (USD) | Job ID# : 158135.Design, imple...Show moreLast updated: 8 days ago
  • Promoted
Machine Learning Engineer, Training Infrastructure

Machine Learning Engineer, Training Infrastructure

HedraSan Francisco, CA, United States
Full-time
Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures.We're building Hedra Studio, a multimodal creation platform capable of control, emotion,...Show moreLast updated: 30+ days ago