Talent.com
Machine Learning Engineer, Training Infrastructure
Machine Learning Engineer, Training InfrastructureHedra, Inc • San Francisco, CA, United States
Machine Learning Engineer, Training Infrastructure

Machine Learning Engineer, Training Infrastructure

Hedra, Inc • San Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

About Hedra

Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures. We're building Hedra Studio, a multimodal creation platform capable of control, emotion, and creative intelligence.

At the core of Hedra Studio is our Character-3 foundation model, the first omnimodal model in production. Character-3 jointly reasons across image, text, and audio for more intelligent video generation — it’s the next evolution of AI-driven content creation.

At Hedra, we’re a team of hard-working, passionate individuals seeking to fundamentally change content creation and build a generational company together. We value startup energy, initiative, and the ability to turn bold ideas into real products. Our team is fully in-person in SF / NY with a shared love for whiteboard problem-solving.

Overview

We are looking for an ML Engineer with 3+ YOE in high-performance computing systems to manage and optimize our computational infrastructure for training and deploying our machine learning models. The ideal candidate has diverse experience managing ML workloads at scale, supporting our 3DVAE and video diffusion models. We encourage you to apply even if you don't meet every requirement — we value curiosity, creativity, and the drive to solve hard problems.

Responsibilities

Design, implement, and maintain scalable computing solutions for training and deploying ML models, ensuring infrastructure can handle large video datasets.

Manage and optimize the performance of our computing clusters or cloud instances, such as AWS or Google Cloud, to support distributed training.

Ensure that our infrastructure can handle the resource-intensive tasks associated with training large generative models.

Monitor system performance and implement improvements to maximize efficiency and utilization , using tools like Airflow for orchestration.

Collaborate across research teams to understand their computational needs and provide appropriate solutions, facilitating seamless model deployment.

Qualifications

Bachelor’s degree in Computer Science, Information Technology, or a related field, with a focus on system administration.

Experience with cloud computing platforms such as Amazon Web Services, Google Cloud, or Microsoft Azure, essential for managing large-scale ML workloads.

Values engineering processes and version control (CI / CD).

Knowledge of containerization technologies like Docker and Kubernetes required for deployments at scale.

Understanding of distributed training techniques and how to scale models across multi-node clusters aligning with video generation needs.

Strong problem-solving and communication skills, given the need to collaborate with diverse teams.

This role is vital for ensuring the computational backbone supports the company’s ML efforts, focusing on deployment and scalability.

Benefits

Competitive compensation + equity

401k (no match)

Healthcare (Silver PPO Medical, Vision, Dental)

Lunch and snacks at the office

#J-18808-Ljbffr

Create a job alert for this search

Machine Learning Engineer • San Francisco, CA, United States

Related jobs
Machine Learning Engineer

Machine Learning Engineer

VirtualVocations • Concord, California, United States
Full-time
A company is looking for a Machine Learning Engineer.Key Responsibilities Design, develop, and deploy intelligent systems and multi-agent architectures for autonomous reasoning and planning Buil...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer, GenAI Applied ML

Machine Learning Engineer, GenAI Applied ML

Scale AI, Inc. • San Francisco, CA, United States
Full-time
At Scale AI, our mission is to accelerate the development of AI applications.For 8 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including : g...Show more
Last updated: 30+ days ago • Promoted
Staff Machine Learning Engineer

Staff Machine Learning Engineer

VirtualVocations • Concord, California, United States
Full-time
A company is looking for a Staff Machine Learning Engineer - Wildfire.Key Responsibilities Architect and build advanced ML models to predict vegetation and fuel conditions Design and maintain da...Show more
Last updated: 30+ days ago • Promoted
MLOps Engineer

MLOps Engineer

VirtualVocations • Concord, California, United States
Full-time
A company is looking for an MLOps / ML Platform Engineer.Key Responsibilities Design and operate ML infrastructure for high-throughput model workflows Build scalable pipelines for training and e...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer, Training Infrastructure

Machine Learning Engineer, Training Infrastructure

Hedra, Inc • San Francisco, CA, United States
Full-time
Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures.We're building Hedra Studio, a multimodal creation platform capable of control, emotion,...Show more
Last updated: 30+ days ago • Promoted
Applied Machine Learning Engineer

Applied Machine Learning Engineer

VirtualVocations • Fremont, California, United States
Full-time
A company is looking for an Applied Machine Learning Engineer to design, build, and deploy ML-driven solutions for cybersecurity. Key Responsibilities Develop, train, and deploy machine learning m...Show more
Last updated: 30+ days ago • Promoted
Senior Machine Learning Manager

Senior Machine Learning Manager

VirtualVocations • Fremont, California, United States
Full-time
A company is looking for a Senior Machine Learning Manager, Gen AI & Knowledge AI.Key Responsibilities Lead the vision, design, and execution of LLM-powered AI products, including system architec...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer (NLP)

Machine Learning Engineer (NLP)

VirtualVocations • Concord, California, United States
Full-time
A company is looking for a Staff Software Engineer specializing in Natural Language Processing (NLP).Key Responsibilities Lead the end-to-end development of machine learning pipelines for entity ...Show more
Last updated: 4 days ago • Promoted
Machine Learning Engineer - Training & Infrastructure

Machine Learning Engineer - Training & Infrastructure

P-1 AI • San Francisco, CA, United States
Full-time
We are building an engineering AGI.We founded P-1 AI with the conviction that the greatest impact of artificial intelligence will be on the built world—helping mankind conquer nature and bend it to...Show more
Last updated: 30+ days ago • Promoted
Missouri Licensed Release Train Engineer

Missouri Licensed Release Train Engineer

VirtualVocations • Concord, California, United States
Full-time
A company is looking for a SAFe Release Train Engineer.Key Responsibilities : Coordinate plans and activities with senior leaders across the organization Drive enterprise-wide Agile adoption and ...Show more
Last updated: 5 days ago • Promoted
Engineering Manager, Machine Learning

Engineering Manager, Machine Learning

VirtualVocations • Fremont, California, United States
Full-time
A company is looking for an Engineering Manager, Machine Learning.Key Responsibilities Lead and mentor a team of data scientists in developing and monitoring ML models at scale Oversee the desig...Show more
Last updated: 2 days ago • Promoted
Remote Machine Learning Engineer

Remote Machine Learning Engineer

VirtualVocations • Fremont, California, United States
Remote
Full-time
A company is looking for a Remote Machine Learning Engineer.Key Responsibilities Design, develop, and architect high-performance AI / ML services and pipelines in a cloud-based environment Contrib...Show more
Last updated: 5 days ago • Promoted
Machine Learning Data Engineer

Machine Learning Data Engineer

VirtualVocations • Concord, California, United States
Full-time
A company is looking for a Machine Learning Data Engineer, Research - 3D Data Focus.Key Responsibilities Ingest, clean, normalize, and preprocess data for machine learning model training pipeline...Show more
Last updated: 2 days ago • Promoted
Lead Machine Learning Engineer

Lead Machine Learning Engineer

VirtualVocations • Oakland, California, United States
Full-time
A company is looking for a Lead Machine Learning Engineer.Key Responsibilities Develop and manage ML infrastructure for data engineering, LLM training, and deployment Architect cloud-native solu...Show more
Last updated: 30+ days ago • Promoted
Senior Machine Learning Engineer

Senior Machine Learning Engineer

VirtualVocations • Concord, California, United States
Full-time
A company is looking for a Senior Machine Learning Engineer, Security.Key Responsibilities Design, build, and deploy machine learning models to detect and mitigate security threats Develop algor...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Ops Engineer

Machine Learning Ops Engineer

VirtualVocations • Fremont, California, United States
Full-time
A company is looking for a Machine Learning Ops Engineer.Key Responsibilities Design and implement AI cost tracking and observability frameworks across Azure, Google Cloud, and Snowflake Collabo...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineering Manager

Machine Learning Engineering Manager

VirtualVocations • Fremont, California, United States
Full-time
A company is looking for a Manager, ML Engineering to lead efforts in Accelerated Machine Learning tools and Analytics Pipelines on GPUs. Key Responsibilities Lead, mentor, and grow the engineerin...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer, Training Infrastructure

Machine Learning Engineer, Training Infrastructure

Hedra • San Francisco, CA, United States
Full-time
Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures.We're building Hedra Studio, a multimodal creation platform capable of control, emotion,...Show more
Last updated: 30+ days ago • Promoted