Talent.com
Tech Lead, AI Compute Infrastructure
Tech Lead, AI Compute InfrastructureHeyGen • Los Angeles, CA, United States
Tech Lead, AI Compute Infrastructure

Tech Lead, AI Compute Infrastructure

HeyGen • Los Angeles, CA, United States
30+ days ago
Job type
  • Full-time
Job description

About HeyGen

At HeyGen, our mission is to make visual storytelling accessible to all. Over the last decade, visual content has become the preferred method of information creation, consumption, and retention. But the ability to create such content, in particular videos, continues to be costly and challenging to scale. Our ambition is to build technology that equips more people with the power to reach, captivate, and inspire audiences.

Learn more at www.heygen.com. Visit our Mission and Culture doc here.

We are seeking a seasoned Technical Leader to build and scale the foundational compute infrastructure that powers our state‑of‑the‑art AI models—from multimodal training data pipelines to high‑throughput, low‑latency video generation.

Responsibilities

You will be the core engineer responsible for building the robust, efficient, and scalable platform that enables our research and production teams to rapidly iterate on HeyGen's generative video models. Your contributions will directly impact model performance, developer productivity, and the final quality of every AI‑generated video.

Optimize GPU Utilization : Design and implement mechanisms to aggressively optimize GPU and cluster utilization across thousands of devices for inference, training, data processing and large‑scale deployment of our state‑of‑art video generation models .

Develop Large‑Scale AI Job Framework : Build highly scalable, reliable frameworks for launching and managing massive, heterogeneous compute jobs, including multi‑modal high‑volume data ingestion / processing, distributed model training, and continuous evaluation / benchmarking.

Enhance Observability : Develop world‑class observability, tracing, and visualization tools for our compute cluster to ensure reliability, diagnose performance bottlenecks (e.g., memory, bandwidth, communication).

Accelerate Pipelines : Collaborate closely with AI researchers and AI engineers to integrate innovative acceleration techniques (e.g., custom CUDA kernels, distributed training libraries) into production‑ready, scalable training and inference pipelines.

Infrastructure Management : Champion the adoption and optimization of modern cloud and container technologies ( Kubernetes, Ray ) for elastic, cost‑efficient scaling of our distributed systems.

We are looking for a highly motivated engineer with deep experience operating and optimizing AI infrastructure at scale.

Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.

5+ years of full‑time industry experience in large‑scale MLOps, AI infrastructure, or HPC systems .

Experience with data frameworks and standards like Ray, Apache Spark, LanceDB .

Strong proficiency in Python and a high‑performance language such as C++ for developing core infrastructure components.

Deep understanding and hands‑on experience with modern orchestration and distributed computing frameworks such as Kubernetes and Ray .

Experience with core ML frameworks such as PyTorch, TensorFlow, or JAX .

Preferred Qualifications

Master's or PhD in Computer Science or a related technical field.

Demonstrated Tech Lead experience, driving projects from conceptual design through to production deployment across cross‑functional teams.

Prior experience building infrastructure specifically for Generative AI models (e.g., diffusion models, GANs, or large language models) where cost and latency are critical.

Proven background in building and operating large‑scale data infrastructure (e.g., Ray, Apache Spark) to manage petabytes of multi‑modal data (video, audio, text).

  • Expertise in GPU acceleration and deep familiarity with low‑level compute programming, including CUDA, NCCL , or similar technologies for efficient inter‑GPU communication.

What HeyGen Offers

  • Competitive salary and benefits package.
  • Dynamic and inclusive work environment.
  • Opportunities for professional growth and advancement.
  • Collaborative culture that values innovation and creativity.
  • Access to the latest technologies and tools.
  • HeyGen is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

    #J-18808-Ljbffr

    Create a job alert for this search

    Tech Lead Ai • Los Angeles, CA, United States

    Related jobs
    Manager, Information Technology Enterprise Architecture

    Manager, Information Technology Enterprise Architecture

    L.A. Care Health Plan • Los Angeles, CA, United States
    Full-time
    Manager, Information Technology Enterprise Architecture.Job Category : Management / Executive.Department : IT Enterprise Data Management. Care Health Plan is an independent public agency created by the ...Show more
    Last updated: 19 days ago • Promoted
    Technical Lead

    Technical Lead

    Kaav Inc. • Burbank, CA, United States
    Full-time
    Role Technical Lead for Service Cloud Experience Cloud Customer Community CRM Analytics.Service Cloud Sales Cloud and Experience CloudCustomer functionality. Experienced with CongaApttus products do...Show more
    Last updated: 6 days ago • Promoted
    Head of Data Center Engineering

    Head of Data Center Engineering

    Confidential • Los Angeles, CA, United States
    Full-time
    Head of Data Center Engineering.Forward-thinking AI infrastructure company.Information Technology and Services.The Company is at the forefront of AI infrastructure, and is in need of a Head of Data...Show more
    Last updated: 17 days ago • Promoted
    Director of Data Center Engineering

    Director of Data Center Engineering

    Confidential • Los Angeles, CA, United States
    Full-time
    Director of Data Center Engineering.A leading AI infrastructure company dedicated to innovation.Information Technology and Services. Join us as the Director of Data Center Engineering, where you wil...Show more
    Last updated: 19 days ago • Promoted
    Lead Architect

    Lead Architect

    Computrition, Inc. • Los Angeles, CA, United States
    Full-time
    Lead Software Architect (Decision Maker) - CloudNative.Computrition - Jonas Software (https : / / www.Occasional travel requirements to Bedford MA office for occasional inperson whiteboard sessions.You...Show more
    Last updated: 23 days ago • Promoted
    Technical Lead

    Technical Lead

    Mondo Staffing • Burbank, CA, United States
    Full-time
    Apply now : Technical Lead, location is Hybrid (Burbank, CA).The start date is August 18, 2025 or two weeks from offer for this 1-year contract position with potential extension or conversion.Hybrid...Show more
    Last updated: 30+ days ago • Promoted
    Head of Data and AI Platforms

    Head of Data and AI Platforms

    Confidential • Los Angeles, CA, United States
    Full-time
    Premier technology organization.Information Technology and Services.The Company is in search of a Head of Product for Data and AI Platforms. This senior leadership role is pivotal in driving the str...Show more
    Last updated: 19 days ago • Promoted
    Client Technology Engineering Portfolio Lead

    Client Technology Engineering Portfolio Lead

    EY Studio+ Nederland • Los Angeles, California, USA
    Full-time
    At EY youll have the chance to build a career as unique as you are with the global scale support inclusive culture and technology to become the best version of you. And were counting on your unique ...Show more
    Last updated: 17 days ago • Promoted
    Senior Vice President of Network Architecture Strategy

    Senior Vice President of Network Architecture Strategy

    Confidential • Los Angeles, CA, United States
    Full-time
    Senior Vice President of Network Architecture Strategy.Innovative leader in AI infrastructure solutions.Information Technology and Services. Join us as the Senior Vice President of Network Architect...Show more
    Last updated: 19 days ago • Promoted
    Director of Data and AI Platforms

    Director of Data and AI Platforms

    Confidential • Los Angeles, CA, United States
    Full-time
    Director of Data and AI Platforms.A leading technology organization at the forefront of innovation.Information Technology and Services. We are seeking a dynamic Director of Product for Data and AI P...Show more
    Last updated: 30+ days ago • Promoted
    Information Technology Professional

    Information Technology Professional

    U.S. Navy • La Crescenta, CA, US
    Full-time +1
    To be eligible to enlist in the U.Navy, candidates must be between the ages of 18-34.At any given moment, hundreds of complex networked computer systems are operating in tandem to keep ships and su...Show more
    Last updated: 27 days ago • Promoted
    Lead AI AppSec Engineer

    Lead AI AppSec Engineer

    Capital Group • Los Angeles, CA, United States
    Full-time
    I can succeed as a Lead AI AppSec Engineer at Capital Group".As a LeadAIAppSecEngineer,you will work with application teams to ensure the security of custom andprocuredAI solutions.You'llcollaborat...Show more
    Last updated: 18 days ago • Promoted
    Senior Information Cloud Security Architect

    Senior Information Cloud Security Architect

    First American • Los Angeles, CA, United States
    Full-time
    Who We AreJoin a team that puts its People First! Since 1889, First American (NYSE : FAF) has held an unwavering belief in its people. They are passionate about what they do, and we are equally passi...Show more
    Last updated: 30+ days ago • Promoted
    AI / ML Engineering Lead

    AI / ML Engineering Lead

    RIT Solutions, Inc. • Glendale, CA, United States
    Full-time
    ALL ROLES ARE HYBRID IN THE OFFICE (2-3 DAYS / WEEK) UNLESS REMOTE IS NOTED.Please submit qualified candidates and include FULL LEGAL NAME as it appears on the passport. Iselin, NJ, Charlotte, NC or C...Show more
    Last updated: 30+ days ago • Promoted
    Generative AI Solution Architect

    Generative AI Solution Architect

    aKube, Inc. • Glendale, CA, United States
    Full-time
    Up to$107 / hr on W2 depending on experience (no C2C or 1099 or sub-contract).GC, USC, All valid EADs except OPT, CPT, H1B. AI / ML libraries and frameworks.Experience designing and delivering AI soluti...Show more
    Last updated: 4 days ago • Promoted
    Lead AI Platform Engineer

    Lead AI Platform Engineer

    Elevance Health • Los Angeles, CA, United States
    Full-time
    This role requires associates to be in-office 1 - 2 days per week, fostering collaboration and connectivity, while providing flexibility to support productivity and work-life balance.This approach ...Show more
    Last updated: 6 days ago • Promoted
    Technical Lead

    Technical Lead

    United IT Solutions • Burbank, CA, United States
    Full-time
    Develop PX Application which integrates with a lot of Web Services and is Consumer facing Application.Manage a team with 5 members from onshore and offshore. Provide Oversight to the current PX Team...Show more
    Last updated: 6 days ago • Promoted
    Infor Solution Architect

    Infor Solution Architect

    Phizenix • Los Angeles, CA, United States
    Full-time
    Job Title : Infor Syteline Developer.Location : Los Angeles, CA - Onsite Role.We are looking for a collaborative and solutions-focused. Infor CloudSuite Industrial (Syteline) systems.This role support...Show more
    Last updated: 30+ days ago • Promoted