Talent.com
GPU Performance Engineer
Genmo Inc. • San Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what’s possible in video generation.

We’re seeking a GPU Performance Engineer to squeeze every last FLOP from our H100 infrastructure and optimize our model serving stack to its absolute limits.

The Role

You’ll be our performance optimization expert, using advanced profiling tools to identify bottlenecks and implementing solutions that achieve 5-10x speedups. From writing custom CUDA kernels to eliminating cold start latency, you’ll ensure our infrastructure delivers world-class performance. This role is perfect for someone who gets excited about microsecond optimizations and pushing hardware to its theoretical limits.

Key Responsibilities

  • Profile and optimize GPU workloads using Nsight Systems, nvprof, and custom instrumentation
  • Write high-performance CUDA and Triton kernels for critical model operations
  • Optimize cold start latency from seconds to milliseconds for our serving infrastructure
  • Tune memory access patterns, kernel fusion, and GPU utilization
  • Collaborate with ML engineers to optimize model implementations
  • Debug performance issues across the full stack from application to hardware
  • Implement custom memory pooling and allocation strategies
  • Share optimization techniques and build performance culture across teams

Qualifications

  • Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or related field
  • 5+ years of systems programming experience, including 3+ years focused on GPU optimization
  • Expert proficiency with GPU profiling tools (Nsight Systems, nvprof)
  • Strong CUDA programming skills with production kernel development
  • Deep understanding of GPU architecture (memory hierarchy, SMs, warps)
  • Track record of achieving significant performance improvements (5-10x)
  • Experience with Python and C++ in production environments

We Value

  • Experience with Triton kernel development
  • Knowledge of CUTLASS or similar high-performance libraries
  • Background in ML-specific optimizations (attention, transformers)
  • RDMA / InfiniBand optimization experience
  • Contributions to GPU libraries or frameworks
  • Low-level debugging skills (PTX / SASS reading)

Genmo is an Equal Opportunity Employer. Candidates are evaluated without regard to age, race, color, religion, sex, disability, national origin, sexual orientation, veteran status, or any other characteristic protected by federal or state law. Genmo, Inc. is an E-Verify company and you may review the Notice of E-Verify Participation and the Right to Work posters in English and Spanish.

