Talent.com
Machine Learning Performance Engineer - CUDA Python - REMOTE with Travel
Machine Learning Performance Engineer - CUDA Python - REMOTE with TravelSimple Solutions • Jacksonville, FL, us
Machine Learning Performance Engineer - CUDA Python - REMOTE with Travel

Machine Learning Performance Engineer - CUDA Python - REMOTE with Travel

Simple Solutions • Jacksonville, FL, us
23 days ago
Job type
  • Temporary
  • Remote
  • Quick Apply
Job description

Job Description

Job Description :

Machine Learning Performance Engineer - CUDA Python

Duration : 6 month contract with the likelihood to extend

Location : Remote but candidates must be willing to travel to different customer sites.

  • Must be willing to travel
  • Must have strong pre-sales abilities i.e. presentation skills, communication skills, etc.
  • Must be willing to help train WWT employees and customers

We are needing very strong technical CUDA Python ML engineers. They can sit anywhere in the US but must be willing to travel 30% of the time. They will be very client facing so professionalism and presentation skills are very key to this role.

Your part here is optimizing the performance of our models – both training and inference. We care about efficient large-scale training, low-latency inference in real-time systems, and high-throughput inference in research. Part of this is improving straightforward CUDA, but the interesting part needs a whole-systems approach, including storage systems, networking, and host- and GPU-level considerations. Zooming in, we also want to ensure our platform makes sense even at the lowest level – is all that throughput actually goodput? Does loading that vector from the L2 cache really take that long?

  • An understanding of modern ML techniques and toolsets
  • The experience and systems knowledge required to debug a training run’s performance end to end
  • Low-level GPU knowledge of PTX, SASS, warps, cooperative groups, Tensor Cores, and the memory hierarchy
  • Debugging and optimization experience using tools like CUDA GDB, NSight Systems, NSight Compute
  • Library knowledge of Triton, CUTLASS, CUB, Thrust, cuDNN, and cuBLAS
  • Intuition about the latency and throughput characteristics of CUDA graph launch, tensor core arithmetic, warp-level synchronization, and asynchronous memory loads
  • Background in Infiniband, RoCE, GPUDirect, PXN, rail optimization, and NVLink, and how to use these networking technologies to link up GPU clusters
  • An understanding of the collective algorithms supporting distributed GPU training in NCCL or MPI
  • An inventive approach and the willingness to ask hard questions about whether we're taking the right approaches and using the right tools
  • Requirements

    Your part here is optimizing the performance of our models – both training and inference. We care about efficient large-scale training, low-latency inference in real-time systems, and high-throughput inference in research. Part of this is improving straightforward CUDA, but the interesting part needs a whole-systems approach, including storage systems, networking, and host- and GPU-level considerations. Zooming in, we also want to ensure our platform makes sense even at the lowest level – is all that throughput actually goodput? Does loading that vector from the L2 cache really take that long? An understanding of modern ML techniques and toolsets The experience and systems knowledge required to debug a training run’s performance end to end Low-level GPU knowledge of PTX, SASS, warps, cooperative groups, Tensor Cores, and the memory hierarchy Debugging and optimization experience using tools like CUDA GDB, NSight Systems, NSight Compute Library knowledge of Triton, CUTLASS, CUB, Thrust, cuDNN, and cuBLAS Intuition about the latency and throughput characteristics of CUDA graph launch, tensor core arithmetic, warp-level synchronization, and asynchronous memory loads Background in Infiniband, RoCE, GPUDirect, PXN, rail optimization, and NVLink, and how to use these networking technologies to link up GPU clusters An understanding of the collective algorithms supporting distributed GPU training in NCCL or MPI An inventive approach and the willingness to ask hard questions about whether we're taking the right approaches and using the right tools

    Create a job alert for this search

    Machine Learning Engineer • Jacksonville, FL, us

    Related jobs
    Physician (MD / DO) - Anesthesiology - General / Other in Fernandina Beach, FL

    Physician (MD / DO) - Anesthesiology - General / Other in Fernandina Beach, FL

    LocumJobsOnline • Fernandina Beach, FL, US
    Full-time +1
    LocumJobsOnline is working with CompHealth to find a qualified Anesthesiology MD in Fernandina Beach, Florida, 32034!.Whether you are searching for a position in your area or in another state, we h...Show more
    Last updated: 30+ days ago • Promoted
    Travel CT Technologist - Trauma & Stroke Imaging

    Travel CT Technologist - Trauma & Stroke Imaging

    GetMed Staffing, Inc. • Orange Park, FL, US
    Full-time
    CT Technologist for a travel job in Orange Park, Florida.Job Description & Requirements.GetMed Staffing is searching for a strong CT Tech to assist our traveler-friendly client.A minimum of 1-2...Show more
    Last updated: 8 days ago • Promoted
    Travel CT Technologist

    Travel CT Technologist

    IDR Healthcare • Saint Marys, GA, US
    Full-time
    IDR Healthcare is seeking a travel CT Technologist for a travel job in St.Job Description & Requirements.IDR Healthcare is an awarding winning staffing firm that believes it is a privilege and ...Show more
    Last updated: 15 days ago • Promoted
    Travel CT Tech - $1755 / Week

    Travel CT Tech - $1755 / Week

    Ventura MedStaff • Orange Park, FL, US
    Full-time
    Ventura MedStaff is seeking an experienced CT Tech for an exciting Travel Allied job in Orange Park, FL.Shift : 3x12 hr days Start Date : 10 / 20 / 2025 Duration : 12 weeks Pay : $1755 / Week.Founded in 20...Show more
    Last updated: 9 days ago • Promoted
    Lead Data Modeling Engineer (Azure) - Hybrid

    Lead Data Modeling Engineer (Azure) - Hybrid

    Acosta Group • Jacksonville, FL, US
    Full-time
    Lead Data Modeling Engineer (Azure) - Hybrid.Be among the first 25 applicants.Lead Data Modeling Engineer (Azure) - Hybrid. Reporting to the Senior Vice President / General Manager, the Lead Data Mo...Show more
    Last updated: 1 day ago • Promoted
    Travel CT Technologist - Trauma & Stroke Imaging

    Travel CT Technologist - Trauma & Stroke Imaging

    American Medical Staffing • Orange Park, FL, US
    Full-time
    American Medical Staffing is seeking a travel CT Technologist for a travel job in Orange Park, Florida.Job Description & Requirements. We’re living in the new normal.Lives and careers look...Show more
    Last updated: 4 days ago • Promoted
    Travel CT Technologist - Trauma & Stroke Imaging

    Travel CT Technologist - Trauma & Stroke Imaging

    AHS Staffing • Orange Park, FL, US
    Full-time
    AHS Staffing is seeking a travel CT Technologist for a travel job in Orange Park, Florida.Job Description & Requirements. Pay package is based on 12 hour shifts and 36 hours per week (subject to...Show more
    Last updated: 8 days ago • Promoted
    Travel Nurse RN - Med Surg

    Travel Nurse RN - Med Surg

    GLC On-The-Go • Macclenny, FL, US
    Full-time
    GLC On-The-Go is seeking a travel nurse RN Med Surg for a travel nursing job in Macclenny, Florida.Job Description & Requirements. RN Psychiatric - Macclenny, FL - 39-week contract.GLC – N...Show more
    Last updated: 12 days ago • Promoted
    Travel CT Tech - $1706.89 / Week

    Travel CT Tech - $1706.89 / Week

    Atlas MedStaff • Orange Park, FL, US
    Full-time
    Atlas MedStaff is seeking an experienced CT Tech for an exciting Travel Allied job in Orange Park, FL.Shift : 3x12 hr days Start Date : 10 / 20 / 2025 Duration : 12 weeks Pay : $1706.Atlas Medstaff is curr...Show more
    Last updated: 23 days ago • Promoted
    Travel CT Technologist - Trauma & Stroke Imaging

    Travel CT Technologist - Trauma & Stroke Imaging

    Skyline Med Staff Allied • Orange Park, FL, US
    Full-time
    Skyline Med Staff Allied is seeking a travel CT Technologist for a travel job in Orange Park, Florida.Job Description & Requirements. Join the Top- Rated Travel Healthcare Team!.Skyline Med Staf...Show more
    Last updated: 8 days ago • Promoted
    Travel CT Technologist - Trauma & Stroke Imaging

    Travel CT Technologist - Trauma & Stroke Imaging

    Anders Group • Orange Park, FL, US
    Full-time
    Anders Group is seeking a travel CT Technologist for a travel job in Orange Park, Florida.Job Description & Requirements. Pay package is based on 12 hour shifts and 36 hours per week (subject to...Show more
    Last updated: 8 days ago • Promoted
    Travel CT Technologist - Trauma & Stroke Imaging

    Travel CT Technologist - Trauma & Stroke Imaging

    Solomon Page • Orange Park, FL, US
    Full-time
    Solomon Page is seeking a travel CT Technologist for a travel job in Orange Park, Florida.Job Description & Requirements. Our client is looking to add a CT Technologist to their team.As a CT Tec...Show more
    Last updated: 8 days ago • Promoted
    Travel CT Tech - $1628 / Week

    Travel CT Tech - $1628 / Week

    Fusion Medical Staffing • Orange Park, FL, US
    Full-time
    Fusion Medical Staffing is seeking an experienced CT Tech for an exciting Travel Allied job in Orange Park, FL.Shift : Inquire Start Date : 10 / 20 / 2025 Duration : 12 weeks Pay : $1628 / Week.Facility in...Show more
    Last updated: 9 days ago • Promoted
    Travel CT Technologist

    Travel CT Technologist

    PRIDE Health • Orange Park, FL, US
    Full-time
    PRIDE Health is seeking a travel CT Technologist for a travel job in Orange Park, Florida.Job Description & Requirements. PRIDE Health is the minority-owned healthcare recruitment division of Pr...Show more
    Last updated: 4 days ago • Promoted
    Travel CT Technologist - Trauma & Stroke Imaging

    Travel CT Technologist - Trauma & Stroke Imaging

    Planet Healthcare • Orange Park, FL, US
    Full-time
    Planet Healthcare is seeking a travel CT Technologist for a travel job in Orange Park, Florida.Job Description & Requirements. Planet Healthcare Job ID #73230076.Pay package is based on 8 hour s...Show more
    Last updated: 8 days ago • Promoted
    Travel CT Tech - $1876 / Week

    Travel CT Tech - $1876 / Week

    Cynet Health • Orange Park, FL, US
    Full-time
    Cynet Health is seeking an experienced CT Tech for an exciting Travel Allied job in Orange Park, FL.Shift : 3x12 hr days Start Date : 10 / 20 / 2025 Duration : 12 weeks Pay : $1876 / Week.Ranked #5 Best Tr...Show more
    Last updated: 9 days ago • Promoted
    Applied AI / ML Engineer

    Applied AI / ML Engineer

    Catalyst Labs • Jacksonville, FL, US
    Full-time
    Catalyst Labs is a leading talent agency with a specialized vertical in Applied AI, Machine Learning, and Data Science.We stand out as an agency thats deeply embedded in our clients recruitment ope...Show more
    Last updated: 1 day ago • Promoted
    Travel CT Technologist - Trauma & Stroke Imaging

    Travel CT Technologist - Trauma & Stroke Imaging

    Zack Group • Orange Park, FL, US
    Permanent
    Zack Group is seeking a travel CT Technologist for a travel job in Orange Park, Florida.Job Description & Requirements.CT Tech CT Tech’s for positions in Orange Park, Florida.The ideal ca...Show more
    Last updated: 8 days ago • Promoted