Talent.com
Director, Technical Program Management - AI and ML Platforms

Nvidia Corporation, Santa Clara, CA, United States
3 days ago
Job type
  • Full-time
Job description

The DGX Cloud organization builds and operates the AI infrastructure that makes this innovation possible. We are seeking a Director of Technical Program Management (TPM) to lead AI / ML Platform initiatives within the DGX Cloud Infrastructure team. This role will coordinate large-scale, cross-functional programs that shape how NVIDIA researchers develop, train, and deploy AI models on our global DGX Cloud platform. You will lead a team of TPMs responsible for orchestrating compute platforms, cluster bring-ups, workload scheduling, and platform enablement across NVIDIA's most advanced GPU systems.

As Director of Technical Program Management for AI / ML Platforms, your mission is to accelerate NVIDIA's research and product innovation by delivering a resilient, high-performance AI platform that seamlessly integrates hardware, orchestration, and developer productivity. You will bridge NVIDIA Research, DGX Engineering, and Cloud Operations, ensuring our infrastructure evolves to meet the rapidly expanding scale and complexity of AI workloads.

What You'll Be Doing:

  • Lead and scale the Technical Program Management organization responsible for the DGX Cloud AI / ML platform, enabling 1,000+ NVIDIA researchers globally.
  • Drive the roadmap for end-to-end AI / ML infrastructure, spanning cluster bring-up, workload orchestration, GPU resource management, and integration with MLOps pipelines.
  • Collaborate with technology and research leaders to define platform requirements, align compute strategy with AI model development, and deliver a seamless researcher experience.
  • Lead complex programs involving next-generation systems (e.g., GB200) and fleet-wide scaling initiatives across OCI, GCP, and other hyperscalers.
  • Own platform efficiency and capacity management, using deep understanding of scheduling systems (e.g., Slurm, hybrid models) to optimize job placement, utilization, and turnaround time.
  • Establish data-driven operational metrics (availability, occupancy, wait times, throughput) and use them to guide continuous improvement and prioritization.
  • Implement governance and visibility frameworks that drive alignment, predictability, and accountability across AI platform initiatives.
  • Represent DGX Cloud programs to senior leadership, clearly articulating impact, risk, and value across engineering and research organizations.

What We Need to See:

  • 15+ years of overall technical program management experience, including 7+ years leading and developing TPM teams in infrastructure, AI / ML, or platform engineering domains.
  • Demonstrated success delivering large-scale AI / ML systems and platform initiatives, encompassing workload orchestration, data pipeline integration, model training environments, and GPU fleet management.
  • Deep technical understanding of AI / ML workflows, job scheduling (Slurm, Kubernetes, hybrid orchestration), and large-scale distributed systems.
  • Proficiency in optimizing resource utilization and monitoring performance metrics in compute-intensive environments.
  • Experience building platforms across cloud and on-prem hybrid architectures, integrating with internal and external MLOps stacks.
  • Proficiency with observability and telemetry tools (e.g., Grafana, Prometheus) for infrastructure monitoring and performance analysis.
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field (or equivalent experience).

Ways to Stand Out from the Crowd:

  • Proficiency in AI / ML systems, model lifecycle management, and developer tools for large-scale training workloads.
  • Track record driving R&D productivity platforms and reducing friction for machine learning practitioners.
  • Experience in new product introduction (NPI) for research and infrastructure systems.
  • Deep familiarity with cloud compute and orchestration technologies, and a passion for automation and operational excellence.
  • Executive communication skills, with the ability to translate complex technical programs into clear business and research outcomes.

NVIDIA is widely considered one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people on our team. If you're driven, excited by tech and AI, creative and autonomous, we want to hear from you!

    Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 264,000 USD - 402,500 USD.

    You will also be eligible for equity and benefits.

    Applications for this job will be accepted at least until November 3, 2025.

    NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.