Talent.com
No longer accepting applications
Principal Engineer - AI Infrastructure Abstractions

Diversity Talent Scouts • San Jose, CA, United States
18 days ago
Job type
  • Full-time
Job description

As a Principal AI Infrastructure Abstraction Engineer, you will design and implement the foundational systems that make shared AI compute environments scalable, secure, and developer-friendly. Your work will focus on creating abstractions that hide hardware complexity while providing predictable, cloud-native interfaces for AI workloads.

This position bridges infrastructure and applied AI, turning raw GPUs and accelerators into programmable, elastic, and multi-tenant resources for both internal developers and enterprise clients.

Key Responsibilities

  • Architect abstractions that map logical compute constructs (vGPUs, GPU pools, workload queues) to physical devices.
  • Build APIs, services, and control planes that expose GPU and accelerator resources with strong isolation and quality-of-service guarantees.
  • Develop mechanisms for secure GPU sharing, including time-slicing, partitioning, and namespace isolation.
  • Work with orchestration and scheduling systems to ensure intelligent mapping of resources based on utilization, priority, and network topology.
  • Define policies for quotas, fair allocation, and resource elasticity in shared environments.
  • Integrate with AI/ML frameworks (PyTorch, TensorFlow, Triton, etc.) to optimize model training and inference workflows.
  • Deliver observability and monitoring capabilities that trace resource usage from logical abstractions to hardware.
  • Partner with platform security teams to strengthen access controls, onboarding processes, and tenant isolation.
  • Support internal developer adoption of abstraction APIs while maintaining high performance and low overhead.
  • Contribute to long-term compute platform strategy with a focus on modularity, abstraction, and scale.

Minimum Qualifications

  • Bachelor's degree with 15+ years of experience, Master's with 12+ years, or PhD with 8+ years.
  • Proven track record of building production-grade infrastructure systems, preferably in Go, Python, or C++.
  • Strong experience with containerization and orchestration platforms (Kubernetes, Docker, KubeVirt).
  • Background in designing logical abstractions for compute, storage, or networking in multi-tenant systems.
  • Familiarity with machine learning platforms and how to integrate with them (e.g., PyTorch, TensorFlow, Triton, MLflow).

Preferred Qualifications

  • Hands-on experience with GPU sharing, scheduling, or isolation (MIG, MPS, vGPUs, time-slicing, or device plugin models).
  • Deep knowledge of resource management: quotas, prioritization, fairness, elasticity.
  • Strong ability to think across hardware/software boundaries and design abstractions that scale.