Talent.com
Senior ML Inference Platform Engineer
Senior ML Inference Platform EngineerAION • Seattle, WA, US
Senior ML Inference Platform Engineer

Senior ML Inference Platform Engineer

AION • Seattle, WA, US
30+ days ago
Job type
  • Full-time
  • Quick Apply
Job description

About AION

AION is building the next generation of AI cloud platform by transforming the future of high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance, AION democratizes access to compute power for AI training, fine-tuning, inference, data labeling, and full stack AI / ML lifecycle.

Led by high-pedigree founders with previous exits, AION is well-funded by major VCs with strategic global partnerships. Headquartered in the US with global presence, the company is building its initial core team across India, London and Seattle.

Who You Are

You're an ML systems engineer who's passionate about building high-performance inference infrastructure. You don't need to be an expert in everything - this field is evolving too rapidly for that - but you have strong fundamentals and the curiosity to dive deep into optimization challenges. You thrive in early-stage environments where you'll learn cutting-edge techniques while building production systems. You think systematically about performance bottlenecks and are excited to push the boundaries of what's possible in AI infrastructure.

Requirements

Key Responsibilities

  • Build and optimize LLM inference systems working towards 2-4x performance improvements over standard frameworks like vLLM and TensorRT-LLM.
  • Implement modern inference optimizations including KV-cache management, dynamic batching, speculative decoding, compression and quantization strategies.
  • Develop GPU optimization solutions using CUDA, with opportunities to learn advanced techniques like Triton kernel development and CUDA graphs.
  • Design model evaluation and benchmarking systems to assess performance across reasoning, coding, and safety metrics.
  • Research and integrate trending open-source models (DeepSeek R1, Qwen 3, Llama 4, Mistral variants) with optimized configurations.
  • Build performance monitoring and profiling tools for GPU cluster analysis, bottleneck identification, and cost optimization.
  • Create cost-performance optimization strategies that balance throughput, latency, and infrastructure costs.
  • Explore agent orchestration capabilities for multi-step reasoning and tool integration workflows.
  • Collaborate with tech and product teams to identify optimization opportunities and translate them into production improvements.

Skills & Experience

  • High agency individual looking to own and influence product architecture and company direction
  • 3+ years of software engineering experience with focus on performance-critical systems and production deployments.
  • Strong Python expertise and working knowledge of C++ for performance optimization.
  • Working understanding of deep learning fundamentals including transformer architectures, attention mechanisms, and neural network training / inference.
  • Hands-on experience of model serving and deployment techniques.
  • Experience with at least one modern inference framework (vLLM, TensorRT-LLM, SGLang or similar) in a production setting.
  • Hands-on experience with PyTorch including model development, training loops, and basic distributed computing concepts.
  • Understanding of distributed systems concepts including load balancing, auto-scaling, and fault tolerance.
  • Basic GPU programming experience with CUDA or willingness to quickly learn GPU optimization techniques.
  • Strong debugging and performance profiling skills for identifying and resolving system bottlenecks.
  • Benefits

  • Join the ground floor of a mission-driven AI startup revolutionizing compute infrastructure.
  • Work with a high-caliber, globally distributed team backed by major VCs.
  • Competitive compensation and benefits.
  • Fast-paced, flexible work environment with room for ownership and impact.
  • Hybrid model : 3 days in-office, 2 days remote with flexibility to work remotely for part of the year.
  • In case you got any questions about the role please reach out to hiring manager on linkedin or X .

    Create a job alert for this search

    Senior Engineer Platform • Seattle, WA, US

    Related jobs
    ML Research Engineer, ML Systems

    ML Research Engineer, ML Systems

    Scale AI, Inc. • Seattle, WA, United States
    Full-time
    Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering MLEs, researchers, data scientists and opera...Show more
    Last updated: 30+ days ago • Promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    Hive • Seattle, Washington, United States
    Full-time
    Hive is the leading provider of cloud-based AI solutions to understand, search, and generate content, and is trusted by hundreds of the world's largest and most innovative organizations.The company...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer - Machine Learning Feature Store

    Senior Software Engineer - Machine Learning Feature Store

    Snowflake • Bellevue, Washington, United States
    Full-time
    Join our ML Feature Store team where we're building cutting-edge product capabilities that power complex feature transformations and low-latency feature serving. We're revolutionizing machine learni...Show more
    Last updated: 30+ days ago • Promoted
    Senior Staff Machine Learning Engineer

    Senior Staff Machine Learning Engineer

    Rippling • Seattle, Washington, United States
    Full-time
    Rippling is the first way for businesses to manage all of their HR & IT—payroll, benefits, computers, apps, and more—in one unified workforce platform. By connecting every business system to one sou...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer 2

    Data Engineer 2

    Funko • Washington, Everett, United States
    Full-time
    Welcome to the Funko-verse, a world built on pure imagination, a land governed by the philosophy that stories matter, a universe comprised of characters from countless fandoms, a galaxy of once upo...Show more
    Last updated: 30+ days ago • Promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    Axon • Seattle, Oregon, USA
    Full-time
    Join Axon and be a Force for Good.At Axon were on a mission to Protect Life.Were explorers pursuing societys most critical safety and justice issues with our ecosystem of devices and cloud software...Show more
    Last updated: 1 day ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Helion Energy • Everett, Washington, United States
    Full-time
    We are a fusion power company based in Everett, WA, with the mission to build the world's first fusion power plant, enabling a future with unlimited clean electricity. Our vision is a world with cle...Show more
    Last updated: 17 days ago • Promoted
    AIML Senior Machine Learning Engineer ML Hub Core, Machine Learning Platform Technologies (MLPT)

    AIML Senior Machine Learning Engineer ML Hub Core, Machine Learning Platform Technologies (MLPT)

    Apple • Seattle, Oregon, USA
    Full-time
    We design and build machine learning infrastructures to support features that empowers billions of Apple users.As part of this group you will develop the systems we use to build AI / ML models throug...Show more
    Last updated: 21 days ago • Promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    Continua • Seattle, Oregon, USA
    Full-time
    AI agent that users can invite into group conversations to make planning coordination and information retrieval effortless. Funded by Google and Bessemer Venture Partners Continua is developing the ...Show more
    Last updated: 13 days ago • Promoted
    AIML - Sr. Machine Learning Data Engineer, Machine Learning Platform Technologies

    AIML - Sr. Machine Learning Data Engineer, Machine Learning Platform Technologies

    Apple Inc. • Seattle, WA, United States
    Full-time
    AIML - Data Engineer, Machine Learning Platform Technologies.Seattle, Washington, United States Machine Learning and AI.Join us in building the machine learning platform that enables teams at Apple...Show more
    Last updated: 30+ days ago • Promoted
    Senior ML Engineer

    Senior ML Engineer

    Truveta • Seattle, Washington, United States
    Full-time
    Truveta is the world’s first health provider led data platform with a vision of Saving Lives with Data.Our mission is to enable researchers to find cures faster, empower every clinician to be an ex...Show more
    Last updated: 18 days ago • Promoted
    Senior Business Intelligence Engineer D&D Beyond

    Senior Business Intelligence Engineer D&D Beyond

    Hasbro • Renton, Washington, USA
    Full-time
    At Wizards of the Coast we connect people around the world through play and imagination.From our genre-defining games like Magic : The Gathering and Dungeons & Dragons to our growing multiverse ...Show more
    Last updated: 4 days ago • Promoted
    ML Research Engineer, ML Systems

    ML Research Engineer, ML Systems

    Scale AI • Seattle, WA, United States
    Full-time
    Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering MLEs, researchers, data scientists and opera...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer

    Software Engineer

    Fortive • Everett, Washington, United States
    Full-time
    The software team at Fluke Corporation is looking for a highly skilled.The ideal candidate will have a strong background in C / C++, C#, WPF, WCF, ASP. NET, and experience working with relational data...Show more
    Last updated: 30+ days ago • Promoted
    ML Engineer

    ML Engineer

    Catalyst Labs • Seattle, Oregon, USA
    Full-time
    Is a rapidly growing Tier 1 VC backed startup based in New York with $60 million in funding revolutionizing how outside sales and service teams work. Their AI technology captures and analyzes real-w...Show more
    Last updated: 10 days ago • Promoted
    AIML Sr Machine Learning Engineer, DMLI

    AIML Sr Machine Learning Engineer, DMLI

    Apple • Seattle, Oregon, USA
    Full-time
    As a Machine Learning (ML) Engineer you will be entrusted with the critical role of innovating and applying state of the art research in ML to tackle complex data problems.The solutions you develop...Show more
    Last updated: 23 days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Fortive • Everett, Washington, United States
    Full-time
    The software team at Fluke Corporation is looking for a highly skilled.The ideal candidate will have a strong background in C / C++, C#, WPF, WCF, ASP. NET, and experience working with relational data...Show more
    Last updated: 30+ days ago • Promoted
    Senior Analyst, Configuration Information Management- NetworX

    Senior Analyst, Configuration Information Management- NetworX

    Molina Healthcare • Everett, WA, United States
    Full-time
    Serves as a subject matter expert on system capabilities, conducting research and root cause analysis to resolve complex business and technical issues. Ensures system configuration aligns with busin...Show more
    Last updated: 14 days ago • Promoted