Talent.com
Senior ML Inference Platform Engineer

Senior ML Inference Platform Engineer

AIONSeattle, WA, US
15 days ago
Job type
  • Full-time
  • Quick Apply
Job description

About AION

AION is building the next generation of AI cloud platform by transforming the future of high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance, AION democratizes access to compute power for AI training, fine-tuning, inference, data labeling, and full stack AI / ML lifecycle.

Led by high-pedigree founders with previous exits, AION is well-funded by major VCs with strategic global partnerships. Headquartered in the US with global presence, the company is building its initial core team across India, London and Seattle.

Who You Are

You're an ML systems engineer who's passionate about building high-performance inference infrastructure. You don't need to be an expert in everything - this field is evolving too rapidly for that - but you have strong fundamentals and the curiosity to dive deep into optimization challenges. You thrive in early-stage environments where you'll learn cutting-edge techniques while building production systems. You think systematically about performance bottlenecks and are excited to push the boundaries of what's possible in AI infrastructure.

Requirements

Key Responsibilities

  • Build and optimize LLM inference systems working towards 2-4x performance improvements over standard frameworks like vLLM and TensorRT-LLM.
  • Implement modern inference optimizations including KV-cache management, dynamic batching, speculative decoding, compression and quantization strategies.
  • Develop GPU optimization solutions using CUDA, with opportunities to learn advanced techniques like Triton kernel development and CUDA graphs.
  • Design model evaluation and benchmarking systems to assess performance across reasoning, coding, and safety metrics.
  • Research and integrate trending open-source models (DeepSeek R1, Qwen 3, Llama 4, Mistral variants) with optimized configurations.
  • Build performance monitoring and profiling tools for GPU cluster analysis, bottleneck identification, and cost optimization.
  • Create cost-performance optimization strategies that balance throughput, latency, and infrastructure costs.
  • Explore agent orchestration capabilities for multi-step reasoning and tool integration workflows.
  • Collaborate with tech and product teams to identify optimization opportunities and translate them into production improvements.

Skills & Experience

  • High agency individual looking to own and influence product architecture and company direction
  • 3+ years of software engineering experience with focus on performance-critical systems and production deployments.
  • Strong Python expertise and working knowledge of C++ for performance optimization.
  • Working understanding of deep learning fundamentals including transformer architectures, attention mechanisms, and neural network training / inference.
  • Hands-on experience of model serving and deployment techniques.
  • Experience with at least one modern inference framework (vLLM, TensorRT-LLM, SGLang or similar) in a production setting.
  • Hands-on experience with PyTorch including model development, training loops, and basic distributed computing concepts.
  • Understanding of distributed systems concepts including load balancing, auto-scaling, and fault tolerance.
  • Basic GPU programming experience with CUDA or willingness to quickly learn GPU optimization techniques.
  • Strong debugging and performance profiling skills for identifying and resolving system bottlenecks.
  • Benefits

  • Join the ground floor of a mission-driven AI startup revolutionizing compute infrastructure.
  • Work with a high-caliber, globally distributed team backed by major VCs.
  • Competitive compensation and benefits.
  • Fast-paced, flexible work environment with room for ownership and impact.
  • Hybrid model : 3 days in-office, 2 days remote with flexibility to work remotely for part of the year.
  • In case you got any questions about the role please reach out to hiring manager on linkedin or X .

    Create a job alert for this search

    Senior Engineer Ml • Seattle, WA, US

    Related jobs
    • Promoted
    Principal Machine Learning Engineer, Monetization

    Principal Machine Learning Engineer, Monetization

    PinterestSeattle, WA, United States
    Full-time
    Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...Show moreLast updated: 29 days ago
    • Promoted
    Junior Machine Learning Engineer

    Junior Machine Learning Engineer

    VirtualVocationsEverett, Washington, United States
    Full-time
    A company is looking for a Junior Machine Learning Engineer to support the design, development, and deployment of machine learning models. Key Responsibilities Support the end-to-end lifecycle of ...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    AI Data Scientist

    AI Data Scientist

    VirtualVocationsEverett, Washington, United States
    Full-time
    A company is looking for a Data Scientist (AI Focus).Key Responsibilities Design and build natural language interaction capabilities using Python and Jupyter environments Act as a subject matter...Show moreLast updated: 18 hours ago
    • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    VirtualVocationsKent, Washington, United States
    Full-time
    A company is looking for a Machine Learning Engineer - Inventory Demand Forecasting (Retail / CPG).Key Responsibilities Build ML-based forecasting and inventory optimization systems for retail / eC...Show moreLast updated: 30+ days ago
    • Promoted
    Deep Learning Software Engineer

    Deep Learning Software Engineer

    VirtualVocationsTacoma, Washington, United States
    Full-time
    A company is looking for a Deep Learning Software Engineer, Inference and Model Optimization - New College Grad 2025.Key Responsibilities Train, develop, and deploy generative AI models using the...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    AI Developer Consultant

    AI Developer Consultant

    VirtualVocationsSeattle, Washington, United States
    Full-time
    A company is looking for a Freelance Developer Consultant (Kotlin) - AI Trainer.Key Responsibilities Code generation and code review Training and evaluation of large language models Collaborati...Show moreLast updated: 18 hours ago
    • Promoted
    AI Solutions Specialist

    AI Solutions Specialist

    VirtualVocationsKent, Washington, United States
    Full-time
    A company is looking for an AI Solutions Specialist to drive business innovation through artificial intelligence.Key Responsibilities Analyze complex datasets to uncover trends and actionable ins...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Agentic AI Engineer

    Agentic AI Engineer

    VirtualVocationsEverett, Washington, United States
    Full-time
    A company is looking for an Agent Forward Engineer to lead the design and deployment of AI agents and software for application development and migration. Key Responsibilities Design and implement ...Show moreLast updated: 14 hours ago
    • Promoted
    • New!
    AI Integration Engineer

    AI Integration Engineer

    VirtualVocationsKent, Washington, United States
    Full-time
    A company is looking for a Staff Game Engineer to build and support AI research applications in games.Key Responsibilities Integrate AI agents with games Build tools for debugging and improving ...Show moreLast updated: 14 hours ago
    • Promoted
    California Licensed Epic Analyst

    California Licensed Epic Analyst

    VirtualVocationsRenton, Washington, United States
    Full-time
    A company is looking for an Epic PB Implementation Cutover Analyst for a contract position focused on workflow mapping and appointment conversion planning. Key Responsibilities Support PB and Cade...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    VirtualVocationsEverett, Washington, United States
    Full-time
    Machine Learning Engineer to contribute to machine learning solutions for network security challenges.Key Responsibilities Contribute to all stages of AI / ML projects, from exploration to producti...Show moreLast updated: 30+ days ago
    • Promoted
    AI Developer

    AI Developer

    VirtualVocationsTacoma, Washington, United States
    Full-time
    A company is looking for an AI Developer to advance the development of intelligent applications remotely.Key Responsibilities : Design and develop AI applications with OpenAI GPT models, ensuring ...Show moreLast updated: 30+ days ago
    • Promoted
    Data Quality Senior Engineer

    Data Quality Senior Engineer

    VirtualVocationsTacoma, Washington, United States
    Full-time
    A company is looking for an IT Data Quality Senior Engineer to ensure data quality and compliance across various projects. Key Responsibilities Automate and optimize data validations, implement bu...Show moreLast updated: 2 days ago
    • Promoted
    Generative Machine Learning Engineer

    Generative Machine Learning Engineer

    VirtualVocationsTacoma, Washington, United States
    Full-time
    A company is looking for a Generative Machine Learning Engineer.Key Responsibilities Develop and implement Deep Learning Pipelines using cutting-edge models Sustain the infrastructure for Deep L...Show moreLast updated: 30+ days ago
    • Promoted
    AI Software Engineer

    AI Software Engineer

    VirtualVocationsRenton, Washington, United States
    Full-time
    A company is looking for an AI Software Engineer to join their Operations Engineering team.Key Responsibilities Design, develop, and deploy AI-driven features and intelligent agents for real-worl...Show moreLast updated: 30+ days ago
    • Promoted
    AI / ML Engineer

    AI / ML Engineer

    VirtualVocationsTacoma, Washington, United States
    Full-time
    A company is looking for an AI / ML Engineer.Key Responsibilities Develop software in Python, C#, and C++ for LLM training, testing, and inference modules Lead the creation and implementation of A...Show moreLast updated: 30+ days ago
    • Promoted
    AI Engineer

    AI Engineer

    VirtualVocationsEverett, Washington, United States
    Full-time
    A company is looking for an AI Engineer, Principal.Key Responsibilities Collaborate with data scientists and ML engineers to containerize, deploy, and monitor AI / ML models Design and manage clou...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Data Engineer (AI Platforms)

    Data Engineer (AI Platforms)

    VirtualVocationsKent, Washington, United States
    Full-time
    A company is looking for a Data Engineer (AI Platforms) to contribute to building and optimizing data solutions for AI applications. Key Responsibilities Design, build, and optimize scalable data ...Show moreLast updated: 22 hours ago
    • Promoted
    PhD University Grad Machine Learning Engineer 2026 (USA)

    PhD University Grad Machine Learning Engineer 2026 (USA)

    PinterestSeattle, WA, United States
    Full-time
    Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...Show moreLast updated: 6 days ago
    • Promoted
    AI Implementation Specialist

    AI Implementation Specialist

    VirtualVocationsTacoma, Washington, United States
    Full-time
    A company is looking for an AI Implementation Specialist for Automated Portfolio Analysis and Reporting.Key Responsibilities Develop a GUI-based prototype for AI-powered document analysis and vis...Show moreLast updated: 30+ days ago