Talent.com
No longer accepting applications
Senior Software Engineer - ML / LLM Serving

Alldus • San Jose, CA, US
1 day ago
Job type
  • Full-time
Job description

This range is provided by Alldus. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.

Base pay range

$180,000.00 / yr - $220,000.00 / yr

About the Role

We are seeking a senior / staff Machine Learning Serving Software Engineer who thrives in a fast-paced, customer-focused environment and can build robust, flexible infrastructure to serve a diverse range of ML models, including both LLMs and classical ML models.

The Role

As an ML Serving Engineer, you will design, implement, and optimize infrastructure that powers the deployment and inference of machine learning models across varied customer environments. You’ll work closely with product, research, and customer engineering teams to deliver low-latency, secure, and scalable ML serving solutions.

Responsibilities

  • Design and build scalable, high-performance ML serving infrastructure capable of handling diverse model types (LLMs, recommendation systems, etc.).
  • Optimize inference pipelines for latency, throughput, and cost efficiency.
  • Integrate with a wide range of customer environments, adapting serving strategies to fit their infrastructure and compliance needs.
  • Deploy, monitor, and maintain ML models in production using modern deployment stacks.
  • Collaborate with ML researchers to operationalize new models and ensure seamless integration into customer workflows.
  • Ensure security and privacy best practices are applied to model deployment and inference, aligning with enterprise-grade data security requirements.
  • Stay up to date with the latest serving technologies and frameworks, evaluating and integrating them where relevant.

Qualifications

Required

  • 5+ years of professional software engineering experience, with at least 3 years focused on ML serving, inference infrastructure, or similar domains.
  • Proven experience deploying and optimizing large language models (LLMs) in production.
  • Hands-on expertise with multiple ML serving frameworks (e.g., TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, Ray Serve, vLLM).
  • Strong programming skills in Python, Go, or C++.
  • Experience with distributed systems and container orchestration tools (Kubernetes, Docker).
  • Familiarity with observability stacks (Prometheus, Grafana, OpenTelemetry) and performance profiling for inference workloads.
  • Solid understanding of secure data handling and privacy-preserving ML practices.
  • Knowledge of cloud platforms (AWS, GCP, Azure) and hybrid / on-prem deployment scenarios.

Preferred

  • Prior experience serving multiple model types beyond LLMs, e.g., recommendation engines and classical ML models.
  • Exposure to model quantization, distillation, caching, and other optimization techniques for inference efficiency.
  • Experience working with enterprise customers or within compliance-heavy environments.

Details

  • Seniority level: Mid-Senior level
  • Employment type: Full-time
  • Job function: Software Development