Senior Software Engineer - ML / LLM Serving
This range is provided by Alldus; actual pay will be based on your skills and experience.
Base pay range
$180,000.00 / yr - $220,000.00 / yr
About the Role
We are seeking a senior/staff Machine Learning Serving Software Engineer who thrives in a fast-paced, customer-focused environment and can build robust, flexible infrastructure to serve a diverse range of ML models, including both LLMs and classical ML.
The Role
As an ML Serving Engineer, you will design, implement, and optimize infrastructure that powers the deployment and inference of machine learning models across varied customer environments. You’ll work closely with product, research, and customer engineering teams to deliver low-latency, secure, and scalable ML serving solutions.
Responsibilities
- Design and build scalable, high-performance ML serving infrastructure capable of handling diverse model types (LLMs, recommendation systems, etc.).
- Optimize inference pipelines for latency, throughput, and cost efficiency.
- Integrate with a wide range of customer environments, adapting serving strategies to fit their infrastructure and compliance needs.
- Deploy, monitor, and maintain ML models in production using modern deployment stacks.
- Collaborate with ML researchers to operationalize new models and ensure seamless integration into customer workflows.
- Ensure security and privacy best practices are applied to model deployment and inference, aligning with enterprise-grade data security requirements.
- Stay up-to-date with the latest serving technologies and frameworks, evaluating and integrating them where relevant.
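To make the latency/throughput trade-off above concrete, here is a minimal sketch of request micro-batching, one common technique for raising inference throughput. All names (`MicroBatcher`, `run_model`) are illustrative, not part of any framework mentioned in this posting:

```python
from dataclasses import dataclass, field
from typing import Any, Callable, List

@dataclass
class MicroBatcher:
    """Collects individual requests and runs the model on whole batches.

    `run_model` is a hypothetical stand-in for a real inference call
    (e.g. a GPU forward pass); batching amortizes its fixed per-call cost.
    """
    run_model: Callable[[List[Any]], List[Any]]
    max_batch_size: int = 8
    _pending: List[Any] = field(default_factory=list)
    results: List[Any] = field(default_factory=list)

    def submit(self, request: Any) -> None:
        self._pending.append(request)
        if len(self._pending) >= self.max_batch_size:
            self.flush()

    def flush(self) -> None:
        # Run the model once on everything queued so far.
        if self._pending:
            self.results.extend(self.run_model(self._pending))
            self._pending.clear()

# Toy "model" that squares each input; a real server would call into
# an inference runtime here.
batcher = MicroBatcher(run_model=lambda xs: [x * x for x in xs],
                       max_batch_size=4)
for i in range(6):
    batcher.submit(i)
batcher.flush()          # drain the final partial batch
print(batcher.results)   # [0, 1, 4, 9, 16, 25]
```

Production servers (e.g. Triton's dynamic batching or vLLM's continuous batching) add a time-based flush deadline on top of the size threshold, so a lone request is not stuck waiting for a full batch; that is the latency side of the trade-off.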
Qualifications
Required
- 5+ years of professional software engineering experience, with at least 3 years focused on ML serving, inference infrastructure, or similar domains.
- Proven experience deploying and optimizing large language models (LLMs) in production.
- Hands-on expertise with multiple ML serving frameworks (e.g., TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, Ray Serve, vLLM).
- Strong programming skills in Python, Go, or C++.
- Experience with distributed systems and container orchestration tools (Kubernetes, Docker).
- Familiarity with observability stacks (Prometheus, Grafana, OpenTelemetry) and performance profiling for inference workloads.
- Solid understanding of secure data handling and privacy-preserving ML practices.
- Knowledge of cloud platforms (AWS, GCP, Azure) and hybrid/on-prem deployment scenarios.
Preferred
- Prior experience serving multiple model types beyond LLMs, e.g., recommendation engines and classical ML models.
- Exposure to model quantization, distillation, caching, and other optimization techniques for inference efficiency.
- Experience working with enterprise customers or within compliance-heavy environments.
Details
- Seniority level: Mid-Senior level
- Employment type: Full-time
- Job function: Software Development
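Of the preferred optimization techniques listed above, caching is the simplest to illustrate. The sketch below uses Python's standard `functools.lru_cache` to memoize an inference call; `cached_infer` and the call counter are hypothetical stand-ins, not any real serving API:

```python
from functools import lru_cache

# Counts how often the underlying "model" actually runs, so the
# cache's effect is observable.
calls = 0

@lru_cache(maxsize=1024)
def cached_infer(prompt: str) -> str:
    """Hypothetical stand-in for an expensive model call."""
    global calls
    calls += 1
    # A real implementation would run tokenization + a forward pass here.
    return prompt.upper()

cached_infer("hello")
cached_infer("hello")   # exact repeat: served from cache, model not rerun
cached_infer("world")
print(calls)            # 2
```

Note that `lru_cache` only helps on exact-match inputs; production LLM serving more often relies on KV-cache reuse or semantic caching, where near-duplicate prompts can also hit the cache.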