Talent.com
Associate Tech Specialist
Associate Tech SpecialistYantran LLC • San Jose, CA, United States
Associate Tech Specialist

Associate Tech Specialist

Yantran LLC • San Jose, CA, United States
30+ days ago
Job type
  • Full-time
Job description

Location : San Jose, CA

Job Description : AI / ML Model Deployment Specialist / Production MLOps Engineer

Overview

We are seeking a highly skilled and specialized AI / ML Model Deployment Specialist / Production MLOps Engineer. The core focus of this role is to take existing machine learning models and optimize their deployment and serving infrastructure for high-performance, production-ready inference, with a strong emphasis on leveraging state-of-the-art AI model serving technologies.

Responsibilities

Model-Specific Optimization :

Analyze and understand the underlying logic and dependencies of various AI / ML models (primarily using PyTorch and TensorFlow) to identify bottlenecks in the inference pipeline.

High-Performance Serving Implementation :

Design, implement, and manage high-performance inference serving solutions utilizing specialized inference servers (e.g., vLLM) to achieve low latency and high throughput.

GPU Utilization Optimization :

Optimize model serving configurations specifically for GPU hardware to maximize resource efficiency and performance metrics in a production environment.

Containerization for Deployment :

Create minimal, secure, and production-ready Docker images for streamlined deployment of optimized models and inference servers across various environments.

Collaboration :

Work closely with core engineering and data science teams to ensure a smooth transition from model development to high-scale production deployment.

Required Skillsets

AI / ML Domain Expertise

Deep understanding of the AI / ML domain, with the core effort centered around model performance and serving, rather than general infrastructure.

ML Frameworks

Expertise in PyTorch and TensorFlow :

Proven ability to work with and troubleshoot model-specific dependencies, logic, and graph structures within these major frameworks.

Inference Optimization

Production Inference Experience :

Expertise in designing and implementing high-throughput, low-latency model serving solutions.

Specialized Inference Servers :

Mandatory experience with high-performance inference servers, specifically including

vLLM , or similar dedicated LLM serving frameworks.

GPU Optimization :

Demonstrated ability to optimize model serving parameters and infrastructure to maximize performance on NVIDIA or equivalent GPU hardware.

Deployment and Infrastructure

Containerization (Docker) :

Proficiency in creating minimal, secure, and efficient Docker images for model and server deployment.

Infrastructure Knowledge (Helpful, but Secondary) :

General knowledge of cloud platforms (AWS, GCP, Azure) and Kubernetes / orchestration is beneficial but the primary focus remains on model serving and optimization.

Qualifications

Bachelor's or Master's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.

8+ years of experience in MLOps, Machine Learning Engineering, or a specialized Inference Optimization role.

A portfolio or project experience demonstrating successful deployment of high-performance, containerized ML models to production scale.

Create a job alert for this search

Associate Specialist • San Jose, CA, United States

Related jobs
Associate Development Specialist

Associate Development Specialist

Cooley LLP • Palo Alto, CA, United States
Full-time
Associate Development Specialist.Cooley is seeking an Associate Development Specialist to join the Associate Development team in support of the associate performance management process.The Associat...Show more
Last updated: 30+ days ago • Promoted
Associate, F&S, Products & Technology

Associate, F&S, Products & Technology

Netflix • Los Gatos, CA, US
Full-time
Associate, F&S, Products & Technology.Los Gatos, California, United States of America Los Angeles, California, United States of America. Netflix is one of the world's leading entertainment services...Show more
Last updated: 30+ days ago • Promoted
Technical Support Associate

Technical Support Associate

Super Micro Computer • San Jose, CA, United States
Full-time
Supermicro® is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customer...Show more
Last updated: 12 days ago • Promoted
Associate Customer Success Specialist

Associate Customer Success Specialist

Lucid Software • San Jose, CA, US
Full-time
Associate Customer Success Specialist.Lucid Software is the leader in visual collaboration and work acceleration, helping teams see and build the future by turning ideas into reality.Our products i...Show more
Last updated: 5 days ago • Promoted
Technical Support Associate

Technical Support Associate

Supermicro • San Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show more
Last updated: 16 days ago • Promoted
Associate Team Leader

Associate Team Leader

San Jose Staffing • San Jose, CA, US
Full-time +1
Our Company At H&R Block, we believe in the power of people helping people.Our defining purpose is to provide help and inspire confidence in our clients, associates, and communities everywhere.We a...Show more
Last updated: 21 days ago • Promoted
Business Insurance Analytics Associate

Business Insurance Analytics Associate

Marsh LLC • San Ramon, CA, United States
Full-time
Award-winning, inclusive, Top Workplace culture doesn’t happen overnight.It’s a result of hard work by extraordinary people. The industry’s brightest talent drive our efforts to deliver purposeful w...Show more
Last updated: 16 days ago • Promoted
STEP Associate

STEP Associate

Sonepar USA • San Leandro, CA, United States
Full-time
There's a Place for You at OneSource Distributors.A career at OneSource is more than a job.You're investing in a brighter, more sustainable future together and joining a team that makes a real diff...Show more
Last updated: 30+ days ago • Promoted
Corporate Deals Tech Senior Associate

Corporate Deals Tech Senior Associate

Veterans Staffing • Palo Alto, CA, US
Full-time
At PwC, our people in deals focus on providing strategic advice and support to clients in areas such as mergers and acquisitions, divestitures, and restructuring. They help clients navigate complex ...Show more
Last updated: 2 days ago • Promoted
Venture Associate (Remote)

Venture Associate (Remote)

VC Lab • Palo Alto, CA, US
Remote
Full-time
Decile Group is transforming venture capital into a force for good.Through VC Lab the world's leading venture capital accelerator and DecileHub our fund operations platform we are working to la...Show more
Last updated: 24 days ago • Promoted
Associate Development Specialist

Associate Development Specialist

Cooley • Palo Alto, CA, US
Full-time
Associate Development Specialist.Cooley is seeking an Associate Development Specialist to join the Associate Development team in support of the associate performance management process.The Associat...Show more
Last updated: 30+ days ago • Promoted
Project Manager, Associate

Project Manager, Associate

Abacus Service Corporation • San Ramon, CA, US
Full-time
Education Degree Type Major / Certification Required Preferred Bachelor BA / BS.Job Description & Requirements.Show more
Last updated: 30+ days ago • Promoted
Senior Configuration Specialist

Senior Configuration Specialist

Joby Aviation • Santa Cruz, CA, United States
Full-time
Imagine a piloted air taxi that takes off vertically, then quietly carries you and your fellow passengers over the congested city streets below, enabling you to spend more time with the people and ...Show more
Last updated: 12 days ago • Promoted
Associate Tech Specialist

Associate Tech Specialist

Keylent Inc • San Jose, CA, United States
Full-time
Associate Tech Specialist TECHM-JOB-26346.Devops candidate having Kubernetes , Python or Golang Experience.Should have a deep understanding on Kubernetes, Containers, Load Balancing concepts.Should...Show more
Last updated: 2 days ago • Promoted
Travel CT Technologist - $3,175 per week

Travel CT Technologist - $3,175 per week

First Connect Health • Santa Cruz, CA, United States
Full-time
First Connect Health is seeking a travel CT Technologist for a travel job in Santa Cruz, California.Job Description & Requirements. Certificates required : BLS, ARRT-CT, ARRT-R, CA license, Fluoro.Jo...Show more
Last updated: 15 days ago • Promoted
QA Associate

QA Associate

Asahi Kasei • Fremont, CA, United States
Full-time
The Asahi Kasei Group operates with a commitment of creating for tomorrow.Our business sectors, Material, Homes, and Health Care, contribute to the development of society by anticipating the changi...Show more
Last updated: 16 days ago • Promoted
Associate Team Leader

Associate Team Leader

H&R Block • Mountain View, CA, US
Full-time +1
At H&R Block, we believe in the power of people helping people.Our defining Purpose is to provide help and inspire confidence in our clients, associates, and communities everywhere.We also believe ...Show more
Last updated: 24 days ago • Promoted
Strategy Associate

Strategy Associate

Oklo • Santa Clara, CA, US
Full-time
Thanks for your interest in Oklo! We are searching for a Strategy Associate to join our team.We're looking for a highly motivated Strategy Associate to support and accelerate the company's commerci...Show more
Last updated: 5 days ago • Promoted