Talent.com
Associate Tech Specialist

Associate Tech Specialist

Yantran LLCSan Jose, CA, United States
30+ days ago
Job type
  • Full-time
Job description

Location : San Jose, CA

Job Description : AI / ML Model Deployment Specialist / Production MLOps Engineer

Overview

We are seeking a highly skilled and specialized AI / ML Model Deployment Specialist / Production MLOps Engineer. The core focus of this role is to take existing machine learning models and optimize their deployment and serving infrastructure for high-performance, production-ready inference, with a strong emphasis on leveraging state-of-the-art AI model serving technologies.

Responsibilities

Model-Specific Optimization :

Analyze and understand the underlying logic and dependencies of various AI / ML models (primarily using PyTorch and TensorFlow) to identify bottlenecks in the inference pipeline.

High-Performance Serving Implementation :

Design, implement, and manage high-performance inference serving solutions utilizing specialized inference servers (e.g., vLLM) to achieve low latency and high throughput.

GPU Utilization Optimization :

Optimize model serving configurations specifically for GPU hardware to maximize resource efficiency and performance metrics in a production environment.

Containerization for Deployment :

Create minimal, secure, and production-ready Docker images for streamlined deployment of optimized models and inference servers across various environments.

Collaboration :

Work closely with core engineering and data science teams to ensure a smooth transition from model development to high-scale production deployment.

Required Skillsets

AI / ML Domain Expertise

Deep understanding of the AI / ML domain, with the core effort centered around model performance and serving, rather than general infrastructure.

ML Frameworks

Expertise in PyTorch and TensorFlow :

Proven ability to work with and troubleshoot model-specific dependencies, logic, and graph structures within these major frameworks.

Inference Optimization

Production Inference Experience :

Expertise in designing and implementing high-throughput, low-latency model serving solutions.

Specialized Inference Servers :

Mandatory experience with high-performance inference servers, specifically including

vLLM , or similar dedicated LLM serving frameworks.

GPU Optimization :

Demonstrated ability to optimize model serving parameters and infrastructure to maximize performance on NVIDIA or equivalent GPU hardware.

Deployment and Infrastructure

Containerization (Docker) :

Proficiency in creating minimal, secure, and efficient Docker images for model and server deployment.

Infrastructure Knowledge (Helpful, but Secondary) :

General knowledge of cloud platforms (AWS, GCP, Azure) and Kubernetes / orchestration is beneficial but the primary focus remains on model serving and optimization.

Qualifications

Bachelor's or Master's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.

8+ years of experience in MLOps, Machine Learning Engineering, or a specialized Inference Optimization role.

A portfolio or project experience demonstrating successful deployment of high-performance, containerized ML models to production scale.

Create a job alert for this search

Associate Specialist • San Jose, CA, United States

Related jobs
  • Promoted
Associate Development Specialist

Associate Development Specialist

Cooley LLPPalo Alto, CA, United States
Full-time
Associate Development Specialist.Cooley is seeking an Associate Development Specialist to join the Associate Development team in support of the associate performance management process.The Associat...Show moreLast updated: 30+ days ago
  • Promoted
PE BD Technology Associate - San Francisco, CA or Austin, TX

PE BD Technology Associate - San Francisco, CA or Austin, TX

Wilcox BerrySan Francisco, CA, United States
Full-time
Private Equity firm specializing in B2B and SAAS technology investments is looking for a.The ideal candidate will have had some business development in the VC or PE space and some technology exposu...Show moreLast updated: 30+ days ago
  • Promoted
Growth & Strategy Associate

Growth & Strategy Associate

Resolve.AiSan Francisco, CA, United States
Full-time
At Resolve, we’re building Agentic AI that empowers software engineers by automating production engineering and SRE workflows. Our models deeply understand production systems — from code to database...Show moreLast updated: 13 days ago
  • Promoted
  • New!
Technical Support Associate

Technical Support Associate

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 22 hours ago
  • Promoted
Associate Product Manager - Key Components

Associate Product Manager - Key Components

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 30+ days ago
  • Promoted
Technology Transactions Associate

Technology Transactions Associate

Vanguard-IPSan Francisco, CA, United States
Full-time
AmLaw 50 Firm with Cravath level compensation.Outstanding formal training and lateral integration programs.Firm is active with representing both complainants and respondents at the ITC.Past winner ...Show moreLast updated: 30+ days ago
  • Promoted
Technical Specialist

Technical Specialist

AppleCupertino, CA, United States
Full-time
Apple Retail is where the best of Apple comes together.We bring our expertise to help people do what they love, delivering an only-at-Apple experience. At Apple, we believe inclusion is a shared res...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
Strategy & Analytics, Product Senior Associate

Strategy & Analytics, Product Senior Associate

Faire IncSan Francisco, CA, United States
Full-time
Faire is an online wholesale marketplace built on the belief that the future is local - independent retailers around the globe are doing more revenue than Walmart and Amazon combined, but individua...Show moreLast updated: 12 hours ago
  • Promoted
  • New!
Business Technology Solutions Associate Consultant

Business Technology Solutions Associate Consultant

ZSSouth San Francisco, CA, United States
Full-time
As a management consulting and technology firm focused on improving life and how we live it, we transform ideas into impact by. Here you'll work side-by-side with a powerful collective of thinkers a...Show moreLast updated: 12 hours ago
  • Promoted
Associate / Senior Associate, Investment (Tech & Consumer, US) (To be based in San Francisco)

Associate / Senior Associate, Investment (Tech & Consumer, US) (To be based in San Francisco)

Temasek HoldingsSan Francisco, CA, United States
Permanent
Press Tab to Move to Skip to Content Link.Select how often (in days) to receive an alert : .Associate / Senior Associate, Investment (Tech & Consumer, US) (To be based in San Francisco).Temasek is a gr...Show moreLast updated: 30+ days ago
  • Promoted
New Business Strategy Associate

New Business Strategy Associate

AirOpsSan Francisco, CA, United States
Full-time
Today thousands of leading brands and agencies use.We’re building the platform and profession that will empower a million marketers to become modern leaders — not spectators — as AI reshapes how br...Show moreLast updated: 11 days ago
  • Promoted
(Senior) Strategy Associate

(Senior) Strategy Associate

Crypto.comSan Francisco, CA, US
Full-time
Work on high-impact and diverse projects.You will have the opportunity to advise management on emerging industry trends and key business decisions, optimize key processes, and orchestrate key initi...Show moreLast updated: 25 days ago
  • Promoted
  • New!
Strategic Project Associate

Strategic Project Associate

Mercor IncSan Francisco, CA, United States
Full-time
Mercor is at the intersection of labor markets and AI research.We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development.Our vast talent network ...Show moreLast updated: 11 hours ago
  • Promoted
Associate Platform Specialist, Apple Ads

Associate Platform Specialist, Apple Ads

Apple Inc.San Francisco, CA, United States
Full-time
Associate Platform Specialist, Apple Ads.At Apple, we believe in the power of technology to enrich people's lives! Everything we build is designed—first and foremost—with helping people in mind.We ...Show moreLast updated: 30+ days ago
  • Promoted
Associate Product Manager

Associate Product Manager

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 3 days ago
  • Promoted
Associate - PhD

Associate - PhD

Cornerstone ResearchMenlo Park, CA, United States
Full-time
US-CA-San Francisco and Silicon Valley | US-MA-Boston | BE-Brussels | US-IL-Chicago | UK-London | US-CA-Los Angeles | US-NY-New York | US-DC-Washington. Cornerstone Research provides economic and fi...Show moreLast updated: 30+ days ago
  • Promoted
Associate Specialist

Associate Specialist

University of California - San FranciscoSan Francisco, CA, United States
Full-time
Wednesday, Oct 29, 2025 at 11 : 59pm (Pacific Time).Applications received after this date will be reviewed by the search committee if the position has not yet been filled. Wednesday, Apr 14, 2027 at 1...Show moreLast updated: 30+ days ago
  • Promoted
Technology Collaboration Services Specialist

Technology Collaboration Services Specialist

Cooley LLPPalo Alto, CA, United States
Full-time
Technology Collaboration Services Specialist.Cooley is seeking a Technology Collaboration Services Specialist to join the Platforms & Applications team. Cooley Technology embraces a culture of custo...Show moreLast updated: 30+ days ago