Location: San Jose, CA
Job Description: AI/ML Model Deployment Specialist / Production MLOps Engineer
Overview
We are seeking a highly skilled and specialized AI/ML Model Deployment Specialist / Production MLOps Engineer. The core focus of this role is to take existing machine learning models and optimize their deployment and serving infrastructure for high-performance, production-ready inference, with a strong emphasis on state-of-the-art model serving technologies.
Responsibilities
Model-Specific Optimization:
Analyze and understand the underlying logic and dependencies of various AI/ML models (primarily PyTorch and TensorFlow) to identify bottlenecks in the inference pipeline.
High-Performance Serving Implementation:
Design, implement, and manage high-performance inference serving solutions using specialized inference servers (e.g., vLLM) to achieve low latency and high throughput (a minimal sketch follows this list).
GPU Utilization Optimization:
Optimize model serving configurations specifically for GPU hardware to maximize resource efficiency and performance metrics in a production environment.
Containerization for Deployment:
Create minimal, secure, and production-ready Docker images for streamlined deployment of optimized models and inference servers across various environments.
Collaboration:
Work closely with core engineering and data science teams to ensure a smooth transition from model development to high-scale production deployment.
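For illustration, a minimal sketch of the serving work described above is shown below, using vLLM's offline engine API. The model name and sampling values are assumptions for the example, not project specifics.

    # Minimal sketch, assuming vLLM is installed and a GPU is available.
    # Model name and sampling values are illustrative assumptions.
    from vllm import LLM, SamplingParams

    # The engine handles continuous batching and PagedAttention internally;
    # callers simply submit prompts.
    llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct")  # hypothetical model choice

    params = SamplingParams(temperature=0.7, top_p=0.95, max_tokens=128)
    prompts = ["Summarize the benefits of continuous batching in one sentence."]

    for output in llm.generate(prompts, params):
        print(output.outputs[0].text)

In production, the same engine is typically exposed through vLLM's OpenAI-compatible HTTP server rather than called in-process.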
Required Skillsets
AI/ML Domain Expertise
Deep understanding of the AI/ML domain; the core effort of this role centers on model performance and serving rather than general infrastructure.
ML Frameworks
Expertise in PyTorch and TensorFlow:
Proven ability to work with and troubleshoot model-specific dependencies, logic, and graph structures within these major frameworks.
Inference Optimization
Production Inference Experience:
Expertise in designing and implementing high-throughput, low-latency model serving solutions.
Specialized Inference Servers:
Mandatory experience with high-performance inference servers, specifically vLLM or similar dedicated LLM serving frameworks.
GPU Optimization:
Demonstrated ability to optimize model serving parameters and infrastructure to maximize performance on NVIDIA or equivalent GPU hardware (see the sketch below).
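As a concrete, purely illustrative example of this kind of tuning, the sketch below shows vLLM engine arguments that are commonly adjusted against the available GPU hardware; every value is an assumption for the example, not a recommended setting.

    # Minimal sketch, assuming a two-GPU node; all values are illustrative.
    from vllm import LLM

    llm = LLM(
        model="meta-llama/Llama-3.1-8B-Instruct",  # hypothetical model
        tensor_parallel_size=2,       # shard weights across two GPUs
        gpu_memory_utilization=0.90,  # fraction of VRAM handed to the engine/KV cache
        max_num_seqs=256,             # cap on concurrently batched sequences
        dtype="bfloat16",             # reduced precision for higher throughput
    )

Settings like these trade memory headroom against batch size and latency, which is exactly the tuning loop this requirement describes.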
Deployment and Infrastructure
Containerization (Docker):
Proficiency in creating minimal, secure, and efficient Docker images for model and server deployment (see the Dockerfile sketch after this list).
Infrastructure Knowledge (Helpful but Secondary):
General knowledge of cloud platforms (AWS, GCP, Azure) and Kubernetes/orchestration is beneficial, but the primary focus remains on model serving and optimization.
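To illustrate the containerization requirement, a minimal Dockerfile sketch for a vLLM server follows. The base image tag, model name, and port are assumptions; a real image would pin versions and likely bake in or mount model weights.

    # Minimal sketch; base image, model, and port are illustrative assumptions.
    FROM nvidia/cuda:12.1.1-runtime-ubuntu22.04

    # Keep the image small: runtime-only CUDA base, no build toolchain, no pip cache.
    RUN apt-get update && apt-get install -y --no-install-recommends python3-pip \
        && rm -rf /var/lib/apt/lists/*
    RUN pip3 install --no-cache-dir vllm

    # Run as an unprivileged user to reduce the attack surface.
    RUN useradd --create-home serving
    USER serving

    EXPOSE 8000
    ENTRYPOINT ["python3", "-m", "vllm.entrypoints.openai.api_server", \
                "--model", "meta-llama/Llama-3.1-8B-Instruct", "--port", "8000"]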
Qualifications
Bachelor's or Master's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
8+ years of experience in MLOps, Machine Learning Engineering, or a specialized Inference Optimization role.
A portfolio or project experience demonstrating successful deployment of high-performance, containerized ML models to production scale.