Talent.com
Model Efficiency Engineer
No longer accepting applications
VirtualVocations • Santa Clara, California, United States
30+ days ago
Job type
  • Full-time
Job description

A company is looking for a Member of Technical Staff, Model Efficiency.

Key Responsibilities

Improve core performance metrics of ML systems by analyzing model execution and identifying bottlenecks

Collaborate with modeling and systems teams to experiment, measure, and implement optimizations that enhance inference efficiency

Develop advanced performance techniques, including GPU / CUDA optimizations and model execution strategies for large-scale architectures

Required Qualifications

5+ years of experience writing high-performance, production-quality code

Strong programming skills in C++ or Python (Rust / Go also welcome)

Experience with large language models and the LLM inference ecosystem

Ability to diagnose and resolve performance bottlenecks across the model execution stack

A strong bias for action with a focus on shipping quickly, measuring impact, and iterating

Related jobs
Mixed Signal Model Verification Engineer

Signature Consultants • USA, California, San Jose
Full-time
Mixed Signal Model Verification Engineer. Modeling: Extensive experience with. Verification Flow: Strong understanding of HDL/SPICE co-simulations. Circuit Expertise: Strong background in analog integ...
Last updated: 5 days ago
Staff Engineer, CPU Performance Modeling Engineer

Samsung Semiconductor • San Jose, California, USA
Full-time
To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technolog...
Last updated: 10 days ago • Promoted
Senior ML Accelerator Modeling Engineer

Waymo • Mountain View, CA, United States
Full-time
Waymo is an autonomous driving technology company with the mission to be the world’s most trusted driver. Since its start as the Google Self‑Driving Car Project in 2009, Waymo has focused on buildin...
Last updated: 10 days ago • Promoted
Staff CPU Performance Modeling Engineer

Samsung Semiconductor, Inc. • San Jose, CA, United States
Full-time
A leading technology firm in San Jose is seeking a highly skilled Staff Engineer specializing in CPU Performance Modeling. The role involves developing performance models, collaborating with archite...
Last updated: 9 days ago • Promoted
GNC Modeling and Simulation Engineer

Reliable Robotics • Mountain View, California, USA
Full-time +1
We're building safety-enhancing technology for aviation that will save lives. Automated aviation systems will enable a future where air transportation is safer, more convenient, and fundamentally trans...
Last updated: 11 days ago • Promoted
Senior OPC Modeling Engineer (Hybrid – ML / CUDA)

Conductor • San Jose, CA, United States
Full-time
Join a forward-thinking company as a Senior Engineer in the Optical Proximity Correction Lab. This role offers the chance to innovate in chip-making technology while collaborating with talented engi...
Last updated: 1 day ago • Promoted
Machine Learning Engineer, Staff - Model Factory

D-matrix • Santa Clara, California, United States
Full-time
AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. We value humility and believe in direct communic...
Last updated: 30+ days ago • Promoted
Applied ML Engineer

Kognitos • San Jose, California, United States
Full-time
Kognitos is at the forefront of revolutionizing the trillion-dollar hyper-automation market. Our mission is to redefine how software is built and maintained by leveraging cutting-edge multi-agent au...
Last updated: 30+ days ago • Promoted
Senior ML Model Engineer – Ads Optimization

Samsung Electronics America • Mountain View, CA, United States
Full-time
An advanced advertising technology company is seeking a Machine Learning Model Engineer to join their fast-growing team. This role offers the opportunity to work with cutting-edge technologies and u...
Last updated: 12 days ago • Promoted
Hardware Modeling Engineer

Cisco Systems, Inc. • San Jose, CA, United States
Full-time
The application window is expected to close on 12/23/2025. Job posting may be removed earlier if the position is filled or if a sufficient number of applications are received. This role requires the ...
Last updated: 30+ days ago • Promoted
Hardware Reliability Engineer

OSI Engineering • Cupertino, CA, US
Full-time
Job Summary: The Hardware Reliability Engineering team is seeking an initiative-taking individual to join our efforts in ensuring and improving the durability and relia...
Last updated: 25 days ago • Promoted
ML Engineer

Catalyst Labs • Cupertino, California, USA
Full-time
Is a rapidly growing Tier 1 VC-backed startup based in New York with $60 million in funding, revolutionizing how outside sales and service teams work. Their AI technology captures and analyzes real-w...
Last updated: 22 days ago • Promoted
Senior Foundation Model ML Engineer – Scalable Inference

Apple Inc. • Santa Clara, CA, United States
Full-time
A leading technology company in California is seeking a Machine Learning Engineer to optimize AI infrastructures and support cutting-edge models. Candidates should have over 5 years of relevant expe...
Last updated: 11 days ago • Promoted
Senior Software Engineer, World Model Systems for AVs Equity

NVIDIA Corporation • Santa Clara, CA, United States
Full-time
A leading technology firm in California seeks a Senior Software Engineer for the World Model Systems Engineering team focused on autonomous vehicles. The role involves developing perception systems,...
Last updated: 3 days ago • Promoted
Software Engineer DL & MLOps

ASML • San Jose, California, USA
Full-time
We are seeking a talented and innovative Deep Learning and MLOps engineer to become part of our dynamic this hybrid role you will blend expertise in both algorithms and infrastructure to create ad...
Last updated: 24 days ago • Promoted
Software Engineer L4 / L5, Model Serving Systems, Machine Learning Platform

Netflix • Los Gatos, California, United States
Full-time
Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and lan...
Last updated: 30+ days ago • Promoted
Architecture & Performance Modeling Engineer

Eridu Corporation • Saratoga, California, United States, 95070
Full-time
Eridu AI is a Silicon Valley-based hardware startup pioneering infrastructure solutions that accelerate training and inference for large-scale AI models. Today’s AI performance is frequently limited...
Last updated: 10 days ago
Mixed Signal Model Verification Engineer

Tech Providers Inc. • Santa Clara, CA, US
Full-time
Verify behavioral models written in SystemVerilog (logic + RNM) for mixed-signal IP. Write constraints and stimuli for CAD tools to perform schematic vs. Debug mismatches between behavioral models an...
Last updated: less than 1 hour ago • Promoted • New!