Talent.com

Performance engineer Jobs in Sunnyvale, CA

Last updated: 16 hours ago
  • Promoted
Inference Performance Engineer

Acceler8 Talent • Santa Clara, CA, United States
Full-time
Join Our Inference Performance Team: Optimizing Foundation Models On-Device. We’re building the future of on-device AI by making foundation models smarter, faster, and more efficient. Pinpoint perfor...
Last updated: 23 days ago
  • Promoted
PRINCIPAL PERFORMANCE ENGINEER, SYSTEMS

Inflection.xyz • Santa Clara, CA, United States
Full-time
WE ARE BUILDING THE WORLD'S FIRST CRYPTOGRAPHIC COMPUTER. Fabric believes hardware determines the boundaries of humanity's collective creativity and imagination. We are building hardware for the next g...
Last updated: 28 days ago
  • Promoted
GPGPU Performance tooling engineer

Rivos • Santa Clara, CA, US
Full-time
We are working on software to improve the Deep Learning ecosystem and help hardware engineers build great Deep Learning parallel systems. We are looking for a strong candidate with a background in w...
Last updated: 21 days ago
  • Promoted
SOFTWARE ENGINEER, RO PERFORMANCE

Waymo • Mountain View, CA, United States
Full-time
Waymo is an autonomous driving technology company with the mission to be the most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Wa...
Last updated: 23 days ago
Server System Performance Engineer

TikTok • San Jose, CA, United States
Full-time
Server platform team is responsible for architecting, designing, and building the best server and storage system to meet the requirements of high-performance, low cost and easy to operate. By joinin...
Last updated: 30+ days ago
#18425 - Senior Performance Engineer

QualiTest Group • Santa Clara, CA, United States
Full-time
Are you interested in working with the World's leading AI-powered Quality Engineering Company? Ready to advance your career, team up with global thought leaders across industries and make a differe...
Last updated: 25 days ago
Software Engineer, Performance

Nuro • Mountain View, CA, United States
Full-time
Nuro exists to better everyday life through robotics. Founded in 2016, Nuro has spent eight years developing autonomous driving (AD) technology and commercializing AD applications. The Nuro Driver™ i...
Last updated: 30+ days ago
Senior Performance Engineer

Broadcom • Palo Alto, CA, United States
Full-time
If you are a first-time user, please create your candidate login account before you apply for a job. If you already have a Candidate Account, please Sign-In before you apply. Broadcom's VMware Cloud ...
Last updated: 21 days ago
  • Promoted
Performance Modeling Engineer

Etched • Cupertino, CA, US
Full-time
Etched was founded on the belief that new hardware will be needed to run human- and superhuman-level artificial intelligence. Our ASIC systems deliver orders of magnitude higher performance than con...
Last updated: 30+ days ago
Performance Modeling Engineer

Advanced Micro Devices, Inc • Santa Clara, CA, United States
Full-time
WHAT YOU DO AT AMD CHANGES EVERYTHING. We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that ...
Last updated: 20 days ago
CPU Performance Engineer - Sr Engineer

Qualcomm • Santa Clara, CA, United States
Full-time
As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation t...
Last updated: 21 days ago
  • Promoted
  • New!
SOFTWARE ENGINEER, ML PERFORMANCE

OpenReq • Cupertino, CA, United States
Full-time
Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower laten...
Last updated: 16 hours ago
Sr Product Performance Engineer

Intuitive Surgical, Inc • Sunnyvale, CA, United States
Full-time
At Intuitive, we are united behind our mission: we believe that minimally invasive care is life-enhancing care. Through ingenuity and intelligent technology, we expand the potential of physicians to...
Last updated: 3 days ago
  • Promoted
RESEARCH ENGINEER - PERFORMANCE OPTIMIZATION

The Rundown AI, Inc. • Palo Alto, CA, United States
Full-time
We are looking for engineers with significant problem solving experience in PyTorch, CUDA and distributed systems. You will work with Research Scientists to build & train cutting edge foundation mod...
Last updated: 7 days ago
GPU Performance Engineer

SAMSUNG • 3655 N 1st St, San Jose, CA, USA
Full-time
Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy – the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and...
Last updated: 30+ days ago
HPC Performance Engineer

1000 KLA Corporation • Milpitas, CA
Full-time
Responsibilities for this exciting role will include: Design, implementation & support of high-performance compute clusters. Solid knowledge of HPC systems, including CPU/GPU architecture, scalable / ...
Last updated: 30+ days ago
Performance Engineer

Cerebras Systems • San Jose, CA, United States
Full-time
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programm...
Last updated: 13 days ago
Inference Performance Engineer

Acceler8 Talent • Santa Clara, CA, United States
23 days ago
Job type
  • Full-time
Job description

Join Our Inference Performance Team: Optimizing Foundation Models On-Device

We’re building the future of on-device AI by making foundation models smarter, faster, and more efficient. As part of the Inference Performance Team, you’ll work on challenging, high-impact projects to push the limits of what’s possible with foundation model inference.

What You’ll Do

  • Pinpoint performance bottlenecks and navigate quality-performance trade-offs in reference implementations (e.g., openai/whisper) and our optimized frameworks.
  • Design, prototype, and test performance improvements tailored to meet enterprise customer needs.
  • Drive innovation in our open-source inference frameworks by pitching and delivering new ideas.
  • Help expand support to new platforms—currently focused on Apple but actively growing into Android, Linux, and soon Windows.
  • Collaborate with ML Research Engineers to turn theoretical advances into practical, real-world optimizations.

Core Qualifications:

  • 3+ years of industry experience working on technically challenging problems.
  • Proficiency in Python or C/C++.
  • Experience with CUDA, OpenCL, or Metal.
  • A strong understanding of hardware acceleration (GPUs, NPUs, TPUs, CPUs).
  • Familiarity with modern ML frameworks like TensorFlow, PyTorch, Core ML, or ONNX.
  • Expertise in GPU kernel programming.
  • Contributions to major ML frameworks or open-source projects.

Why This Role?

You’ll play a critical role in advancing the performance of foundation models across platforms like Apple, Android, and Linux, shaping the future of efficient, scalable on-device AI.