Overview
We are building a distributed LLM inference network that combines idle GPU capacity from around the world into a single cohesive plane of compute for running large language models like DeepSeek and Llama 4. At any given moment, we have over 5,000 GPUs and hundreds of terabytes of VRAM connected to the network. We are a small, well-funded team working on difficult, high-impact problems at the intersection of AI and distributed systems. We primarily work in-person from our office in downtown San Francisco.
Responsibilities
- Design and implement optimization techniques to increase model throughput and reduce latency across our suite of models
- Deploy and maintain large language models at scale in production environments
- Deploy new models as they are released by frontier labs
- Implement techniques like quantization, speculative decoding, and KV cache reuse (see the sketch after this list)
- Contribute regularly to open source projects such as SGLang and vLLM
- Dive deep into the underlying codebases of PyTorch, TensorRT, TensorRT-LLM, vLLM, SGLang, CUDA, and other libraries to debug ML performance issues
- Collaborate with the engineering team to bring new features and capabilities to our inference platform
- Develop robust and scalable infrastructure for AI model serving
- Create and maintain technical documentation for inference systems
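For a sense of the kind of work involved, here is a minimal sketch of one of the techniques named above: post-training dynamic quantization in PyTorch. The two-layer MLP, its dimensions, and the input shape are hypothetical stand-ins for a transformer feed-forward block, not our production stack.

```python
# Illustrative sketch only: post-training dynamic quantization in PyTorch.
# The model and layer sizes below are hypothetical placeholders.
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Linear(4096, 11008),  # hypothetical hidden sizes
    nn.GELU(),
    nn.Linear(11008, 4096),
)
model.eval()

# Convert Linear weights to int8 at load time; activations are quantized
# dynamically per batch, trading a little accuracy for memory and speed.
quantized = torch.ao.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

with torch.no_grad():
    out = quantized(torch.randn(1, 4096))
print(out.shape)  # torch.Size([1, 4096])
```

In production LLM serving, the same idea typically shows up as weight-only INT8/FP8 or AWQ/GPTQ-style schemes inside engines like vLLM and TensorRT-LLM rather than this eager-mode API.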
Requirements
- 3+ years of experience writing high-performance, production-quality code
- Strong proficiency with Python and deep learning frameworks, particularly PyTorch
- Demonstrated experience with LLM inference optimization techniques
- Hands-on experience with SGLang and vLLM, with contributions to these projects strongly preferred
- Familiarity with Docker and Kubernetes for containerized deployments
- Experience with CUDA programming and GPU optimization
- Strong understanding of distributed systems and scalability challenges
- Proven track record of optimizing AI models for production environments
Nice to Have
- Familiarity with TensorRT and TensorRT-LLM
- Knowledge of vision models and multimodal AI systems
- Experience implementing techniques like quantization and speculative decoding
- Contributions to open source machine learning projects
- Experience with large-scale distributed computing
Compensation
We offer competitive compensation and equity in a high-growth startup. The base salary range for this role is $180,000 - $250,000, plus equity and benefits including:
- Full healthcare coverage
- Quarterly offsites
- Flexible PTO
Skills: PyTorch, GPU optimization, deep learning frameworks, SGLang, vLLM, CUDA programming, machine learning, Python, LLM