Machine Learning Engineer

Scouto AI
San Francisco, CA, United States
19 days ago
Job type
  • Full-time
Job description

Overview

We are building a distributed LLM inference network that combines idle GPU capacity from around the world into a single cohesive plane of compute for running large language models like DeepSeek and Llama 4. At any given moment, we have over 5,000 GPUs and hundreds of terabytes of VRAM connected to the network. We are a small, well-funded team working on difficult, high-impact problems at the intersection of AI and distributed systems. We primarily work in-person from our office in downtown San Francisco.

Responsibilities

  • Design and implement optimization techniques to increase model throughput and reduce latency across our suite of models
  • Deploy and maintain large language models at scale in production environments
  • Deploy new models as they are released by frontier labs
  • Implement techniques like quantization, speculative decoding, and KV cache reuse
  • Contribute regularly to open source projects such as SGLang and vLLM
  • Deep dive into underlying codebases of TensorRT, PyTorch, TensorRT-LLM, vLLM, SGLang, CUDA, and other libraries to debug ML performance issues
  • Collaborate with the engineering team to bring new features and capabilities to our inference platform
  • Develop robust and scalable infrastructure for AI model serving
  • Create and maintain technical documentation for inference systems

Requirements

  • 3+ years of experience writing high-performance, production-quality code
  • Strong proficiency with Python and deep learning frameworks, particularly PyTorch
  • Demonstrated experience with LLM inference optimization techniques
  • Hands-on experience with SGLang and vLLM, with contributions to these projects strongly preferred
  • Familiarity with Docker and Kubernetes for containerized deployments
  • Experience with CUDA programming and GPU optimization
  • Strong understanding of distributed systems and scalability challenges
  • Proven track record of optimizing AI models for production environments

Nice to Have

  • Familiarity with TensorRT and TensorRT-LLM
  • Knowledge of vision models and multimodal AI systems
  • Experience implementing techniques like quantization and speculative decoding
  • Contributions to open source machine learning projects
  • Experience with large-scale distributed computing

Compensation

We offer competitive compensation, equity in a high-growth startup, and comprehensive benefits. The base salary range for this role is $180,000 - $250,000, plus competitive equity and benefits including:

  • Full healthcare coverage
  • Quarterly offsites
  • Flexible PTO

Skills: pytorch, gpu optimization, deep learning frameworks, sglang, vllm, cuda programming, machine learning, python, llm
