Talent.com
Machine Learning Co-Design Researcher Job at Etched in San Jose

Mediabistro
San Jose, CA, United States
30+ days ago
Job type
  • Full-time
Job description

About Etched: Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents.

Key Responsibilities:

  • Translate core mathematical operations from transformer models into optimized operation sequences for Sohu
  • Develop and leverage a deep understanding of Sohu to co-design both HW instructions and model architecture operations to maximize model performance
  • Implement high-performance software components for the Model Toolkit
  • Collaborate with hardware engineers to maximize chip utilization and minimize latency
  • Implement efficient batching strategies and execution plans for inference workloads
  • Design and implement cutting-edge inference-time compute scaling methods
  • Alter and fine-tune model architectures or inference-time compute algorithms
  • Contribute to the evolution of our system architecture and programming model

Representative projects:

  • Optimize operation sequences to maximize utilization of Sohu's computational resources for specific transformer architectures such as Llama 4
  • Research and implement efficient memory management for KV cache sharing and prefix optimization
  • Develop algorithms for continuous batching and batch interleaving to improve throughput and/or latency
  • Research and implement model-specific inference-time acceleration algorithms such as speculative decoding, tree search, KV cache sharing, priority scheduling, etc., by interacting with the rest of the inference serving stack
  • Research and implement structured decoding and novel sampling algorithms for reasoning models

You may be a good fit if you have:

  • Co-design expertise across both SW and HW domains
  • Strong software engineering skills with systems programming experience
  • Deep knowledge of transformer model architectures and/or inference serving stacks (vLLM, SGLang, etc.)
  • Strong mathematical skills, esp. in linear algebra
  • Ability to reason about performance bottlenecks and optimization opportunities
  • Experience working cross-functionally in diverse software and hardware organizations

Strong candidates may also have:

  • Experience with hardware accelerators, ASICs, or FPGAs
  • Experience with Rust programming language
  • Deep expertise in ML systems engineering and hardware/software co-design with demonstrated impact (contributions to open-source projects or published papers)
  • Track record of optimizing large co-designed SW/HW systems

Benefits:

  • Full medical, dental, and vision packages, with generous premium coverage
  • Housing subsidy of $2,000/month for those living within walking distance of the office
  • Daily lunch and dinner in our office
  • Relocation support for those moving to West San Jose
  • Compensation range: $150,000 - $275,000

How We’re Different: Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs.

We are a fully in-person team in West San Jose, and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.
