Talent.com
Software Engineer, ML Performance
Software Engineer, ML PerformanceOpenReq • Cupertino, CA, United States
No longer accepting applications
Software Engineer, ML Performance

Software Engineer, ML Performance

OpenReq • Cupertino, CA, United States
30+ days ago
Job type
  • Full-time
Job description

About Etched

Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning.

Software Engineer, ML Performance

Running millions of tokens per second for large models (e.g Llama-3-70B) means running into new performance bottlenecks. Even with hardware optimization for the operations that usually bottleneck us (attention, kernel parallelism), we encounter novel bottlenecks and must design our own solutions to solve them.

You will work closely with our hardware and software teams to identify and mitigate performance bottlenecks, enabling our chips to achieve unprecedented throughput and efficiency. Your work will involve a blend of low-level programming, performance profiling, and hands-on debugging, all aimed at maximizing the performance of our custom-built AI hardware.

You will also play a key role in developing tools and methodologies to help our customers understand the full potential of our hardware.

Representative projects :

  • Writing new kernels to improve throughput for LLM embedding
  • Improving on PagedAttention to prevent fragmentation of the KV cache in memory
  • Debugging hardware issues on a simulated or emulated chip
  • Profile transformers running on our hardware, and fix bottlenecks
  • Develop ways for customers to work with our chip and understand how their workloads will run on it.

You may be a good fit if you :

  • Have 5+ years of low-level programming experience
  • Have a strong understanding of data flow and execution paths within embedded systems
  • Pick up slack, even if it goes outside your job description
  • Are results-oriented, and bias towards shipping products
  • Understand SoC and computer system architecture, especially for CPU, interconnect, and memory subsystems
  • Want to learn more about machine learning research
  • We encourage you to apply even if you do not believe you meet every single qualification.

    Strong candidates may also have experience with :

  • GPU kernel profiling and low-level programming
  • Transformer optimizations, such as FlashAttention
  • Ongoing research in machine learning
  • Palladium emulation
  • How we're different :

    Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs.

    We are a fully in-person team in Cupertino, and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.

    Benefits :

  • Full medical, dental, and vision packages, with 100% of premium covered, 90% for dependents
  • Housing subsidy of $2,000 / month for those living within walking distance of the office
  • Daily lunch and dinner in our office
  • Relocation support for those moving to Cupertino
  • Create a job alert for this search

    Software Engineer • Cupertino, CA, United States

    Related jobs
    Sr. Software Engineer, ML

    Sr. Software Engineer, ML

    Relyance AI • San Francisco, CA, United States
    Full-time
    NLP for information extraction from legal documents, ML / NLP for information extraction from code and general ML in code analysis, and overall AI backend initiatives. You will partner with cross-func...Show more
    Last updated: 9 days ago • Promoted
    AI / ML System Software Engineer, Staff

    AI / ML System Software Engineer, Staff

    D-matrix • Santa Clara, California, United States
    Full-time
    AI to power the transformation of technology.We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. We value humility and believe in direct communic...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer - Model Performance

    Software Engineer - Model Performance

    AI Fund • San Francisco, CA, United States
    Full-time
    Software Engineer - Model Performance.Are you passionate about advancing the application of artificial intelligence? We are looking for a Software Engineer focused on ML performance to join our dyn...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, SystemML - Scaling / Performance

    Software Engineer, SystemML - Scaling / Performance

    META • Menlo Park, CA, United States
    Full-time
    In this role, you will be a member of the Network.AI Software team and part of the bigger DC networking organization.The team develops and owns the software stack around NCCL (NVIDIA Collective Com...Show more
    Last updated: 15 days ago • Promoted
    Senior Software Engineer - ML Infrastructure

    Senior Software Engineer - ML Infrastructure

    Plaid • San Francisco, CA, United States
    Full-time
    Senior Software Engineer - ML Infrastructure.Plaid is evolving into an AI-first company, where data and machine learning are the key enablers of smarter, more secure insight products built on top o...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer - ML Performance

    Software Engineer - ML Performance

    Baseten • San Ramon, California, United States
    Full-time
    We’re a growing team of builders backed by top-tier investors, including.ML teams at enterprises and category-defining AI-native companies like. Baseten to power their core production workloads with...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, ML

    Software Engineer, ML

    Heartflow • San Francisco, California, United States
    Full-time
    Heartflow is a medical technology company advancing the diagnosis and management of coronary artery disease, the #1 cause of death worldwide, using cutting-edge technology.The flagship product—an A...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer - ML / LLM Serving

    Senior Software Engineer - ML / LLM Serving

    Alldus • San Jose, CA, United States
    Full-time
    Senior Software Engineer - ML / LLM Serving.This range is provided by Alldus.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.Direct message the jo...Show more
    Last updated: 30+ days ago • Promoted
    Senior, Software Engineer - MLE

    Senior, Software Engineer - MLE

    Walmart • Sunnyvale, CA, United States
    Full-time +1
    We are seeking a highly motivated Machine Learning Engineer to join our Data Science team.In this role, you will not only design, develop, and deploy machine learning models at scale but also play ...Show more
    Last updated: 2 days ago • Promoted
    ML Perception Software Engineer

    ML Perception Software Engineer

    Applied Intuition • Sunnyvale, CA, United States
    Full-time
    Applied Intuition is the vehicle intelligence company that accelerates the global adoption of safe, AI-driven machines.Founded in 2017 and now valued at $15 billion following its recent Series F fu...Show more
    Last updated: 2 days ago • Promoted
    Lead ML Software Development Engineer

    Lead ML Software Development Engineer

    Adobe Inc. • San Jose, CA, United States
    Full-time
    Our Company Changing the world through digital experiences is what Adobe's all about.We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional ...Show more
    Last updated: 6 days ago • Promoted
    AI / ML System Software Engineer, Senior Staff

    AI / ML System Software Engineer, Senior Staff

    D-matrix • Santa Clara, California, United States
    Full-time
    AI to power the transformation of technology.We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. We value humility and believe in direct communic...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer - ML & Platform

    Software Engineer - ML & Platform

    Rhizome • San Francisco, CA, United States
    Full-time
    A changing climate demands Resilience by Design.We like solving hard problems with creativity, tenacity, and empathy for our customers. At the same time, we believe that being better stewards in our...Show more
    Last updated: 9 days ago • Promoted
    Software Engineer - ML Pricing

    Software Engineer - ML Pricing

    Pendo • San Francisco, CA, United States
    Full-time
    At Opendoor, pricing is at the core of our product — our models directly influence high-stakes decisions around real estate transactions across the country. We are looking for a mid-level.This is a ...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer - AI / ML

    Senior Software Engineer - AI / ML

    Microsoft Corporation • Mountain View, CA, United States
    Full-time
    OverviewJoin the PowerPoint team as we deliver modern, intelligent, and collaborative experiences that will delight millions of PowerPoint customers. Located in the heart of Silicon Valley in Mounta...Show more
    Last updated: 1 day ago • Promoted
    Senior Software Engineer, AI / ML

    Senior Software Engineer, AI / ML

    Striim • Palo Alto, CA, United States
    Full-time
    Striim, (pronounced stream with two is for integration and intelligence), is a unified data integration and streaming platform that connects clouds, data, and applications with unprecedented speed ...Show more
    Last updated: 1 day ago • Promoted
    Software Engineer - ML & Platform

    Software Engineer - ML & Platform

    Rhizome co • San Francisco, CA, United States
    Full-time
    A changing climate demands Resilience by Design.We like solving hard problems with creativity, tenacity, and empathy for our customers. At the same time, we believe that being better stewards in our...Show more
    Last updated: 4 days ago • Promoted
    Software Engineer - AI / ML

    Software Engineer - AI / ML

    Advantest America • San Jose, CA, United States
    Full-time
    Advantest America, a leading Semiconductor Test and Measurement Company, is seeking self-driven and creative engineers to build applications for new technologies related to machine learning and gen...Show more
    Last updated: 1 day ago • Promoted