Talent.com
Machine Learning Research Engineer

Machine Learning Research Engineer

EtchedCupertino, CA, US
30+ days ago
Job type
  • Full-time
Job description

About Etched

Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents. Etched Labs is the organization within Etched whose mission is to democratize generative AI, pushing the boundaries of what will be possible in a post-Sohu world.

Key responsibilities

  • Propose and conduct novel research to achieve results on Sohu that are unviable on GPUs
  • Translate core mathematical operations from the most popular Transformer-based models into maximally performant instruction sequences for Sohu
  • Develop deep architectural knowledge informing best-in-the-world software performance on Sohu HW, collaborating with HW architects and designers.
  • Co-design and finetune emerging model architectures for highest efficiency on Sohu
  • Guide and contribute to the Sohu software stack, performance characterization tools, and runtime abstractions by implementing frontier models using Python and Rust.

Representative projects

  • Propose and implement a novel test time compute algorithm that leverages Sohu's unique capabilities to unlock a product could never be achieved on a typical GPU
  • Implement diffusion models on Sohu to achieve GPU-impossible latencies that allow for real-time image generation
  • Optimize model instructions and scheduling algorithms to optimize for utilization, latency, throughput, and / or a mix of these metrics.
  • Implement model-specific inference-time acceleration techniques such as speculative decoding, tree search, KV cache sharing, priority scheduling, etc by interacting with the rest of the inference serving stack.
  • You may be a good fit if you have

  • An ML Research background with interests in HW co-design
  • Experience with Python, Pytorch, and / or JAX
  • Familiarity with transformer model architectures and / or inference serving stacks (vLLM, SGLang, etc.) and / or experience working in distributed inference / training environments
  • Experience working cross-functionally in diverse software and hardware organizations
  • Strong candidates may also have

  • ML Systems Research and HW Co-design backgrounds
  • Published inference-time compute research and / or efficient ML research
  • Experience with Rust
  • Familiarity with GPU kernels, the CUDA compilation stack and related tools, or other hardware accelerators
  • Benefits

  • Full medical, dental, and vision packages, with 100% of premium covered
  • Housing subsidy of $2,000 / month for those living within walking distance of the office
  • Daily lunch and dinner in our office
  • Relocation support for those moving to Cupertino
  • How we're different

    Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs.

    We are a fully in-person team in Cupertino, and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.

    Create a job alert for this search

    Machine Learning Engineer • Cupertino, CA, US

    Related jobs
    Machine Learning Engineer / Applied Research Scientist

    Machine Learning Engineer / Applied Research Scientist

    AdobeSan Jose
    Full-time
    Changing the world through digital experiences is what Adobe’s all about.We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital exper...Show moreLast updated: 29 days ago
    Machine Learning Engineer

    Machine Learning Engineer

    Automation Technologies LLCMountain View, CA, US
    Full-time
    Specialized Area : Machine learning.Job Title : Machine Learning Engineer.Employment Type : W-2 (Consultant must be on our company payroll. Machine Learning or Artificial Intelligence using TensorFlow ...Show moreLast updated: 19 days ago
    Machine Learning Research Scientist, Active Learning

    Machine Learning Research Scientist, Active Learning

    NuroMountain View, California
    Full-time
    Nuro exists to better everyday life through robotics.Founded in 2016, Nuro has spent eight years developing autonomous driving (AD) technology and commercializing AD applications.The Nuro Driver™ i...Show moreLast updated: 29 days ago
    Machine Learning Engineer - Machine Learning Infrastructure

    Machine Learning Engineer - Machine Learning Infrastructure

    ByteDanceSan Jose
    Full-time
    ResponsibilitiesAbout The Team The mission of our AML team is to push the next-generation AI infrastructure and recommendation platform for the ads ranking, search ranking, live & ecom ranking in o...Show moreLast updated: 29 days ago
    Machine Learning Engineer

    Machine Learning Engineer

    PaypalSan Jose, California, United States
    Full-time
    PayPal has been revolutionizing commerce globally for more than 25 years.Creating innovative experiences that make moving money, selling, and shopping simple, personalized, and secure, PayPal empow...Show moreLast updated: 9 days ago
    Machine Learning Engineer

    Machine Learning Engineer

    PayPalSan Jose, California, United States of America
    Full-time
    PayPal is committed to fair and equitable compensation practices.Actual Compensation is based on various factors including but not limited to work location, and relevant skills and experience.The t...Show moreLast updated: 30+ days ago
    Machine Learning Research Engineer

    Machine Learning Research Engineer

    Strativ GroupMenlo Park, CA, United States
    Full-time
    ML Engineer / ML Research Engineer.A VC-backed frontier AI Startup rebuilding the physical world with superintelligence are looking for either an ML Engineer or ML Research Engineer to bolster thei...Show moreLast updated: 26 days ago
    Staff Machine Learning Engineer - Research & Innovation

    Staff Machine Learning Engineer - Research & Innovation

    SAPPalo Alto, CA, US
    Full-time +1
    At SAP, we enable you to bring out your best.Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for...Show moreLast updated: 30+ days ago
    Machine Learning Engineer

    Machine Learning Engineer

    ZoomSan Jose CA
    Full-time
    Be part of a team whose focus is to solve cutting edge AI problems and implementation that constantly advance the state-of-the-art. You will be working across various AI / LLM areas including GenAI, a...Show moreLast updated: 29 days ago
    Machine Learning Engineer

    Machine Learning Engineer

    FortinetSan Jose, CA, United States
    Full-time
    Job DescriptionFortinet is looking for a Machine Learning Engineer to assist FortiCNAPP Team! Be a valuable member of the team that owns and operates high-availability, cross-cloud, large-volume, A...Show moreLast updated: 30+ days ago
    Staff Research Engineer, Speech Machine Learning (TTS)

    Staff Research Engineer, Speech Machine Learning (TTS)

    Samsung Research AmericaMountain View, California, United States
    Full-time
    Bixby is an intelligent personal assistant which is only available as a built-in application on Samsung flagship devices and wearables. This application uses Natural Language Understanding to perfor...Show moreLast updated: 6 days ago
    Machine Learning Research Engineer

    Machine Learning Research Engineer

    PearsonUnited States
    Full-time +1
    The Applied ML Research Team is focused on maintaining and extending Pearson's leadership in automated writing analysis for formative, interim, and summative assessment product markets.Together wit...Show moreLast updated: 30+ days ago
    Machine Learning Engineer

    Machine Learning Engineer

    Tiger AnalyticsUnited States
    Full-time
    Tiger Analytics is looking for experienced Machine Learning Engineers to join our fast-growing advanced analytics consulting firm. Our employees bring deep expertise in Machine Learning, Data Scienc...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    VirtualVocationsSanta Clara, California, United States
    Full-time
    A company is looking for an AI Developer.Key Responsibilities Develop and implement Deep Learning Pipelines using cutting-edge models Sustain the infrastructure for Deep Learning inference on Az...Show moreLast updated: 25 days ago