Talent.com
Machine Learning - Compiler Engineer II, AWS Neuron, Annapurna Labs

Machine Learning - Compiler Engineer II, AWS Neuron, Annapurna Labs

AmazonCupertino, CA, United States
30+ days ago
Job type
  • Full-time
Job description

Job ID : 2933964 | Amazon Web Services, Inc. - A97

Do you want to be part of AI revolution? At AWS our vision is to make deep learning pervasive for everyday developers and to democratize access to AI hardware and software infrastructure. In order to deliver on that vision, we’ve created innovative software and hardware solutions that make it possible. AWS Neuron is the SDK that optimizes the performance of complex ML models executed on AWS Inferentia and Trainium, our custom chips designed to accelerate deep-learning workloads.

This role is for a software engineer in the Compiler team for AWS Neuron. As part of this role, you will be responsible for building next generation Neuron compiler which transforms ML models written in ML frameworks (e.g, PyTorch, TensorFlow, and JAX) to be deployed on AWS Inferentia and Trainium based servers in the Amazon cloud. You will be responsible for solving hard compiler optimization problems to achieve optimum performance for variety of ML model families including massive scale large language models like Llama, Deepseek, and beyond as well as stable diffusion, vision transformers and multi-model models. You will be required to understand how these models work inside-out to make informed decisions on how to best coax the compiler to generate optimal implementation instruction. You will leverage your technical communications skill to partner with internal and external customers / stakeholders and will be involved in pre-silicon design, bringing new products / features to market, ultimately, making Neuron compiler highly performant and easy-to-use.

Experience in object-oriented languages like C++ / Java is a must, experience with compilers or building ML models using ML frameworks on accelerators (e.g., GPUs) is preferred but not required. Experience with technologies like OpenXLA, StableHLO, MLIR will be added bonus!

Key job responsibilities

You will design, implement, test, deploy and maintain innovative software solutions to transform Neuron compiler’s performance, stability and user-interface. You will work side by side with chip architects, runtime / OS engineers, scientists and ML Apps teams to seamlessly deploy state of the art ML models from our customers on AWS accelerators with optimal cost / performance benefits. You will have opportunity to work with open-source software (e.g., StableHLO, OpenXLA, MLIR) to pioneer optimizing advanced ML workloads on AWS software and hardware. You will also work on building innovative features that will deliver best possible experiences for our customers – developers across the globe.

A day in the life

As you design and code solutions to help our team drive efficiencies in compiler architecture, you’ll create compiler optimization and verification passes, build features surface features and peculiarities of AWS accelerators to developers, implement tools to analyze numerical errors, and resolve the root cause of compiler defects. You’ll also participate in design discussions, code review, and communicate with internal (other Neuron SDK and Amazon wide teams) and external stakeholders (open-source communities). Lastly, work in a startup-like development environment, where you’re always working on the most important stuff.

About the team

Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.

BASIC QUALIFICATIONS

  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • Experience programming with at least one software programming language

PREFERRED QUALIFICATIONS

  • Master's degree or PhD in Computer Science, or a related technical field.
  • 3+ years of experience writing production grade code in object-oriented languages such as C++ / Java.
  • Experience in compiler design for CPU / GPU / Vector engines / ML-accelerators.
  • Experience with OpenSource compiler toolset like LLVM / MLIR.
  • Experience with the following technologies : PyTorch, OpenXLA, StableHLO, JAX, TVM, deep learning models, and algorithms.
  • Experience with modern build systems like Bazel / CMake.
  • Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

    #J-18808-Ljbffr

    Create a job alert for this search

    Machine Learning Engineer • Cupertino, CA, United States

    Related jobs
    • Promoted
    AI / ML Sr. Compiler Development Engineer

    AI / ML Sr. Compiler Development Engineer

    Advanced Micro Devices, Inc.San Jose, CA, United States
    Full-time
    WHAT YOU DO AT AMD CHANGES EVERYTHING.At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded syst...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning - Compiler Engineer II, Annapurna Labs

    Machine Learning - Compiler Engineer II, Annapurna Labs

    Amazon Web Services (AWS)Cupertino, CA, United States
    Full-time
    The Product : AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best-in-class ...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer II, Special Projects

    Machine Learning Engineer II, Special Projects

    AmazonSan Francisco, CA, United States
    Full-time
    Innovators wanted! Are you an entrepreneur? A builder? A dreamer? This role is part of an Amazon Special Projects team that takes the company’s Think Big leadership principle to the limits.If you’r...Show moreLast updated: 10 days ago
    • Promoted
    Machine Learning - Compiler Engineer II, AWS Neuron, Annapurna Labs

    Machine Learning - Compiler Engineer II, AWS Neuron, Annapurna Labs

    Amazon Web Services (AWS)Cupertino, CA, United States
    Full-time
    Machine Learning - Compiler Engineer II, AWS Neuron, Annapurna Labs.Do you want to be part of AI revolution? At AWS our vision is to make deep learning pervasive for everyday developers and to demo...Show moreLast updated: 30+ days ago
    • Promoted
    Software Engineer- AI / ML, AWS Neuron Distributed Training

    Software Engineer- AI / ML, AWS Neuron Distributed Training

    AmazonCupertino, CA, United States
    Full-time
    Annapurna Labs designs silicon and software that accelerates innovation.Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago-even yesterday.Ou...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    ApiphanySan Francisco, CA, United States
    Full-time
    Apiphany is a pioneering foundational AI company for physical product development.We empower global innovators in automotive, aerospace, medtech, and energy to transform mountains of unstructured t...Show moreLast updated: 17 days ago
    • Promoted
    Software Engineer III - Machine Learning

    Software Engineer III - Machine Learning

    WalmartSunnyvale, CA, United States
    Full-time +1
    We are seeking a Software Engineer III to contribute to the development of intelligent ranking systems that power personalized shopping experiences across our e-commerce platform.This role involves...Show moreLast updated: 30+ days ago
    • Promoted
    Software Engineer- AI / ML, AWS Neuron

    Software Engineer- AI / ML, AWS Neuron

    AmazonCupertino, CA, United States
    Full-time
    AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine.Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Ap...Show moreLast updated: 30+ days ago
    • Promoted
    Sr. Software Engineer- AI / ML, AWS Neuron Distributed Training

    Sr. Software Engineer- AI / ML, AWS Neuron Distributed Training

    AmazonCupertino, CA, United States
    Full-time
    Annapurna Labs designs silicon and software that accelerates innovation.Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago-even yesterday.Ou...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer 5.5

    Machine Learning Engineer 5.5

    Adobe Systems GmbHSan Jose, CA, United States
    Full-time
    Changing the world through digital experiences is what Adobe’s all about.We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital exper...Show moreLast updated: 25 days ago
    • Promoted
    Machine Learning Engineer (I, II, or Sr.)

    Machine Learning Engineer (I, II, or Sr.)

    ikigailabs.ioSan Mateo, CA, United States
    Full-time
    The Ikigai platform unlocks the power of generative AI for tabular data.We enable business users to connect disparate data, leverage no-code AI / ML, and build enterprise-wide AI apps in just a few c...Show moreLast updated: 30+ days ago
    • Promoted
    AI / ML Compiler Development Engineer

    AI / ML Compiler Development Engineer

    Advanced Micro Devices, Inc.San Jose, CA, United States
    Full-time
    WHAT YOU DO AT AMD CHANGES EVERYTHING.At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded syst...Show moreLast updated: 30+ days ago
    • Promoted
    Sr. Component Engineer, Annapurna Labs, Machine Learning Hardware

    Sr. Component Engineer, Annapurna Labs, Machine Learning Hardware

    Amazon Web Services (AWS)Cupertino, CA, United States
    Full-time
    Component Engineer, Annapurna Labs, Machine Learning Hardware role at Amazon Web Services (AWS).Annapurna Labs designs silicon and software that accelerates innovation, delivering custom chips, acc...Show moreLast updated: 4 days ago
    • Promoted
    Machine Learning Engineer, Gen AI Innovation Center, AWS

    Machine Learning Engineer, Gen AI Innovation Center, AWS

    AmazonSan Francisco, CA, United States
    Full-time
    The Generative AI Innovation Center at AWS empowers customers to harness state of the art AI technologies for transformative business opportunities. Our multidisciplinary team of strategists, scient...Show moreLast updated: 2 days ago
    • Promoted
    ASIC Engineer II, Cloud-Scale Machine Learning Acceleration team

    ASIC Engineer II, Cloud-Scale Machine Learning Acceleration team

    Amazon Web Services (AWS)Cupertino, CA, United States
    Full-time
    ASIC Engineer II, Cloud-Scale Machine Learning Acceleration team.Utility Computing (UC) provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Am...Show moreLast updated: 4 days ago
    • Promoted
    • New!
    ASIC Engineer II, Cloud-Scale Machine Learning Acceleration team

    ASIC Engineer II, Cloud-Scale Machine Learning Acceleration team

    AmazonCupertino, CA, United States
    Full-time
    ASIC Engineer II, Cloud-Scale Machine Learning Acceleration team.AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) an...Show moreLast updated: 13 hours ago
    • Promoted
    Machine Learning Compiler Engineer, Annapurna Labs

    Machine Learning Compiler Engineer, Annapurna Labs

    Amazon Web Services (AWS)Cupertino, CA, United States
    Full-time
    Machine Learning Compiler Engineer, Annapurna Labs.Machine Learning Compiler Engineer, Annapurna Labs.Get AI-powered advice on this job and more exclusive features. The AWS Neuron Compiler team is a...Show moreLast updated: 30+ days ago
    • Promoted
    AI Compiler Engineer

    AI Compiler Engineer

    IntelSan Jose, CA, United States
    Full-time
    We are seeking a highly skilled Compiler Engineer with experience in MLIR (Multi-Level Intermediate Representation) and performance-critical code generation. The ideal candidate will focus on design...Show moreLast updated: 30+ days ago