Talent.com
Sr SDE, AGI Inference- GenAI
Sr SDE, AGI Inference- GenAIAmazon • Sunnyvale, CA, United States
Sr SDE, AGI Inference- GenAI

Sr SDE, AGI Inference- GenAI

Amazon • Sunnyvale, CA, United States
22 hours ago
Job type
  • Full-time
Job description

The Sensory Inference team at AGI is a group of innovative developers working on ground-breaking multi-modal inference solutions that revolutionize how AI systems perceive and interact with the world. We push the limits of inference performance to provide the best possible experience for our users across a wide range of applications and devices. We are looking for talented, passionate, and dedicated Inference Engineers to join our team and build innovative, mission-critical, high-volume production systems that will shape the future of AI. You will have an enormous opportunity to make an impact on the design, architecture, and implementation of novel technologies used every day, potentially by people you know. This role offers the exciting chance to work in a highly technical domain at the boundary between fundamental AI research and production engineering such as Quantization, Speculative Decoding, and Long Context for inference efficiency.

Key job responsibilities

  • Develop high-performance inference software for a diverse set of neural models, typically in C / C++
  • Design, prototype, and evaluate new inference engines and optimization techniques
  • Participate in deep-dive analysis and profiling of production code
  • Optimize inference performance across various platforms (on-device, cloud-based CPU, GPU, proprietary ASICs)
  • Collaborate closely with research scientists to bring next-generation neural models to life
  • Partner with internal and external hardware teams to maximize platform utilization
  • Work in an Agile environment to deliver high-quality software against tight schedules
  • Hold a high bar for technical excellence within the team and across the organization

BASIC QUALIFICATIONS

  • 5+ years of non-internship professional software development experience
  • 5+ years of programming with at least one software programming language experience
  • 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • Experience as a mentor, tech lead or leading an engineering team
  • Experience with inference frameworks such as PyTorch, TensorFlow, ONNXRuntime, TensorRT, LLaMA.cpp, etc.
  • PREFERRED QUALIFICATIONS

  • 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
  • Experience with inference frameworks such as PyTorch, TensorFlow, ONNXRuntime, TensorRT, LLaMA.cpp
  • Proficiency in performance optimization for CPU, GPU, or AI hardware
  • Proficiency in kernel programming for accelerated hardware using programming models such as (but not limited to) CUDA, OpenMP, OpenCL, Vulkan, and Metal
  • Experience with latency-sensitive optimizations and real-time inference
  • Knowledge of model compression techniques (quantization, pruning, distillation, etc.)
  • Experience with LLM efficiency techniques like speculative decoding and long context
  • Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

    Los Angeles County applicants : Job duties for this position include : work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company's reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

    Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information. If the country / region you're applying in isn't listed, please contact your Recruiting Partner.

    Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $151,300 / year in our lowest geographic market up to $261,500 / year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and / or other benefits. For more information, please visit This position will remain posted until filled. Applicants should apply via our internal or external career site.

    Create a job alert for this search

    Sr • Sunnyvale, CA, United States

    Related jobs
    AGI Sr Inference Software Development Engineering, AGI Inference

    AGI Sr Inference Software Development Engineering, AGI Inference

    Amazon • Sunnyvale, CA, United States
    Full-time
    The Sensory Inference team at AGI is a group of innovative developers working on ground-breaking multi-modal inference solutions that revolutionize how AI systems perceive and interact with the wor...Show more
    Last updated: 30+ days ago • Promoted
    OB / GYN

    OB / GYN

    Kaiser Permanente - The Permanente Medical Group, Inc. -Northern California • Gilroy, US
    Full-time
    OB / GYN physician to join our team in.The Permanente Medical Group, Inc.Northern and Central California and an 80-year tradition of providing quality medical care. Board Certification or Eligibility....Show more
    Last updated: 30+ days ago • Promoted
    Test Products from Home – $25-$45 / hr + Freebies

    Test Products from Home – $25-$45 / hr + Freebies

    OCPA • Davenport, California, us
    Part-time +1
    Product Testers are wanted to work from home nationwide in the US to fulfill upcoming contracts with national and international companies. We guarantee 15-25 hours per week with an hourly pay of bet...Show more
    Last updated: 30+ days ago • Promoted
    Clinical Laboratory Scientist Generalist - Microbiology

    Clinical Laboratory Scientist Generalist - Microbiology

    Sutter Health • Livermore, CA, United States
    Full-time
    We are so glad you are interested in joining Sutter Health!.Executes procedures in assigned areas of the Laboratory to deliver accurate results in a timely manner. Maintains competence to perform pr...Show more
    Last updated: 14 days ago • Promoted
    Postdoctoral Scholar in Agricultural Sustainability in California

    Postdoctoral Scholar in Agricultural Sustainability in California

    University of California - Santa Cruz • Santa Cruz, CA, United States
    Full-time
    Postdoctoral Scholar in Agricultural Sustainability in California .Commensurate with qualifications and experience.Postdoctoral Scholar-Employee / Postdoctoral Scholar-Fellow / Postdoctoral Schola...Show more
    Last updated: 20 days ago • Promoted
    Travel Nurse RN - Infusion - $2,482 per week

    Travel Nurse RN - Infusion - $2,482 per week

    ALOIS Healthcare • Mountain House, CA, US
    Full-time
    ALOIS Healthcare is seeking a travel nurse RN Infusion for a travel nursing job in Tracy, California.Job Description & Requirements Specialty : Infusion Discipline : RN Start Date : 12 / 15 / 2025 Duratio...Show more
    Last updated: 1 day ago • Promoted
    Sr. Developer Advocate, Databricks AI Agentic Systems

    Sr. Developer Advocate, Databricks AI Agentic Systems

    Menlo Ventures • San Francisco, CA, United States
    Full-time
    San Francisco, Bellevue, Amsterdam.Are you a recognized technical leader in Generative AI and MLOps, driven to define the future of production AI Agentic Systems? This Senior Developer Advocate rol...Show more
    Last updated: 2 days ago • Promoted
    Sr. Staff Software Engineer - AI + Data Intelligence Platform

    Sr. Staff Software Engineer - AI + Data Intelligence Platform

    Menlo Ventures • San Francisco, CA, United States
    Full-time
    At Databricks, we are passionate about enabling data teams to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical ...Show more
    Last updated: 10 days ago • Promoted
    Strategic Projects Lead, Generative AI

    Strategic Projects Lead, Generative AI

    Scale AI, Inc. • San Francisco, CA, United States
    Full-time
    Scale's Generative AI business unit is currently seeing historic levels of growth.As a Strategic Projects Lead (SPL), you will lead initiatives that will drive $XXM+ in new revenue for the business...Show more
    Last updated: 4 days ago • Promoted
    Endoscopy Technologist - Endoscopy GI (Coyote Acres)

    Endoscopy Technologist - Endoscopy GI (Coyote Acres)

    Christus Health • Coyote Acres, TX, US
    Full-time +1
    Assists with the coordination of care for patients in the endoscopy department.Responsible for the preparation, maintenance, and cleaning of equipment and supplies and may assist in performing inva...Show more
    Last updated: 2 hours ago • Promoted • New!
    Certified Medical Assistant - OBGYN

    Certified Medical Assistant - OBGYN

    Christus Health • Coyote Acres, TX, US
    Full-time
    The Certified Medical Assistant will perform various services and related activities in support of patient care including accurate data entry for patient registration and insurance verification.The...Show more
    Last updated: 19 days ago • Promoted
    Home-Based Market Research Contributor (Hiring Immediately)

    Home-Based Market Research Contributor (Hiring Immediately)

    Earn Haus • Mountain House, California, US
    Remote
    Full-time +1
    We are urgently looking for people interested in taking online surveys for Fortune 500 brands.If you are a self-starter, looking for flexible hours throughout the week, this may be for you! Earn up...Show more
    Last updated: 30+ days ago • Promoted
    Travel Nurse RN - Infusion - $2,210 per week

    Travel Nurse RN - Infusion - $2,210 per week

    KPG Healthcare • Mountain House, CA, US
    Permanent
    KPG Healthcare is seeking a travel nurse RN Infusion for a travel nursing job in Tracy, California.Job Description & Requirements Specialty : Infusion Discipline : RN Start Date : 12 / 15 / 2025 Duration : ...Show more
    Last updated: 1 day ago • Promoted
    Certified Medical Assistant - OBGYN (Coyote Acres)

    Certified Medical Assistant - OBGYN (Coyote Acres)

    Christus Health • Coyote Acres, TX, US
    Full-time +1
    The Certified Medical Assistant will perform various services and related activities in support of patient care including accurate data entry for patient registration and insurance verification.The...Show more
    Last updated: 2 hours ago • Promoted • New!
    Research Associate III / Senior Research Associate (Biodesigner), Assay Development / Bioanalytical Development

    Research Associate III / Senior Research Associate (Biodesigner), Assay Development / Bioanalytical Development

    Amber Bio • San Francisco Bay Area, United States
    Full-time
    Amber Bio is a biotechnology company pioneering new gene editing modalities using multi-kilobase edits to reach previously undruggable patient populations. Founded by pioneers in the CRISPR field fr...Show more
    Last updated: 30+ days ago • Promoted
    Travel Nurse RN - Infusion - $2,478 per week

    Travel Nurse RN - Infusion - $2,478 per week

    Slate Healthcare • Mountain House, CA, US
    Full-time
    Slate Healthcare is seeking a travel nurse RN Infusion for a travel nursing job in Tracy, California.Job Description & Requirements Specialty : Infusion Discipline : RN Start Date : 12 / 15 / 2025 Duratio...Show more
    Last updated: 1 day ago • Promoted
    Research & Strategy Analyst, Life Sciences

    Research & Strategy Analyst, Life Sciences

    Savills North America • Hayward, CA, United States
    Full-time
    Savills is seeking a Research & Strategy Analyst to join its Life Sciences Practice Group.This hybrid role blends market research, strategic insight, and business development support to empower bro...Show more
    Last updated: 4 days ago • Promoted
    High-Throughput Screening Research Associate II, III (Biodesigner II, III)

    High-Throughput Screening Research Associate II, III (Biodesigner II, III)

    Amber Bio • Fremont, CA, United States
    Full-time
    Amber Bio is a biotechnology company pioneering new gene editing modalities using multi-kilobase edits to reach previously undruggable patient populations. Founded by pioneers in the CRISPR field fr...Show more
    Last updated: 30+ days ago • Promoted