Talent.com
AI Accelerator Software Engineer-Communication Kernel
AI Accelerator Software Engineer-Communication KernelAmpere • Santa Clara, CA, United States
AI Accelerator Software Engineer-Communication Kernel

AI Accelerator Software Engineer-Communication Kernel

Ampere • Santa Clara, CA, United States
17 hours ago
Job type
  • Full-time
Job description

Description

Invent the future with us.

Ampere is a semiconductor design company for a new era, leading the future of computing with an innovative approach to CPU design focused on high-performance, energy efficient, sustainable cloud computing.

By providing a new level of predictable performance, efficiency, and sustainability Ampere is working with leading cloud suppliers and a growing partner ecosystem to deliver cloud instances, servers and embedded / edge products that can handle the compute demands of today and tomorrow.

Join us at Ampere and work alongside a passionate and growing team - we'd love to have you apply!

About the role :

In this role as an AI Accelerator Software Engineer-Communication Kernel, you will drive the development and optimization of cutting-edge AI frameworks. You will be at the forefront of advancing AI capabilities, helping to pave the way for high-performance and efficient computing solutions that will meet future AI demands.

What you'll achieve :

  • In this role, you will develop communication kernels and enable the cutting-edge distributed inference technologies like Tensor / Pipeline parallelism, disaggregated prefil and decode for Ampere accelerator.
  • Go deep into to the entire SW / HW stack to accelerate the deep learning including but not limited to inference serving, framework integration, compiler, runtime library, communication and compute kernel development, and performance tuning.
  • In this role, you will work on deep learning model enabling with performance and accuracy for popular frameworks like PyTorch and Llama.cpp and for serving platforms like vLLM and SGLang, positioning you at the forefront of AI innovation.
  • HW / SW codesign to optimize existing AI architectures to enhance computational efficiency, increase throughput, reduce latency, and improve the scalability, pushing the boundaries of what's possible in AI technology.
  • Be a key team member in building state-of-the-art software and hardware AI co-processors / accelerators, contribute to a collaborative and dynamic work environment, supporting continuous improvement and excellence.
  • Collaborate with cross-functional teams to integrate AI solutions into Ampere's cloud-native processor platforms and accelerators.

About you :

  • BS Computer Science, Mathematics or a related technical field & 12 years of related experience; or MS degree & 8 years; or PhD & 5 years
  • Previous development experience using or implementing a collective communication library like NCCL / RCCL / MPI, or working experience of socket programming
  • This position requires strong expertise in programming languages such as Python, C / C++ with a strong background in performance tuning.
  • Previous software development with a focus on AI frameworks - PyTorch, llama.cpp, ONNX, etc is a big plus.
  • Solid understanding of AI and machine learning concepts, including neural networks and data processing frameworks is also preferred.
  • Experience with high-performance computing systems and cloud-based architectures.
  • What we'll offer :

    At Ampere we believe in taking care of our employees and providing a competitive total rewards package that includes base pay, bonus (i.e., variable pay tied to internal company goals), long-term incentive, and comprehensive benefits. The full base pay range for this role is between $169,500 and $282,500, except in the San Francisco Bay Area where the range is between $178,500 and $297,500.

    Our benefits include health, wellness, and financial programs that support employees through every stage of life, with full benefits eligibility at 20 hours per week.

    Benefit highlights include :

  • Premium medical insurance, dental insurance, vision insurance, as well as income protection and a 401K retirement plan, so that you can feel secure in your health and financial future.
  • Unlimited Flextime and 10+ paid holidays so that you can embrace a healthy work-life balance.
  • A variety of healthy snacks, energizing espresso, and refreshing drinks to keep you fueled and focused throughout the day.
  • And there is much more than compensation and benefits. At Ampere, we foster an inclusive culture that empowers our employees to do more and grow more. We are passionate about inventing industry leading cloud-native designs that contribute to a more sustainable future. We are excited to share more about our career opportunities with you through the interview process.

    #LI-DR

    #LI-Hybrid

    Ampere is an inclusive and equal opportunity employer and welcomes applicants from all backgrounds. All qualified applicants will receive consideration for employment without regard to race, color, national origin, citizenship, religion, age, veteran and / or military status, sex, sexual orientation, gender, gender identity, gender expression, physical or mental disability, or any other basis protected by federal, state or local law.

    Create a job alert for this search

    Accelerator Software • Santa Clara, CA, United States

    Related jobs
    Senior, Software Engineer - AI

    Senior, Software Engineer - AI

    Walmart • Sunnyvale, CA, United States
    Full-time +1
    Develop next generation Last Mile Delivery software applications, including very high capacity, guaranteed availability, and mass market usability without compromising quality.Engage in hands on en...Show more
    Last updated: 17 hours ago • Promoted • New!
    Software Engineer, AI Product

    Software Engineer, AI Product

    Sesame • San Francisco, CA, United States
    Full-time
    Sesame believes in a future where computers are lifelike - with the ability to see, hear, and collaborate with us in ways that feel natural and human. With this vision, we're designing a new kind of...Show more
    Last updated: 4 days ago • Promoted
    Senior Software Engineer, AI Model serving - San Francisco, USA

    Senior Software Engineer, AI Model serving - San Francisco, USA

    Speechify • San Francisco, California, United States
    Full-time
    PLEASE APPLY THROUGH THIS LINK : .The mission of Speechify is to make sure that reading is never a barrier to learning.Over 50 million people use Speechify’s text-to-speech products to turn whatever ...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer - AI Agent Infrastructure (Healthcare)

    Software Engineer - AI Agent Infrastructure (Healthcare)

    Honey Health • Hayward, CA, United States
    Full-time
    Honey Health is the all-in-one AI back office for primary and specialty care.Our AI agents autonomously handle core back-office jobs, such as aggregating patient data, processing orders and prescri...Show more
    Last updated: 12 days ago • Promoted
    AI Accelerator Software Engineer-Graph Optimization

    AI Accelerator Software Engineer-Graph Optimization

    Ampere • Santa Clara, CA, United States
    Full-time
    Ampere is a semiconductor design company for a new era, leading the future of computing with an innovative approach to CPU design focused on high-performance, energy efficient, sustainable cloud co...Show more
    Last updated: 19 hours ago • Promoted • New!
    Senior Software Engineer - Voice AI Agents

    Senior Software Engineer - Voice AI Agents

    FleetWorks • San Francisco, CA, United States
    Full-time
    Every year, companies spend over a trillion dollars moving freight across the U.We’re building voice agents that transform the chaotic freight booking process into a modern, intelligent marketplace...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer - AI Agent Infrastructure (Healthcare)

    Senior Software Engineer - AI Agent Infrastructure (Healthcare)

    Honey Health • Fremont, CA, United States
    Full-time
    Honey Health is the all-in-one AI back office for primary and specialty care.Our AI agents autonomously handle core back-office jobs, such as aggregating patients data, processing orders and prescr...Show more
    Last updated: 10 days ago • Promoted
    Software Engineer - Applied AI

    Software Engineer - Applied AI

    Selector Software, Inc. • Santa Clara, CA, United States
    Full-time
    Selector is building an operational intelligence platform for digital infrastructure.By adopting an AI / ML-based analytics approach, the platform provides actionable multi-dimensional insights to ne...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer - AI Agents

    Senior Software Engineer - AI Agents

    FleetWorks Technology, Inc. • San Francisco, CA, United States
    Full-time
    Build and optimize autonomous AI agents that negotiate freight rates, book loads, and handle carrier conversations over phone and email. Own end-to-end feature development, working directly with a h...Show more
    Last updated: 3 days ago • Promoted
    Software Engineer, Enterprise AI

    Software Engineer, Enterprise AI

    Scale AI, Inc. • San Francisco, CA, United States
    Full-time
    Scale GP (Scale Generative AI Platform) is an enterprise-grade Generative AI platform that provides APIs for knowledge retrieval, inference, evaluation, and more. We are looking for a strong enginee...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer- AI / ML, AWS Neuron

    Software Engineer- AI / ML, AWS Neuron

    Amazon • Cupertino, CA, United States
    Full-time
    AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine.Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Ap...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer, Agentic AI

    Senior Software Engineer, Agentic AI

    PayPal • San Jose, CA, United States
    Full-time
    PayPal has been revolutionizing commerce globally for more than 25 years.Creating innovative experiences that make moving money, selling, and shopping simple, personalized, and secure, PayPal empow...Show more
    Last updated: 17 hours ago • Promoted • New!
    Software AI Engineer

    Software AI Engineer

    Jade Global • San Jose, CA, United States
    Full-time
    Design, develop, and implement AI / ML models and algorithms using Python.Deploy and manage AI solutions on cloud platforms (e. Collaborate with data scientists and other engineers to integrate AI mod...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer, AI Agentic Experience (Auth0)

    Senior Software Engineer, AI Agentic Experience (Auth0)

    Okta • San Francisco, CA, United States
    Full-time
    Okta is The World's Identity Company.We free everyone to safely use any technology, anywhere, on any device or app.Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secur...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, AI Agents

    Software Engineer, AI Agents

    Asana • San Francisco, CA, United States
    Full-time
    We're building AI Teammates : agents that work like actual users in Asana and integrated apps.They triage bugs, respond to requests, draft project briefs, conduct research, and handle knowledge work...Show more
    Last updated: 10 days ago • Promoted
    Senior AI Software Engineer, GenAI Framework

    Senior AI Software Engineer, GenAI Framework

    NVIDIA • Santa Clara, CA, United States
    Full-time
    NVIDIA is now looking for AI Software Engineers for our GenAI Frameworks (Megatron Core () and NeMo Framework () ) team.Megatron Core and NeMo Framework are open-source, scalable and cloud-native f...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer, AI for Quantum

    Senior Software Engineer, AI for Quantum

    PsiQuantum • Palo Alto, CA, United States
    Full-time
    Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show more
    Last updated: 17 hours ago • Promoted • New!
    Senior Software Engineer, AI Platform - Robotics

    Senior Software Engineer, AI Platform - Robotics

    NVIDIA • Santa Clara, CA, United States
    Full-time
    We’re building the infrastructure that powers GR00T, NVIDIA’s general-purpose humanoid robotics platform.This is not a typical DevOps job. You’ll help engineer the cloud-native backend that drives s...Show more
    Last updated: 30+ days ago • Promoted