Talent.com
Senior AI Software Engineer, LLM Inference Performance Analysis

Senior AI Software Engineer, LLM Inference Performance Analysis

NVIDIASanta Clara, CA, United States
14 hours ago
Job type
  • Full-time
Job description

We're now seeking a Senior AI Software Engineer, in our LLM Inference Performance Analysis and Optimization team!

NVIDIA leads the generative AI revolution. We're now seeking an experienced AI Software Engineer to optimize LLM inference performance. Our team collaborates with compiler, kernel, hardware, and framework teams to assess bottlenecks, create optimization methods, and validate improvements. If you’re passionate about system-level performance, compiler IR, and GPU kernel optimization for deep learning inference, we’d love to consider you for our team.

What you'll be doing :

Analyze the performance of LLMs on NVIDIA GPUs by employing advanced profiling and projection tools.

Find opportunities for performance improvements in the IR-based compiler middle end optimizer and / or in precompiled kernel optimizations driven by Graph IR transformations.

Build and develop new compiler passes and optimization techniques to deliver outstanding, robust, and maintainable compiler infrastructure and tools.

Collaborate closely with architecture teams to influence and co-design future hardware features that improve compiler and runtime efficiency.

Work with geographically distributed teams across compiler, hardware, kernel, and framework domains to drive performance improvements and resolve complex issues.

Contribute to a core team at the forefront of deep learning and LLM inference technology, spanning hardware architecture development, kernel optimization, and integration with higher-level deep learning frameworks.

What we need to see :

Master’s or PhD in Computer Science, Computer Engineering, or a related field, or equivalent experience.

5+ years relevant experience.

Strong hands-on programming expertise in C++ and Python, with solid software engineering fundamentals.

Skilled in innovative LLM architectures, covering inference optimization, profiling, and compiler-level performance tuning.

Significant background in optimizing kernels through information retrieval techniques and generating code, including graph transformations, fusion, scheduling, and developing custom kernel generation frameworks like OpenAI Triton or other compiler-based code generation pipelines.

Hands-on experience with deep learning frameworks like TensorRT-LLM, vLLM, SGLang, Jax / XLA, or related compiler / runtime environments.

Proven ability to analyze and optimize LLM performance bottlenecks across model development, kernel execution, and runtime systems.

Excellent communication and collaboration skills, with the ability to work independently and effectively across distributed teams in a fast-paced environment.

Display a robust determination to continuously improve software and hardware performance by engaging in profiling, analysis, and optimization.

Proficiency in CUDA programming and familiarity with GPU-accelerated deep learning frameworks and performance tuning techniques.

Ways to stand out from the crowd :

Showcase innovative applications of agentic AI tools that enhance productivity and workflow automation.

Proven background in LLVM, MLIR, and / or Clang compiler development.

Active engagement with the open-source LLVM or MLIR community to ensure tighter integration and alignment with upstream efforts.

NVIDIA is recognized as one of the world’s most desirable engineering environments, built by teams who value technical depth, innovation, and impact. We work alongside some of the best minds in GPU computing, systems software, and AI. If you’re driven by performance, enjoy solving complex problems, and thrive in an environment that rewards initiative and technical excellence, we’d love to hear from you!

#LI-Hybrid

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits () .

Applications for this job will be accepted at least until November 4, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Create a job alert for this search

Senior Software Engineer Ai • Santa Clara, CA, United States

Related jobs
  • Promoted
Senior AI / ML Engineer II

Senior AI / ML Engineer II

DigitalOceanSan Francisco, CA, United States
Full-time
Dive in and do the best work of your career at DigitalOcean.Journey alongside a strong community of top talent who are relentless in their drive to build the simplest scalable cloud.If you have a g...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
Senior Lead AI Engineer (AI Foundations, LLM Core)

Senior Lead AI Engineer (AI Foundations, LLM Core)

Capital OneSan Jose, CA, United States
Full-time +1
Senior Lead AI Engineer (AI Foundations, LLM Core).At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader ...Show moreLast updated: 14 hours ago
  • Promoted
  • New!
Senior AI Engineer (AI Foundations, LLM Core, Agentic AI)

Senior AI Engineer (AI Foundations, LLM Core, Agentic AI)

Capital OneSan Francisco, CA, United States
Full-time +1
Senior AI Engineer (AI Foundations, LLM Core, Agentic AI).At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry ...Show moreLast updated: 14 hours ago
  • Promoted
AIML - Senior Software Engineer, Machine Learning Platform Technologies

AIML - Senior Software Engineer, Machine Learning Platform Technologies

AppleSan Francisco, CA, United States
Full-time
AIML - Senior Software Engineer, Machine Learning Platform Technologies.San Francisco Bay Area, California, United States Software and Services. We are looking for a Senior Software Engineer to shap...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
Senior Lead AI Engineer AI Foundations LLM Core Agentic AI

Senior Lead AI Engineer AI Foundations LLM Core Agentic AI

Capital OneSan Francisco, CA, United States
Full-time +1
Senior Lead AI Engineer (AI Foundations, LLM Core, Agentic AI).At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an indu...Show moreLast updated: 14 hours ago
  • Promoted
  • New!
AI Engineer with LLM

AI Engineer with LLM

Diverse LynxSan Jose, CA, United States
Full-time
Job Responsibilities : • Collaborate closely with global product and engineering teams to design AI and LLM solutions and support business objectives. Design, develop and deliver AI / ML models, levera...Show moreLast updated: 14 hours ago
  • Promoted
  • New!
LLM / ML Engineer (Inference)

LLM / ML Engineer (Inference)

ReductoSan Francisco, CA, United States
Full-time
We would love to meet you if you : .Philosophy : You are your own worst critic.You have a high bar for quality and don't rest until the job is done rightno settling for 90%. We want someone who ships f...Show moreLast updated: 14 hours ago
  • Promoted
Senior Software Engineer, AI / ML, GenAI, Ads

Senior Software Engineer, AI / ML, GenAI, Ads

Google Inc.Mountain View, CA, United States
Full-time
Senior Software Engineer, AI / ML, GenAI, Ads.Google place Pittsburgh, PA, USA.Bachelor’s degree or equivalent practical experience. Large Language Models, Multi-Modal, Large Vision Models) or GenAI-r...Show moreLast updated: 30+ days ago
  • Promoted
Lead AI Engineer (FM Hosting, LLM Inference)

Lead AI Engineer (FM Hosting, LLM Inference)

Capital OneSan Francisco, CA, United States
Part-time
You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing.You want to work on problems that will help change banking for good.Passion for s...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
Senior Software Engineer, AI / ML

Senior Software Engineer, AI / ML

GLUE IncSan Francisco, CA, United States
Full-time
Glue is a well-funded startup working on the next generation of work communication tools.We believe that todays work chat is noisy, unstructured, and not designed for productivity.Were drawing from...Show moreLast updated: 14 hours ago
  • Promoted
  • New!
Senior AI Engineer - LLM, RAG

Senior AI Engineer - LLM, RAG

BrightAI CorporationPalo Alto, CA, United States
Full-time
AI is a high-growth Physical AI company transforming how businesses interact with the physical world through intelligent automation. Our AI platform processes visual, spatial, and temporal data from...Show moreLast updated: 14 hours ago
  • Promoted
AI Engineer - LLM Infra

AI Engineer - LLM Infra

YutoriSan Francisco, CA, United States
Full-time
Yutori is reimagining how people interact with the web by building AI agents that can reliably do everyday digital tasks. We are building the entire stack to be agent-first, from training our own mo...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
Lead AI Engineer (AI Foundations, LLM Core)

Lead AI Engineer (AI Foundations, LLM Core)

Capital OneSan Jose, CA, United States
Full-time +1
Lead AI Engineer (AI Foundations, LLM Core).At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in usin...Show moreLast updated: 13 hours ago
  • Promoted
  • New!
Senior AI Engineer (AI Foundations, LLM Core)

Senior AI Engineer (AI Foundations, LLM Core)

Capital OneSan Jose, CA, United States
Full-time +1
Senior AI Engineer (AI Foundations, LLM Core).At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in us...Show moreLast updated: 13 hours ago
  • Promoted
  • New!
Senior AI Engineer AI Foundations LLM Core Agentic AI

Senior AI Engineer AI Foundations LLM Core Agentic AI

Capital OneSan Jose, CA, United States
Full-time +1
Senior AI Engineer (AI Foundations, LLM Core, Agentic AI).At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry ...Show moreLast updated: 14 hours ago
  • Promoted
  • New!
Senior AI Engineer (AI Foundations, LLM Core and Agentic AI)

Senior AI Engineer (AI Foundations, LLM Core and Agentic AI)

Capital OneSan Francisco, CA, United States
Full-time +1
Senior AI Engineer (AI Foundations, LLM Core and Agentic AI).At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an indust...Show moreLast updated: 13 hours ago
  • Promoted
  • New!
Senior Lead AI Engineer (AI Foundations, LLM Core, Agentic AI)

Senior Lead AI Engineer (AI Foundations, LLM Core, Agentic AI)

Capital OneSan Jose, CA, United States
Full-time +1
Senior Lead AI Engineer (AI Foundations, LLM Core, Agentic AI).At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an indu...Show moreLast updated: 14 hours ago
  • Promoted
  • New!
Senior AI Engineer (LLM Core)

Senior AI Engineer (LLM Core)

Capital OneSan Francisco, CA, United States
Full-time +1
At Capital One, we are creating responsible and reliable AI systems, changing banking for good.For years, Capital One has been an industry leader in using machine learning to create real-time, pers...Show moreLast updated: 14 hours ago