Talent.com
AI Performance Optimization Engineer
Lightning AI • New York, NY, United States
1 day ago
Job type
  • Full-time
Job description

Who We Are

Lightning AI is the company reimagining the way AI is built. After creating and releasing PyTorch Lightning in 2019, Lightning AI was launched to reshape the development of artificial intelligence products for commercial and academic use.

We are on a mission to simplify AI development, making it accessible to everyone, from solo researchers to large enterprises. By removing the complexity of building and deploying AI tools, we empower innovators to focus on solving real-world problems. Our platform is built to scale with the latest AI advancements while staying intuitive and adaptable, so you can bring your ideas to life.

We have offices in New York City, San Francisco, and London and are backed by investors such as Coatue, Index Ventures, Bain Capital Ventures, and Firstminute.

Our Values

  • Move Fast: We act with speed and precision, breaking down big challenges into achievable steps.
  • Focus: We complete one goal at a time with care, collaborating as a team to deliver features with precision.
  • Balance: Sustained performance comes from rest and recovery. We ensure a healthy work-life balance to keep you at your best.
  • Craftsmanship: Innovation through excellence. Every detail matters, and we take pride in mastering our craft.
  • Minimal: Simplicity drives our innovation. We eliminate complexity through discipline and focus on what truly matters.

What We're Looking For

We are seeking a highly skilled AI Optimization Engineer to work on optimizing training and inference workloads on compute accelerators and clusters, through the Lightning Thunder compiler and the broader PyTorch Lightning ecosystem. This role sits at the intersection of deep learning research, compiler development, and large-scale system optimization. You'll be shaping technology that pushes the boundaries of model performance and efficiency, creating foundational software that will impact the entire machine learning ecosystem.

You will join the Engineering Team and report to our Tech Lead. This is a hybrid role based in either our New York City or San Francisco office, with an in-office requirement of 2 days per week. The salary range for this role is $120,000-$250,000.

What you'll do

  • Develop performance-oriented model optimizations at multiple levels:
    • Graph-level (e.g., operator fusion, kernel scheduling, memory planning)
    • Kernel-level (CUDA, Triton, custom operators for specialized hardware)
    • System-level (distributed training across GPUs/TPUs, inference serving at scale)
  • Advance the Thunder compiler by building optimization passes, graph transformations, and integration hooks to accelerate training and inference workloads.
  • Work across the software stack to ensure optimizations are accessible to end users through clean APIs, automated tooling, and seamless integration with PyTorch Lightning.
  • Design and implement profiling and debugging tools to analyze model execution, identify bottlenecks, and guide optimization strategies.
  • Collaborate with hardware vendors and ecosystem partners to ensure Thunder runs efficiently across diverse backends (NVIDIA, AMD, TPU, specialized accelerators).
  • Contribute to open-source projects by developing new features, improving documentation, and supporting community adoption.
  • Engage with researchers and engineers in the community, providing guidance on performance tuning and advocating for Thunder as the go-to optimization layer in ML workflows.
  • Work cross-functionally with Lightning's product and engineering teams to ensure compiler and optimization improvements align with the broader product vision.

What you'll need

  • Strong expertise with deep learning frameworks such as PyTorch, JAX, or TensorFlow.
  • Hands-on experience with model optimization techniques, including graph-level optimizations, quantization, pruning, mixed precision, or memory-efficient training.
  • Deep understanding of compiler internals (IR design, operator fusion, scheduling, optimization passes) or proven work in performance-critical software.
  • Experience with CUDA, Triton, or other GPU programming models for developing custom kernels.
  • Knowledge of distributed systems and parallelism strategies (data/model/pipeline parallelism, checkpointing, elastic scaling).
  • Familiarity with software engineering practices: designing APIs, building robust tooling, testing, CI/CD for performance-sensitive systems.
  • Proven track record contributing to open-source projects in ML, HPC, or compiler domains.
  • Excellent collaboration and communication skills, with the ability to partner across research, engineering, and external contributors.
  • Bachelor's degree in Computer Science, Engineering, or a related field. Advanced degree (Master's or PhD) in machine learning, compilers, or systems highly preferred.

Benefits and Perks

We offer competitive base salaries and stock options with a 25% one-year cliff and monthly vesting thereafter. For our international employees, we work with Velocity Global to pay you in your local currency and provide equitable benefits across the globe.

In the US, we offer:

  • Medical, dental and vision
  • Life and AD&D insurance
  • Flexible paid time off plus 1 week of winter closure
  • Generous paid family leave benefits
  • $500 monthly meal reimbursement, including groceries & food delivery services
  • $500 one-time home office stipend
  • $1,000 annual learning & development stipend
  • 100% Citibike membership (NYC only)
  • $45/month gym membership
  • Various additional medical and mental health services

At Lightning AI, we are committed to fostering an inclusive and diverse workplace. We believe that diverse teams drive innovation and create better products. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity, national origin, age, disability, veteran status, or any other protected characteristic. We are dedicated to building a culture where everyone can thrive and contribute to their fullest potential.
