Talent.com
Software Engineer (AI Performance)
Gimlet Labs, Inc • San Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

Gimlet Labs is building the foundation for the next generation of AI applications. As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck. Gimlet is redefining AI inference from the ground up, combining cutting-edge research with an integrated hardware-software stack that delivers breakthrough performance, efficiency, and model quality. Gimlet pairs its inference stack with a seamless developer experience, allowing users to deploy, manage, and monitor AI workloads from frameworks like PyTorch and LangChain at production scale in seconds.

Gimlet was spun out of a Stanford research project under Professors Zain Asgar and Sachin Katti. The founding team has deep experience across AI, distributed systems, and hardware, with previous successful exits.

Gimlet Labs is seeking a Software Engineer focused on AI Performance. You will research and implement techniques that improve performance and quality across the latest AI models, such as quantization, KV caching, and FlashAttention. You will design parallelism strategies to distribute data and workloads across compute nodes at production scale, and you will dive deep into GPU code and kernel optimizations to accelerate AI workloads.

Responsibilities

  • Evaluating and implementing cutting-edge AI research for model performance and efficiency
  • Architecting infrastructure for distributed AI workloads across both the software stack and GPU kernel layers
  • Profiling, benchmarking, and analyzing system performance, identifying bottlenecks and optimization opportunities in execution runtimes targeting various hardware systems

Qualifications

  • Bachelor’s degree in computer science, engineering, applied mathematics or comparable area of study
  • Experience with performance optimization

Preferred Qualifications

  • Graduate degree in computer science, engineering, applied mathematics or comparable area of study
  • Familiarity with compilers and compiler frameworks such as MLIR
  • Experience with PyTorch, TensorFlow, vLLM, ONNX and other AI frameworks
  • Software development experience with Python, C++, and CUDA
