Software Engineer (AI Performance)
Gimlet Labs, Inc • San Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

Gimlet Labs is building the foundation for the next generation of AI applications. As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck. Gimlet is redefining AI inference from the ground up, combining cutting-edge research with an integrated hardware-software stack that delivers breakthrough performance, efficiency, and model quality. Gimlet pairs its inference stack with a seamless developer experience, allowing users to deploy, manage, and monitor AI workloads from frameworks like PyTorch and LangChain at production scale in seconds.

Gimlet was spun out of a Stanford research project under Professors Zain Asgar and Sachin Katti. The founding team has deep experience across AI, distributed systems, and hardware, along with previous successful exits.

Gimlet Labs is seeking a Software Engineer focused on AI Performance. You will research and implement techniques that drive performance and quality optimizations across the latest AI models, such as quantization, KV caching, and FlashAttention, to improve inference efficiency. You will design parallelism strategies to distribute data and workloads across compute nodes at production scale. You will dive deep into GPU code and kernel optimizations to accelerate AI workloads.

Responsibilities

  • Evaluating and implementing cutting-edge AI research for model performance and efficiency
  • Architecting infrastructure for distributed AI workloads across both the software stack and GPU kernel layers
  • Profiling, benchmarking, and analyzing system performance, identifying bottlenecks and optimization opportunities in execution runtimes targeting various hardware systems

Qualifications

  • Bachelor’s degree in computer science, engineering, applied mathematics, or a comparable area of study
  • Experience with performance optimization

Preferred Qualifications

  • Graduate degree in computer science, engineering, applied mathematics, or a comparable area of study
  • Familiarity with compilers and compiler frameworks such as MLIR
  • Experience with PyTorch, TensorFlow, vLLM, ONNX, and other AI frameworks
  • Software development experience with Python, C++, and CUDA
