Talent.com
Senior GenAI Research Engineer, Optimization and Kernels

Databricks, San Francisco, California, USA
2 days ago
Job type
  • Full-time
Job description

At Databricks, we are obsessed with enabling data teams to solve the world's toughest problems, from security threat detection to cancer drug development. We do this by building and running the world's best data and AI platform so our customers can focus on the high-value challenges that are central to their own missions.

The Mosaic AI organization enables companies to develop AI models and systems using their own data, with technologies ranging from pre-training LLMs from scratch to augmented generation using the latest retrieval techniques. Mosaic AI does so by producing novel science and putting it into production. Mosaic AI is committed to the belief that a company's AI models are just as valuable as any other core IP and that high-quality AI models should be available to all.

Job Description

As a research engineer on the Scaling team, you will be responsible for keeping up with the latest developments in deep learning and advancing the scientific frontier by creating new techniques that go beyond the state of the art. You will work on a collaborative team of researchers and engineers with diverse backgrounds and technical training. Most importantly, you will love our customers: our goal is to make our customers successful in applying state-of-the-art LLMs and AI systems, and we encode our scientific expertise into our products to make that possible.

The Impact you will have

As a research engineer on the Scaling Team at Databricks, you will:

  • Drive performance improvements through advanced optimization techniques, including kernel fusion, mixed precision, memory layout optimization, tiling strategies, and tensorization for training-specific patterns
  • Design, implement, and optimize high-performance GPU kernels for training workloads (e.g., attention mechanisms, custom layers, gradient computation, activation functions) targeting NVIDIA architectures
  • Design and implement distributed training frameworks for large language models, including parallelism strategies (data, tensor, pipeline, ZeRO-based) and optimized communication patterns for gradient synchronization and collective operations
  • Profile, debug, and optimize end-to-end training workflows to identify and resolve performance bottlenecks, applying memory optimization techniques like activation checkpointing, gradient sharding, and mixed precision training

What We Look for

  • BS/MS/PhD in Computer Science or a related field, with hands-on experience writing and tuning CUDA kernels for ML training applications or hands-on experience in distributed training frameworks (PyTorch DDP, DeepSpeed, Megatron-LM, FSDP)
  • Strong understanding of NVIDIA GPU architecture (memory hierarchy, tensor cores, warp scheduling, SM occupancy) and proficiency with CUDA debugging/profiling tools (Nsight, nvprof)
  • Deep understanding of parallelism techniques and memory optimization strategies for large-scale model training, with proven ability to debug and optimize distributed workloads
  • Strong software engineering skills in Python and PyTorch, with experience supporting production training workflows and knowledge of LLM training dynamics, including hyperparameter tuning and optimization strategies

Required Experience: Senior IC

Employment Type: Full Time

Vacancy: 1
