Talent.com
Research Engineer (LLMs and Generative Models)

Research Engineer (LLMs and Generative Models)

GenBio AIPalo Alto, CA, US
20 days ago
Job type
  • Full-time
Job description

Job Description

Job Description

Headquartered in Silicon Valley, we are a newly established start-up, where a collective of visionary scientists, engineers, and entrepreneurs are dedicated to transforming the landscape of biology and medicine through the power of Generative AI. Our team comprises leading minds and innovators in AI and Biological Science, pushing the boundaries of what is possible. We are dreamers who reimagine a new paradigm for biology and medicine.

GenBio AI is building the AI-Driven Digital Organism (AIDO) and the AIDO Virtual Cell Lab, a platform where researchers can design, perturb, and observe biological systems entirely in silico using biological foundation models and LLMs. We are looking for a Research Engineer specializing in building inference and finetuning infrastructure, with a focus on LLMs and generative models (e.g. AIDO.StructureDiffusion). You’ll work on closing the gap between research and production in the AIDO model ecosystem, bringing new models to internal and external users through on-demand inference and finetuning.

Key Responsibility

  • Design and own AIDO’s internal model ecosystem , including scalable infrastructure for serving, finetuning, distillation, and inference across many model sizes and architectures.
  • Develop reusable pipelines for on-demand model finetuning using internal and external datasets, ensuring reproducibility and cost-efficiency.
  • Build APIs and inference tools that integrate deeply with downstream biology simulators.
  • Productionize foundation model interfaces , including transformer-based LLMs and diffusion / auto-regressive architectures, with an emphasis on biological data modalities (DNA, RNA, protein, etc.).
  • Collaborate with research and product teams to enable virtual experiments powered by generative AI, including agentic workflows and user-facing tooling.
  • Support distillation, quantization, and routing strategies to optimize model throughput and enable multi-model orchestration.
  • Prioritize observability, reliability, and safety in generative workflows through better logging, traceability, and rollback mechanisms.
  • Ensure scalability and automation throughout the model lifecycle : training, testing, deployment, and adaptation.
  • Automate everything.

Qualifications

  • M.S. or equivalent practical experience in MLOps, Computer Science, Engineering, or related field.
  • 2+ years of experience developing, deploying, and evaluating LLMs or generative models (transformers, diffusion models, VAEs, autoregressive architectures, etc.).
  • Proficiency with deep learning research and production stacks, such as PyTorch, HuggingFace Transformers & Accelerate, or Megatron-LM / DeepSpeed.
  • Strong programming skills in Python, with experience developing model services and backend APIs (Flask, FastAPI, or similar).
  • Familiarity with GPU-accelerated tools (e.g., CUDA, cuDNN, Triton) and profilers (PyTorch Profiler, Nsight Systems, TensorBoard)
  • Familiarity with resource coordination platforms (e.g., SLURM, Kubernetes), and managed solutions (Vertex AI, SageMaker, OCI Data Science)
  • Familiarity with ML automation frameworks (e.g. Kubeflow, Argo Workflows, Apache Airflow, Metaflow).
  • Expertise in cloud computing (GCP, OCI, AWS).
  • Strong software engineering practices : testing, version control, CI / CD pipelines.
  • Ability to work in a fast-moving research environment, productionizing new models as they become available.
  • Preferred Qualifications

  • Ph.D. degree in Computer Science, Engineering, or related field. Experience in life sciences or healthcare is a plus.
  • Experience with biological data modalities (e.g., DNA, RNA, protein sequences, cell imaging).
  • Prior work on multimodal or multiscale models across text, sequences, images, or structure.
  • Background in model distillation, quantization, and memory / latency optimization.
  • Knowledge of RESTful API design and data security.
  • Strong written and verbal communication skills, especially across research, product, and engineering.
  • Deep curiosity about biology and excitement to build tools that democratize access to scientific exploration.
  • Join us as we embark on this journey to redefine the future of biology and medicine.

    We are an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. GenBio AI participates in the U.S. Department of Homeland Security’s E-Verify program to confirm the employment eligibility of all newly hired employees. For more information on E-Verify, please visit www.e-verify.gov.

    We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

    Create a job alert for this search

    Research Engineer • Palo Alto, CA, US

    Related jobs
    • Promoted
    Machine Learning Research Engineer - Robotics

    Machine Learning Research Engineer - Robotics

    Scale AI, Inc.San Francisco, CA, United States
    Full-time
    Scale's Robotics business unit is dedicated to solving the data bottleneck in Physical AI.This position will be a key contributor in conducting applied research in Robotics and developing ML pipeli...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Research Engineer, Enterprise ML Systems

    Machine Learning Research Engineer, Enterprise ML Systems

    Scale AI, Inc.San Francisco, CA, United States
    Full-time
    AI is becoming vitally important in every function of our society.At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, ...Show moreLast updated: 14 days ago
    • Promoted
    Machine Learning Research Engineer, Agents - Enterprise GenAI

    Machine Learning Research Engineer, Agents - Enterprise GenAI

    Scale AI, Inc.San Francisco, CA, United States
    Full-time
    AI is becoming vitally important in every function of our society.At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, ...Show moreLast updated: 11 days ago
    • Promoted
    Distributed ML Systems Engineer- Inference

    Distributed ML Systems Engineer- Inference

    Together AISan Francisco, CA, United States
    Full-time
    Together AI is seeking a Distributed ML Systems Engineer to design and build scalable machine learning systems that power our accelerated AI initiatives. This role involves developing large-scale, f...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Machine Learning Research Engineer, Enterprise ML Systems

    Machine Learning Research Engineer, Enterprise ML Systems

    Scale AISan Francisco, CA, United States
    Full-time
    AI is becoming vitally important in every function of our society.At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, ...Show moreLast updated: 11 hours ago
    • Promoted
    ML Research Engineer, ML Systems

    ML Research Engineer, ML Systems

    Scale AISan Francisco, CA, United States
    Full-time
    Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering MLEs, researchers, data scientists and opera...Show moreLast updated: 30+ days ago
    • Promoted
    Research Engineer

    Research Engineer

    eGainSunnyvale, CA, United States
    Full-time
    Fortune 500 clients and government agencies trust eGain AI knowledge solution to improve customer experience and reduce cost of service. Top rated by Gartner, eGain AI Knowledge Hub orchestrates AI ...Show moreLast updated: 6 days ago
    • Promoted
    ML Research Engineer

    ML Research Engineer

    OumiPalo Alto, CA, United States
    Full-time
    Get AI-powered advice on this job and more exclusive features.Oumi is on a mission to make frontier AI truly open for all. We are founded on the belief that AI will have a transformative impact on h...Show moreLast updated: 7 days ago
    • Promoted
    Senior Research And Development Engineer

    Senior Research And Development Engineer

    Cambridge RecruitersPleasanton, CA, US
    Full-time
    Commercial-Stage, 200-Person Minimally-Invasive Interventional medical device company.Very well-funded with $50MM financing recently secured and next-gen products in development.The Instruments Eng...Show moreLast updated: 5 days ago
    • Promoted
    Senior Research Engineer, Responsibility

    Senior Research Engineer, Responsibility

    DeepMindMountain View, CA, United States
    Full-time
    This role is for a Research Engineer focusing on a range of projects within the Responsibility portfolio at Google DeepMind. This role is crucial for fulfilling DeepMind's mission to build AI respon...Show moreLast updated: 8 days ago
    • Promoted
    Applied ML Research Engineer - Robotics

    Applied ML Research Engineer - Robotics

    Chef RoboticsSan Francisco, CA, United States
    Full-time
    Chef Robotics is on a mission to accelerate the advent of intelligent machines in the physical world.As the rise of LLMs like ChatGPT has shown, AI has the potential to drive immense change.However...Show moreLast updated: 30+ days ago
    • Promoted
    Research Engineer, World Models

    Research Engineer, World Models

    1X Technologies ASPalo Alto, CA, United States
    Full-time
    We're an AI and robotics company based in Palo Alto, California, on a mission to build a truly abundant society through general-purpose robots capable of performing any kind of work autonomously.We...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Research Engineer (1 Year Fixed Term)

    Machine Learning Research Engineer (1 Year Fixed Term)

    Stanford UniversityStanford, CA, US
    Part-time +1
    The Enigma Project (enigmaproject.Department of Ophthalmology at Stanford University School of Medicine, dedicated to understanding the computational principles of natural intelligence using the to...Show moreLast updated: 2 days ago
    • Promoted
    ML Research Engineer, ML Systems

    ML Research Engineer, ML Systems

    Scale AI, Inc.San Francisco, CA, United States
    Full-time
    Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering MLEs, researchers, data scientists and opera...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Research Scientist / Research Engineer, Post-Training

    Machine Learning Research Scientist / Research Engineer, Post-Training

    Scale AI, Inc.San Francisco, CA, United States
    Full-time
    Scale works with the industry's leading AI labs to provide high quality data and accelerate progress in GenAI research.We are looking for Research Scientists and Research Engineers with expertise i...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Systems Engineer, Research Tools

    Machine Learning Systems Engineer, Research Tools

    AnthropicSan Francisco, CA, United States
    Full-time
    Anthropic's mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Machine Learning Research Engineer

    Machine Learning Research Engineer

    KiddomSan Francisco, CA, United States
    Full-time +1
    This range is provided by Kiddom.Your actual pay will be based on your skills and experience talk with your recruiter to learn more. Kiddom is a groundbreaking educational platform that promotes stu...Show moreLast updated: 11 hours ago
    • Promoted
    Machine Learning Research Engineer

    Machine Learning Research Engineer

    ProphecySan Francisco, CA, United States
    Full-time
    Prophecy is the world's most advanced data integration platform, designed to make complex data work simple and powerful.Designed natively for modern cloud data platforms, Prophecy uniquely serves t...Show moreLast updated: 30+ days ago