Research Engineer (LLMs and Generative Models)

GenBio AIPalo Alto, CA, US

20 days ago

Job type

Full-time

Job description

Job Description

Headquartered in Silicon Valley, we are a newly established start-up, where a collective of visionary scientists, engineers, and entrepreneurs are dedicated to transforming the landscape of biology and medicine through the power of Generative AI. Our team comprises leading minds and innovators in AI and Biological Science, pushing the boundaries of what is possible. We are dreamers who reimagine a new paradigm for biology and medicine.

GenBio AI is building the AI-Driven Digital Organism (AIDO) and the AIDO Virtual Cell Lab, a platform where researchers can design, perturb, and observe biological systems entirely in silico using biological foundation models and LLMs. We are looking for a Research Engineer specializing in building inference and finetuning infrastructure, with a focus on LLMs and generative models (e.g. AIDO.StructureDiffusion). You’ll work on closing the gap between research and production in the AIDO model ecosystem, bringing new models to internal and external users through on-demand inference and finetuning.

Key Responsibility

Design and own AIDO’s internal model ecosystem , including scalable infrastructure for serving, finetuning, distillation, and inference across many model sizes and architectures.
Develop reusable pipelines for on-demand model finetuning using internal and external datasets, ensuring reproducibility and cost-efficiency.
Build APIs and inference tools that integrate deeply with downstream biology simulators.
Productionize foundation model interfaces , including transformer-based LLMs and diffusion / auto-regressive architectures, with an emphasis on biological data modalities (DNA, RNA, protein, etc.).
Collaborate with research and product teams to enable virtual experiments powered by generative AI, including agentic workflows and user-facing tooling.
Support distillation, quantization, and routing strategies to optimize model throughput and enable multi-model orchestration.
Prioritize observability, reliability, and safety in generative workflows through better logging, traceability, and rollback mechanisms.
Ensure scalability and automation throughout the model lifecycle : training, testing, deployment, and adaptation.
Automate everything.

Qualifications

M.S. or equivalent practical experience in MLOps, Computer Science, Engineering, or related field.

2+ years of experience developing, deploying, and evaluating LLMs or generative models (transformers, diffusion models, VAEs, autoregressive architectures, etc.).

Proficiency with deep learning research and production stacks, such as PyTorch, HuggingFace Transformers & Accelerate, or Megatron-LM / DeepSpeed.

Strong programming skills in Python, with experience developing model services and backend APIs (Flask, FastAPI, or similar).

Familiarity with GPU-accelerated tools (e.g., CUDA, cuDNN, Triton) and profilers (PyTorch Profiler, Nsight Systems, TensorBoard)

Familiarity with resource coordination platforms (e.g., SLURM, Kubernetes), and managed solutions (Vertex AI, SageMaker, OCI Data Science)

Familiarity with ML automation frameworks (e.g. Kubeflow, Argo Workflows, Apache Airflow, Metaflow).

Expertise in cloud computing (GCP, OCI, AWS).

Strong software engineering practices : testing, version control, CI / CD pipelines.

Ability to work in a fast-moving research environment, productionizing new models as they become available.

Preferred Qualifications

Ph.D. degree in Computer Science, Engineering, or related field. Experience in life sciences or healthcare is a plus.

Experience with biological data modalities (e.g., DNA, RNA, protein sequences, cell imaging).

Prior work on multimodal or multiscale models across text, sequences, images, or structure.

Background in model distillation, quantization, and memory / latency optimization.

Knowledge of RESTful API design and data security.

Strong written and verbal communication skills, especially across research, product, and engineering.

Deep curiosity about biology and excitement to build tools that democratize access to scientific exploration.

Join us as we embark on this journey to redefine the future of biology and medicine.

We are an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. GenBio AI participates in the U.S. Department of Homeland Security’s E-Verify program to confirm the employment eligibility of all newly hired employees. For more information on E-Verify, please visit www.e-verify.gov.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Create a job alert for this search

Research Engineer • Palo Alto, CA, US

Related jobs

Promoted

Machine Learning Research Engineer - Robotics

Scale AI, Inc.San Francisco, CA, United States

Full-time

Scale's Robotics business unit is dedicated to solving the data bottleneck in Physical AI.This position will be a key contributor in conducting applied research in Robotics and developing ML pipeli...Show moreLast updated: 30+ days ago

Promoted

Machine Learning Research Engineer, Enterprise ML Systems

Scale AI, Inc.San Francisco, CA, United States

Full-time

AI is becoming vitally important in every function of our society.At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, ...Show moreLast updated: 14 days ago

Promoted

Machine Learning Research Engineer, Agents - Enterprise GenAI

Scale AI, Inc.San Francisco, CA, United States

Full-time

Promoted

Distributed ML Systems Engineer- Inference

Together AISan Francisco, CA, United States

Full-time

Together AI is seeking a Distributed ML Systems Engineer to design and build scalable machine learning systems that power our accelerated AI initiatives. This role involves developing large-scale, f...Show moreLast updated: 30+ days ago

Promoted
New!

Machine Learning Research Engineer, Enterprise ML Systems

Scale AISan Francisco, CA, United States

Full-time

Promoted

ML Research Engineer, ML Systems

Scale AISan Francisco, CA, United States

Full-time

Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering MLEs, researchers, data scientists and opera...Show moreLast updated: 30+ days ago

Promoted

Research Engineer

eGainSunnyvale, CA, United States

Full-time

Fortune 500 clients and government agencies trust eGain AI knowledge solution to improve customer experience and reduce cost of service. Top rated by Gartner, eGain AI Knowledge Hub orchestrates AI ...Show moreLast updated: 6 days ago

Promoted

ML Research Engineer

OumiPalo Alto, CA, United States

Full-time

Get AI-powered advice on this job and more exclusive features.Oumi is on a mission to make frontier AI truly open for all. We are founded on the belief that AI will have a transformative impact on h...Show moreLast updated: 7 days ago

Promoted

Senior Research And Development Engineer

Cambridge RecruitersPleasanton, CA, US

Full-time

Commercial-Stage, 200-Person Minimally-Invasive Interventional medical device company.Very well-funded with $50MM financing recently secured and next-gen products in development.The Instruments Eng...Show moreLast updated: 5 days ago

Promoted

Senior Research Engineer, Responsibility

DeepMindMountain View, CA, United States

Full-time

This role is for a Research Engineer focusing on a range of projects within the Responsibility portfolio at Google DeepMind. This role is crucial for fulfilling DeepMind's mission to build AI respon...Show moreLast updated: 8 days ago

Promoted

Applied ML Research Engineer - Robotics

Chef RoboticsSan Francisco, CA, United States

Full-time

Chef Robotics is on a mission to accelerate the advent of intelligent machines in the physical world.As the rise of LLMs like ChatGPT has shown, AI has the potential to drive immense change.However...Show moreLast updated: 30+ days ago

Promoted

Research Engineer, World Models

1X Technologies ASPalo Alto, CA, United States

Full-time

We're an AI and robotics company based in Palo Alto, California, on a mission to build a truly abundant society through general-purpose robots capable of performing any kind of work autonomously.We...Show moreLast updated: 30+ days ago

Promoted

Machine Learning Research Engineer (1 Year Fixed Term)

Stanford UniversityStanford, CA, US

Part-time +1

The Enigma Project (enigmaproject.Department of Ophthalmology at Stanford University School of Medicine, dedicated to understanding the computational principles of natural intelligence using the to...Show moreLast updated: 2 days ago

Promoted

ML Research Engineer, ML Systems

Scale AI, Inc.San Francisco, CA, United States

Full-time

Promoted

Machine Learning Research Scientist / Research Engineer, Post-Training

Scale AI, Inc.San Francisco, CA, United States

Full-time

Scale works with the industry's leading AI labs to provide high quality data and accelerate progress in GenAI research.We are looking for Research Scientists and Research Engineers with expertise i...Show moreLast updated: 30+ days ago

Promoted

Machine Learning Systems Engineer, Research Tools

AnthropicSan Francisco, CA, United States

Full-time

Anthropic's mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...Show moreLast updated: 30+ days ago

Promoted
New!

Machine Learning Research Engineer

KiddomSan Francisco, CA, United States

Full-time +1

This range is provided by Kiddom.Your actual pay will be based on your skills and experience talk with your recruiter to learn more. Kiddom is a groundbreaking educational platform that promotes stu...Show moreLast updated: 11 hours ago

Promoted

Machine Learning Research Engineer

ProphecySan Francisco, CA, United States

Full-time

Prophecy is the world's most advanced data integration platform, designed to make complex data work simple and powerful.Designed natively for modern cloud data platforms, Prophecy uniquely serves t...Show moreLast updated: 30+ days ago