Talent.com
Senior Data Scientist
Senior Data ScientistThe Chan Zuckerberg Biohub, Inc. • Redwood City, CA, United States
Senior Data Scientist

Senior Data Scientist

The Chan Zuckerberg Biohub, Inc. • Redwood City, CA, United States
10 days ago
Job type
  • Full-time
Job description

Biohub is leading the new era of AI-powered biology to cure or prevent disease through its 501c3 medical research organization, with the support of the Chan Zuckerberg Initiative.

The Team

Biohub supports the science and technology that will make it possible to help scientists cure, prevent, or manage all diseases by the end of this century. While this may seem like an audacious goal, in the last 100 years, biomedical science has made tremendous strides in understanding biological systems, advancing human health, and treating disease.

Achieving our mission will only be possible if scientists are able to better understand human biology. To that end, we have identified four grand challenges that will unlock the mysteries of the cell and how cells interact within systems — paving the way for new discoveries that will change medicine in the decades that follow :

  • Building an AI-based virtual cell model to predict and understand cellular behavior
  • Developing state-of-the-art imaging systems to observe living cells in action
  • Instrumenting tissues to better understand inflammation, a key driver of many diseases
  • Engineering and harnessing the immune system for early detection, prevention, and treatment of disease

As a Senior Data Scientist, you'll lead the creation of groundbreaking datasets that power our AI / ML efforts within and across our scientific grand challenges. Working at the intersection of data science, biology, and AI, your work will focus on creating large, AI-ready datasets, spanning single-cell sequencing, immune receptor profiling, and mass spectrometry peptidomics data. You will define data needs, format standards, analysis approaches and quality metrics and build pipelines to ingest, transform, and validate data products that form the foundation of our experiments.

Our Data Ecosystem :

These efforts will form a part of, and interoperate with our larger data ecosystem. We are generating unprecedented scientific datasets that drive biological innovation :

  • Billions of standardized cells of single-cell transcriptomic data, with a focus on measuring genetic and environmental perturbations
  • 10s of thousands of donor-matched DNA & RNA samples
  • 10s PBs-scale static and dynamic imaging datasets
  • 100s TBs-scale mass spectrometry datasets
  • Diverse, large multi-modal biological datasets that enable biological bridges across measurement types and facilitate multi-modal model training to define how cells act.
  • When analysis of a dataset is complete, you will help publish it through public resources like CELLxGENE Discover, the CryoET Portal, and the Virtual Cell Platform, used by tens of thousands of scientists monthly to advance understanding of genetic variants, disease risk, drug toxicities, and therapeutic discovery.

    Your Impact

    You'll collaborate with cross-functional teams to lead dataset definition, ingestion, transformation, and delivery for AI modeling and experimental analysis. Success means delivering high-quality, usable datasets that directly address modeling challenges and accelerate scientific progress. Join us in building the data foundation that will transform our understanding of human biology and move us along the path to curing, preventing, and managing all disease.

    What You'll Do

  • Contribute the tools required for a robust data ecosystem : build single cell data ingestion pipelines, select data formats, standards, and database schemas, and write validation tools, QC approaches, and analysis pipelines.
  • Collaborate with Platform Scientists, ML engineers, AI Researchers, and Data Engineers to iteratively evaluate, refine and grow datasets to improve our biological understanding of inflammation.
  • Work closely with Platform Scientists to identify technical variables and devise approaches to harmonize data across generation sites to enable joint analysis.
  • Discover and define new data generation opportunities, and manage the delivery of those data products to our scientific teams.
  • What You'll Bring

  • 10+ years of experience with large scale high throughput biological data (single cell sequencing, immune receptor profiling, mass spectrometry).
  • Demonstrated ability to deliver multiple large biological data products.
  • Experience with big data : extraction, transport, loading, databases, standardization, validation, QC, and analysis.
  • Strong fundamentals in statistical reasoning and machine learning.
  • Experience with biological data analysis and QC best practices
  • Excellent written and verbal communication skills.
  • Enthusiasm to ramp up on technologies and learn new domains.
  • Experience working in a multidisciplinary environment (scientific platforms, engineering, product, AI Research).
  • Compensation

    The Redwood City, CA base pay range for this role is $190,000 - $261,800. New hires are typically hired into the lower portion of the range, enabling employee growth in the range over time. Actual placement in range is based on job-related skills and experience, as evaluated throughout the interview process.

    Work Mode

    As we grow, we’re excited to strengthen in-person connections and cultivate a collaborative, team-oriented environment. This role is a hybrid position requiring you to be onsite for at least 60% of the working month, approximately 3 days a week, with specific in-office days determined by the team’s manager. The exact schedule will be at the hiring manager’s discretion and communicated during the interview process.

    Benefits for the Whole You

    We’re thankful to have an incredible team behind our work. To honor their commitment, we offer a wide range of benefits to support the people who make all we do possible.

  • Provides a generous employer match on employee 401(k) contributions to support planning for the future.
  • Paid time off to volunteer at an organization of your choice.
  • Relocation support for employees who need assistance moving
  • If you’re interested in a role but your previous experience doesn’t perfectly align with each qualification in the job description, we still encourage you to apply as you may be the perfect fit for this or another role.

    Reasonable Accommodation Notice

    We provide (and state and federal law requires) reasonable accommodations to qualified individuals with disabilities. Please tell us if you require a reasonable accommodation to apply for a job or to perform your job (reach out to your recruiter or accommodations@chanzuckerberg.com ). Examples of reasonable accommodation include making a change to the application process or work procedures, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment.

    For reporting purposes, we ask candidates to respond to the below self-identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file.

    As set forth in the organization’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.

    #J-18808-Ljbffr

    Create a job alert for this search

    Senior Data Scientist • Redwood City, CA, United States

    Related jobs
    Senior Data Scientist

    Senior Data Scientist

    EchoTwin AI • San Francisco, California, United States
    Full-time
    EchoTwin AI is the intelligence layer powering self-healing cities—urban systems that not only detect issues in real time, but also trigger automated corrective actions or surface prioritized insig...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Scientist

    Senior Data Scientist

    Decagon • San Francisco, California, United States
    Full-time
    Decagon is the leading conversational AI platform empowering every brand to deliver concierge customer experience.Our AI agents provide intelligent, human-like responses across chat, email, and voi...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Scientist

    Senior Data Scientist

    Verse • San Francisco, California, United States
    Full-time
    Organizations today are under growing pressure to navigate the transition to clean energy — not just to meet sustainability goals, but to manage risk, control costs, and build long-term resilience....Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Scientist

    Senior Data Scientist

    Biohub • Redwood City, California, United States
    Full-time
    Biohub is leading the new era of AI-powered biology to cure or prevent disease through its 501c3 medical research organization, with the support of the Chan Zuckerberg Initiative.Biohub supports th...Show more
    Last updated: 17 days ago • Promoted
    Senior Data Scientist

    Senior Data Scientist

    Abridge • San Francisco, CA, United States
    Full-time
    Abridge was founded in 2018 with the mission of powering deeper understanding in healthcare.Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation eff...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Scientist, Growth Product

    Senior Data Scientist, Growth Product

    Block • San Francisco, CA, United States
    Full-time
    Block is one company built from many blocks, all united by the same purpose of economic empowerment.The blocks that form our foundational teams — People, Finance, Counsel, Hardware, Information Sec...Show more
    Last updated: 5 days ago • Promoted
    Senior Data Scientist

    Senior Data Scientist

    Epoch Biodesign • San Francisco, CA, United States
    Full-time
    Crusoe is building the World’s Favorite AI-first Cloud infrastructure company.We’re pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to p...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Scientist, Analytics (Growth)

    Senior Data Scientist, Analytics (Growth)

    Airwallex • San Francisco, California, United States
    Full-time
    Airwallex is the only unified payments and financial platform for global businesses.Powered by our unique combination of proprietary infrastructure and software, we empower over 150,000 businesses ...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Scientist

    Senior Data Scientist

    EchoTwin AI, Inc. • San Francisco, CA, United States
    Full-time
    EchoTwin AI is pioneering AI-driven infrastructure intelligence, redefining how cities are managed.Powered by a proprietary visual intelligence engine with full spatial reasoning, EchoTwin transfor...Show more
    Last updated: 20 days ago • Promoted
    Senior Data Scientist

    Senior Data Scientist

    ICONIQ • San Francisco, California, United States
    Full-time
    The Senior Data Scientist will be a key member of the Data & Analytics team reporting directly to the Head of Data.This role blends advanced data science and machine learning applications—including...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Scientist

    Senior Data Scientist

    Autodesk • San Francisco, California, United States
    Full-time
    Job Requisition ID # 25WD93554 Develop and implement a set of techniques or analytics applications to transform raw data into meaningful information using data-oriented programming languages and vi...Show more
    Last updated: 3 days ago • Promoted
    Senior Data Scientist - Retailer

    Senior Data Scientist - Retailer

    Faire • San Francisco, California, United States
    Full-time
    Faire is an online wholesale marketplace built on the belief that the future is local — independent retailers around the globe are doing more revenue than Walmart and Amazon combined, but individua...Show more
    Last updated: 30+ days ago • Promoted
    Data Scientist / Senior Data Scientist

    Data Scientist / Senior Data Scientist

    C3 Ai • Redwood City, California, United States
    Full-time
    C3 AI (NYSE : AI), is the Enterprise AI application software company.C3 AI delivers a family of fully integrated products including the C3 Agentic AI Platform, an end-to-end platform for developing,...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Scientist

    Senior Data Scientist

    Plum, Inc. • San Francisco, CA, United States
    Full-time
    PLUM is a fintech company empowering financial institutions to grow their business through a cutting-edge suite of AI-driven software, purpose-built for lenders and their partners across the financ...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Scientist (Geospatial)

    Senior Data Scientist (Geospatial)

    Terradot • San Francisco, California, United States
    Full-time +1
    We are seeking a cross-functional Senior Data Scientist (Geospatial) to lead the development and deployment of scalable geospatial solutions that drive critical business decisions.In this role, you...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Scientist

    Senior Data Scientist

    Fictiv • Oakland, California, United States
    Full-time
    Fictiv Exists to Enable Hardware Innovators to Build Better Products, Faster.Fictiv exists to help product innovators create. Fictiv is a global manufacturing and supply chain company that enables o...Show more
    Last updated: 9 hours ago • Promoted • New!
    Senior Data Scientist – GenAI

    Senior Data Scientist – GenAI

    Liberate • San Francisco, CA, United States
    Full-time
    Liberate is reimagining how the $2.By starting with voice, the most valuable and complex channel in insurance, the company proved that even the hardest problems can be automated.Now expanding into ...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Scientist, Analytics

    Senior Data Scientist, Analytics

    Discord • San Francisco Bay, California, United States
    Full-time
    Discord is used by over 200 million people every month for many different reasons, but there’s one thing that nearly everyone does on our platform : . Over 90% of our users play games, spending a comb...Show more
    Last updated: 30+ days ago • Promoted