Talent.com
Research Scientist - Data (Santa Clara)
Research Scientist - Data (Santa Clara)Storm3 • Santa Clara, CA, United States
No longer accepting applications
Research Scientist - Data (Santa Clara)

Research Scientist - Data (Santa Clara)

Storm3 • Santa Clara, CA, United States
20 hours ago
Job type
  • Full-time
Job description

Research Scientist - Data focus

💊 Foundation Models, AI Research Institute

🌎 San Francisco Bay Area, USA

💸 $200,000 - $350,000 salary + bonus

Come join a revolutionary AI research lab in SF Bay Area that is poised to develop & publish high-impact breakthroughs in GenAI - across LLMs and Multimodal AI.

As part of the team, youll work at the intersection of data, large-scale training, and foundation model innovation. You will collaborate with world-class researchers, data scientists, and engineers to solve critical challenges in creating robust, scalable, and reasoning-capable LLMs. Your research will shape the way data is curated, processed, and leveraged to train the next generation of intelligent systems.

Responsibilities :

  • Lead research on data-centric approaches for LLMs , including pretraining corpus design, data valuation, and speculative decoding strategies.
  • Develop pipelines to process challenging data sources into structured and reproducible training datasets.
  • Build and optimize agentic data pipelines , integrating retrieval, self-curation, and multi-agent feedback for high-quality training and evaluation data.
  • Collaborate with researchers on alignment and reasoning-focused training that leverage data-driven approaches for improving LLM capabilities.
  • Prototype and deploy evaluation frameworks to measure data quality, coverage, and downstream impact on LLM reasoning.
  • Publish findings at top-tier venues (e.g., NeurIPS, ICLR, ACL, EMNLP) and represent the institute at international conferences.
  • Contribute to open-source tools, datasets, and benchmarks that advance the global foundation model research community.

Requirements :

  • Masters degree in Computer Science, Data Science, or a related technical field (PhD strongly preferred)
  • Experience collecting and curating high-quality text data including multi-lingual data.
  • Hands-on experience with large-scale dataset curation and preprocessing for ML / LLM training.
  • Prior works synthesizing complex datasets. Code, math, and agentic data are higher priority
  • Experience with ML infrastructure for scalable training, evaluation, and debugging .
  • Experience at the intersection of data and post-training (RL / SFT)
  • Proven ability to independently drive research questions related to data quality, scaling, or reasoning .
  • Preferred Experience :

  • Experience with retrieval-augmented generation (RAG) , agentic data pipelines, or reasoning benchmarks.
  • Contributions to speculative decoding, self-curation, or reinforcement learning from synthetic data .
  • Background in knowledge graphs, semantic search, or indexing systems .
  • Strong publication record in leading AI conferences.
  • Prior contributions to open-source ML data tools or benchmarks .
  • Prior work on speculative decoding / contributions to LLM serving engines
  • Prior work on training LLM-as-a-judge
  • Deep expertise with tokenization / training tokenizers
  • Why apply :

  • Opportunity to build out a new division at the forefront of AI innovation
  • FAANG competitive salary & package
  • Work alongside superstars from FAANG labs & leading AI companies
  • Medical, Dental and Vision Insurance
  • Relocation package available
  • 🌎 San Francisco Bay Area, USA

    📧 Interested in applying? Please click on the Easy Apply button or alternatively email me your resume at stefani.lukic@storm3.com

    Create a job alert for this search

    Research Data Scientist • Santa Clara, CA, United States

    Related jobs
    Data Scientist, Innovation

    Data Scientist, Innovation

    Picarro • Santa Clara, CA, United States
    Full-time
    Lead Data Scientist, Innovation.Job Location : Santa Clara, CA (preferred) or Remote- US-based.Picarro is transforming gas utility operations with innovative solutions for methane emissions manageme...Show more
    Last updated: 30+ days ago • Promoted
    Research Scientist : 25-04150 (No C2C)

    Research Scientist : 25-04150 (No C2C)

    Akraya Inc • Sunnyvale, California, United States
    Full-time
    Quick Apply
    Primary Skills : Optical Metrology (Expert), Vision Science (Proficient), Python(Advanced), Data Analysis (Expert), Statistical Analysis (Advanced). Duration : 12 Months with possible extemsion.Locati...Show more
    Last updated: 30+ days ago
    Staff Research Scientist (Imaging Optimization)(E)

    Staff Research Scientist (Imaging Optimization)(E)

    KLA • Milpitas, CA, United States
    Full-time
    KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem.Virtually every electronic device in the world is produced using our technologies.No laptop, smartpho...Show more
    Last updated: 2 days ago • Promoted
    Research Scientist

    Research Scientist

    Insight Recruitment • Hayward, CA, United States
    Full-time
    Insight are representing an early-stage AI startup who are fundamentally changing how media content is produced.They are building a cutting-edge multimodal AI platform capable of generating complex...Show more
    Last updated: 1 day ago • Promoted
    Staff Data Scientist, Full Stack (Santa Clara)

    Staff Data Scientist, Full Stack (Santa Clara)

    Palo Alto Networks • Santa Clara, CA, US
    Full-time +1
    At Palo Alto Networks everything starts and ends with our mission : .Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and mo...Show more
    Last updated: 1 day ago • Promoted
    HPE Labs - AI Research Lab Research Associate (Intern)

    HPE Labs - AI Research Lab Research Associate (Intern)

    HPE • Milpitas, CA, United States
    Full-time
    HPE Labs - AI Research Lab Research Associate (Intern).This role has been designed as ‘’Onsite’ with an expectation that you will primarily work from an HPE office. Hewlett Packard Enterprise is the...Show more
    Last updated: 4 days ago • Promoted
    Research scientist (Robotics, AI)

    Research scientist (Robotics, AI)

    IntelliPro Group Inc. • Santa Clara, CA, US
    Full-time
    Quick Apply
    Research scientist (Robotics, AI) Position Type : Full time Location : Santa Clara, CA, USA Salary Range : $150,000 - $230, 000 (USD) Job ID# : 158284​​​​​​...Show more
    Last updated: 30+ days ago
    Product Development Scientist (Santa Clara)

    Product Development Scientist (Santa Clara)

    NotCo • Santa Clara, CA, United States
    Full-time
    We are a group of people united by a strong purpose.We want you to get to know us, and Giuseppe, our very own artificial intelligence. We love good food, but above all, we love the planet, and that'...Show more
    Last updated: 1 day ago • Promoted
    Staff Data Scientist, Platform Performance

    Staff Data Scientist, Platform Performance

    Intuit Inc. • Mountain View, CA, United States
    Full-time
    We are seeking an experienced and highly motivated Staff Data Scientist to join our Platform Analytics team at Intuit.This role will serve as a thought leader and strategic partner to product, plat...Show more
    Last updated: 13 days ago • Promoted
    Senior Research Engineer (Santa Clara)

    Senior Research Engineer (Santa Clara)

    Harrison Clarke • Santa Clara, CA, US
    Part-time
    A fast-growing, deeply technical AI company is looking for a.This is an opportunity to work at the frontier of AI, helping design and evaluate models that can understand, write, and reason about co...Show more
    Last updated: 1 day ago • Promoted
    Research Scientist (San Jose)

    Research Scientist (San Jose)

    Goliath Partners • San Jose, CA, US
    Part-time
    We're working with a San Francisco client that's got a research team of 50~ professionals and looking to further expand it. They are specifically looking to flesh out their Research Group by hiring ...Show more
    Last updated: 1 day ago • Promoted
    Data Science AI Modeler (Life Sciences Biotech) (Santa Clara)

    Data Science AI Modeler (Life Sciences Biotech) (Santa Clara)

    Vida Group International • Santa Clara, CA, United States
    Full-time
    The heart of our Biotech client is at the forefront of biotechnology, leveraging cutting-edge artificial intelligence and machine learning technologies to revolutionize drug discovery and developme...Show more
    Last updated: 1 day ago • Promoted
    Head of Data Science (San Jose)

    Head of Data Science (San Jose)

    Confidential • San Jose, CA, United States
    Full-time
    Innovative AI startup specializing in document redaction solutions.Information Technology and Services.The Company is in search of a Head of Data Science to lead its AI / ML-powered document redactio...Show more
    Last updated: 1 day ago • Promoted
    AI Research Scientist I (Intern) United States

    AI Research Scientist I (Intern) United States

    Cisco • San Jose, CA, United States
    Full-time
    Please note this posting is to advertise potential job opportunities.This exact role may not be open today but could open in the near future. When you apply, a Cisco representative may contact you d...Show more
    Last updated: 5 days ago • Promoted
    research scientist - RL

    research scientist - RL

    Cerebro • Fremont, CA, United States
    Full-time
    Join a Leading Applied Research Lab Pushing the Boundaries of Reinforcement Learning.Are you passionate about advancing the frontiers of. An innovative AI research lab is seeking talented and ambiti...Show more
    Last updated: 1 day ago • Promoted
    Remote Research Scientist (San Jose)

    Remote Research Scientist (San Jose)

    Infused Innovations • San Jose, California, United States, 95101
    Remote
    Full-time +1
    Remote Research Scientist (San Jose).We are seeking a highly motivated and innovative.This role involves designing, conducting, and evaluating research projects that support the development of new ...Show more
    Last updated: 19 hours ago • New!
    AI Foundations - Research Scientist - Research Internship : 2026

    AI Foundations - Research Scientist - Research Internship : 2026

    IBM • San Jose, CA, United States
    Internship
    IBM Research takes responsibility for technology and its role in society.Working in IBM Research means you'll join a team who invent what's next in computing, always choosing the big, urgent and mi...Show more
    Last updated: 5 days ago • Promoted
    Staff Data Scientist (Fremont)

    Staff Data Scientist (Fremont)

    Quantix Search • Fremont, CA, United States
    Full-time
    Staff Data Scientist | San Francisco | $250K$300K + Equity.Were partnering with one of the fastest-growing AI companies in the world to hire a Staff Data Scientist. Backed by over $230M from top-tie...Show more
    Last updated: 1 day ago • Promoted