Talent.com
Research Scientist - Data (Santa Clara)
Research Scientist - Data (Santa Clara)Storm3 • Santa Clara, CA, United States
No longer accepting applications
Research Scientist - Data (Santa Clara)

Research Scientist - Data (Santa Clara)

Storm3 • Santa Clara, CA, United States
19 hours ago
Job type
  • Full-time
Job description

Research Scientist - Data focus

💊 Foundation Models, AI Research Institute

🌎 San Francisco Bay Area, USA

💸 $200,000 - $350,000 salary + bonus

Come join a revolutionary AI research lab in SF Bay Area that is poised to develop & publish high-impact breakthroughs in GenAI - across LLMs and Multimodal AI.

As part of the team, youll work at the intersection of data, large-scale training, and foundation model innovation. You will collaborate with world-class researchers, data scientists, and engineers to solve critical challenges in creating robust, scalable, and reasoning-capable LLMs. Your research will shape the way data is curated, processed, and leveraged to train the next generation of intelligent systems.

Responsibilities :

  • Lead research on data-centric approaches for LLMs , including pretraining corpus design, data valuation, and speculative decoding strategies.
  • Develop pipelines to process challenging data sources into structured and reproducible training datasets.
  • Build and optimize agentic data pipelines , integrating retrieval, self-curation, and multi-agent feedback for high-quality training and evaluation data.
  • Collaborate with researchers on alignment and reasoning-focused training that leverage data-driven approaches for improving LLM capabilities.
  • Prototype and deploy evaluation frameworks to measure data quality, coverage, and downstream impact on LLM reasoning.
  • Publish findings at top-tier venues (e.g., NeurIPS, ICLR, ACL, EMNLP) and represent the institute at international conferences.
  • Contribute to open-source tools, datasets, and benchmarks that advance the global foundation model research community.

Requirements :

  • Masters degree in Computer Science, Data Science, or a related technical field (PhD strongly preferred)
  • Experience collecting and curating high-quality text data including multi-lingual data.
  • Hands-on experience with large-scale dataset curation and preprocessing for ML / LLM training.
  • Prior works synthesizing complex datasets. Code, math, and agentic data are higher priority
  • Experience with ML infrastructure for scalable training, evaluation, and debugging .
  • Experience at the intersection of data and post-training (RL / SFT)
  • Proven ability to independently drive research questions related to data quality, scaling, or reasoning .
  • Preferred Experience :

  • Experience with retrieval-augmented generation (RAG) , agentic data pipelines, or reasoning benchmarks.
  • Contributions to speculative decoding, self-curation, or reinforcement learning from synthetic data .
  • Background in knowledge graphs, semantic search, or indexing systems .
  • Strong publication record in leading AI conferences.
  • Prior contributions to open-source ML data tools or benchmarks .
  • Prior work on speculative decoding / contributions to LLM serving engines
  • Prior work on training LLM-as-a-judge
  • Deep expertise with tokenization / training tokenizers
  • Why apply :

  • Opportunity to build out a new division at the forefront of AI innovation
  • FAANG competitive salary & package
  • Work alongside superstars from FAANG labs & leading AI companies
  • Medical, Dental and Vision Insurance
  • Relocation package available
  • 🌎 San Francisco Bay Area, USA

    📧 Interested in applying? Please click on the Easy Apply button or alternatively email me your resume at stefani.lukic@storm3.com

    Create a job alert for this search

    Research Data Scientist • Santa Clara, CA, United States

    Related jobs
    Data Scientist, Innovation

    Data Scientist, Innovation

    Picarro • Santa Clara, CA, United States
    Full-time
    Lead Data Scientist, Innovation.Job Location : Santa Clara, CA (preferred) or Remote- US-based.Picarro is transforming gas utility operations with innovative solutions for methane emissions manageme...Show more
    Last updated: 30+ days ago • Promoted
    Research Scientist : 25-04150 (No C2C)

    Research Scientist : 25-04150 (No C2C)

    Akraya Inc • Sunnyvale, California, United States
    Full-time
    Quick Apply
    Primary Skills : Optical Metrology (Expert), Vision Science (Proficient), Python(Advanced), Data Analysis (Expert), Statistical Analysis (Advanced). Duration : 12 Months with possible extemsion.Locati...Show more
    Last updated: 30+ days ago
    Data Scientist Machine Learning Focus

    Data Scientist Machine Learning Focus

    Adidev Technologies • San Jose, California, United States
    Remote
    Full-time
    Role : Data Scientist – Machine Learning Focus.This role is open to US Citizens, Green Card holders, GC-EAD only.Adidev Technologies is seeking 2 yrs of relevant experience in Data Science.A project...Show more
    Last updated: 30+ days ago • Promoted
    Research Scientist

    Research Scientist

    Insight Recruitment • Hayward, CA, United States
    Full-time
    Insight are representing an early-stage AI startup who are fundamentally changing how media content is produced.They are building a cutting-edge multimodal AI platform capable of generating complex...Show more
    Last updated: 1 day ago • Promoted
    Data Scientist (Dublin, CA)

    Data Scientist (Dublin, CA)

    Savvymoney • Dublin, California, United States
    Full-time
    SavvyMoney is a leading San Francisco East Bay fintech company.We provide integrated credit score and personal finance solutions to 1,500 + bank and credit union partners nationally.The SavvyMoney ...Show more
    Last updated: 30+ days ago • Promoted
    Data Scientist - Open on W2 only

    Data Scientist - Open on W2 only

    Dataflix • San Jose, CA, United States
    Remote
    Full-time
    We are seeking a highly motivated and detail-oriented Data Scientist to join our team.In this role, you will be responsible for analyzing large datasets to uncover insights, develop predictive mode...Show more
    Last updated: 30+ days ago • Promoted
    Staff Data Scientist, Full Stack (Santa Clara)

    Staff Data Scientist, Full Stack (Santa Clara)

    Palo Alto Networks • Santa Clara, CA, US
    Full-time +1
    At Palo Alto Networks everything starts and ends with our mission : .Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and mo...Show more
    Last updated: 1 day ago • Promoted
    Research scientist (Robotics, AI)

    Research scientist (Robotics, AI)

    IntelliPro Group Inc. • Santa Clara, CA, US
    Full-time
    Quick Apply
    Research scientist (Robotics, AI) Position Type : Full time Location : Santa Clara, CA, USA Salary Range : $150,000 - $230, 000 (USD) Job ID# : 158284​​​​​​...Show more
    Last updated: 30+ days ago
    Product Development Scientist (Santa Clara)

    Product Development Scientist (Santa Clara)

    NotCo • Santa Clara, CA, United States
    Full-time
    We are a group of people united by a strong purpose.We want you to get to know us, and Giuseppe, our very own artificial intelligence. We love good food, but above all, we love the planet, and that'...Show more
    Last updated: 1 day ago • Promoted
    Senior Research Engineer (Santa Clara)

    Senior Research Engineer (Santa Clara)

    Harrison Clarke • Santa Clara, CA, US
    Part-time
    A fast-growing, deeply technical AI company is looking for a.This is an opportunity to work at the frontier of AI, helping design and evaluate models that can understand, write, and reason about co...Show more
    Last updated: 1 day ago • Promoted
    Sr Research Data Scientist

    Sr Research Data Scientist

    Roku • San Jose, California, United States
    Full-time
    Teamwork makes the stream work.Roku is changing how the world watches TV.Roku is the #1 TV streaming platform in the U.Canada, and Mexico, and we've set our sights on powering every television in t...Show more
    Last updated: 30+ days ago • Promoted
    Research Scientist (San Jose)

    Research Scientist (San Jose)

    Goliath Partners • San Jose, CA, US
    Part-time
    We're working with a San Francisco client that's got a research team of 50~ professionals and looking to further expand it. They are specifically looking to flesh out their Research Group by hiring ...Show more
    Last updated: 1 day ago • Promoted
    AI Research Scientist I (Intern) United States

    AI Research Scientist I (Intern) United States

    Cisco • San Jose, CA, United States
    Full-time
    Please note this posting is to advertise potential job opportunities.This exact role may not be open today but could open in the near future. When you apply, a Cisco representative may contact you d...Show more
    Last updated: 4 days ago • Promoted
    research scientist - RL

    research scientist - RL

    Cerebro • Fremont, CA, United States
    Full-time
    Join a Leading Applied Research Lab Pushing the Boundaries of Reinforcement Learning.Are you passionate about advancing the frontiers of. An innovative AI research lab is seeking talented and ambiti...Show more
    Last updated: 1 day ago • Promoted
    Remote Research Scientist (San Jose)

    Remote Research Scientist (San Jose)

    Infused Innovations • San Jose, California, United States, 95101
    Remote
    Full-time +1
    Remote Research Scientist (San Jose).We are seeking a highly motivated and innovative.This role involves designing, conducting, and evaluating research projects that support the development of new ...Show more
    Last updated: 18 hours ago • New!
    AI Foundations - Research Scientist - Research Internship : 2026

    AI Foundations - Research Scientist - Research Internship : 2026

    IBM • San Jose, CA, United States
    Internship
    IBM Research takes responsibility for technology and its role in society.Working in IBM Research means you'll join a team who invent what's next in computing, always choosing the big, urgent and mi...Show more
    Last updated: 4 days ago • Promoted
    Associate Fraud Strategy Data Scientist San Jose, CA

    Associate Fraud Strategy Data Scientist San Jose, CA

    Esrhealthcare • San Jose, California, United States
    Full-time
    Associate Fraud Strategy Data Scientist San Jose, CA.Fraud Strategy Data Scientist, Risk Data Scientist w / Fraud, Risk Analytics, Data Analysis, Data Science, Fraud Mitigation, Industry : eCommerce, ...Show more
    Last updated: 30+ days ago • Promoted
    Staff Data Scientist (Fremont)

    Staff Data Scientist (Fremont)

    Quantix Search • Fremont, CA, United States
    Full-time
    Staff Data Scientist | San Francisco | $250K$300K + Equity.Were partnering with one of the fastest-growing AI companies in the world to hire a Staff Data Scientist. Backed by over $230M from top-tie...Show more
    Last updated: 1 day ago • Promoted