Talent.com
Research Scientist - Vision Data Infrastructure

Research Scientist - Vision Data Infrastructure

Storm3San Francisco, CA, US
3 days ago
Job type
  • Full-time
Job description

Research Scientists / Engineers (all levels)

Focus on Vision Data Infrastructure

  • ?? Fundamental AI Research Institute
  • ?? San Francisco Bay Area, USA
  • ?? $250,000 - $600,000 salary + annual bonus

Come join one of the only research institutions globally with resources to compete with top AI companies =>

10s of 1000s of GPUs to explore state-of-the-art research in LLMs, Multimodal and Agentic AI.

Currently seeking AI talent with expertise in building scalable pipelines for vision data to support both image / video generative training and multi-modal alignment. You'll design high-performance pipelines for large-scale image and video datasets , enabling efficient pretraining, alignment, and simulation-based data generation.

Responsibilities :

Vision Data Sourcing & Curation

  • Collect and organize image and video data from open datasets and the web.
  • Handle data cleaning, filtering, deduplication, and metadata generation.
  • Ensure ethical and compliant data collection at scale.
  • Processing & Augmentation

  • Build high-throughput pipelines for vision data preprocessing (frame extraction, resolution normalization, format conversion, latent caching).
  • Implement GPU-accelerated augmentation and distributed data loading (WebDataset, TFRecords, Parquet).
  • Synthetic & Simulation-Based Data Generation

  • Use simulation tools (e.g., Unreal Engine 5 , Isaac Sim, Unity) to generate high-quality synthetic vision data
  • Create specialized datasets for VLM training visual reasoning , and agent interaction
  • Requirements :

  • Strong experience with data engineering computer vision , or machine learning infrastructure
  • Expertise in building and scaling ETL / data pipelines for large unstructured datasets.
  • Proficiency with Python PyTorch , and distributed data frameworks (e.g., Ray Spark Dask
  • Experience with WebDataset TFRecords Parquet , or similar high-throughput data formats.
  • Familiarity with GPU-accelerated preprocessing NVIDIA DALI , or equivalent systems.
  • Understanding of image / video codecs data compression , and cloud storage optimization
  • Preferred Experience :

  • Prior work with simulation-based or synthetic data generation using Unreal Engine Isaac Sim , or Unity
  • Experience curating datasets for multimodal or vision-language model training.
  • Knowledge of data ethics privacy , and compliance frameworks for large-scale AI datasets.
  • Experience contributing to open datasets or data-centric AI research
  • Why apply :

  • Opportunity to join a fast-growing core team that are already pushing AI breakthroughs
  • Highly competitive salary package
  • Work alongside ambitious and bright superstars from tech and academia
  • Medical, Dental and Vision Insurance
  • Relocation package available
  • ?? San Francisco Bay Area, USA
  • ?? Interested in applying? Please click on the 'Easy Apply' button or alternatively email me your resume at stefani.lukic@storm3.com
  • Create a job alert for this search

    Research Scientist • San Francisco, CA, US

    Related jobs
    • Promoted
    Data Scientist, Innovation

    Data Scientist, Innovation

    PicarroSanta Clara, CA, United States
    Full-time
    Lead Data Scientist, Innovation.Job Location : Santa Clara, CA (preferred) or Remote- US-based.Picarro is transforming gas utility operations with innovative solutions for methane emissions manageme...Show moreLast updated: 30+ days ago
    • Promoted
    AI Research Scientist / Engineer

    AI Research Scientist / Engineer

    PhizenixMenlo Park, CA, US
    Full-time +1
    AI Research Scientist / Engineer.Menlo Park, CA | On-Site | Full-Time / Direct Hire .Seeking top-tier PhDs (Bay Area preferred) with ICML / ICLR publications in LLM training and inference optimizati...Show moreLast updated: 30+ days ago
    • Promoted
    Research Data Scientist

    Research Data Scientist

    University of California - San FranciscoSan Francisco, CA, United States
    Full-time
    The research data analyst will join the Department of Ophthalmology and work closely with a collaborative and exciting team at the F. Proctor Foundation studying eye diseases in the U.Sun's Lab Here...Show moreLast updated: 30+ days ago
    • Promoted
    Research Scientist

    Research Scientist

    Sei LabsSan Francisco, CA, United States
    Full-time
    Sei Labs builds open sourced technology for the high-performance Sei Blockchain, the first parallelized EVM Layer 1 blockchain designed to scale with the industry. The unique optimizations built int...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Data Scientist, Infrastructure

    Principal Data Scientist, Infrastructure

    RobloxSan Mateo, CA, United States
    Full-time
    Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers...Show moreLast updated: 30+ days ago
    • Promoted
    Data Scientist

    Data Scientist

    Lawrence Berkeley National LaboratorySan Francisco, CA, United States
    Full-time
    Lawrence Berkeley National Lab’s (.Energy Geosciences Division has an opening for a Data Scientist to join the team.In this role, you will be primarily responsible for the development, implementati...Show moreLast updated: 23 days ago
    • Promoted
    Senior Data Scientist, Research, Operations Data Science

    Senior Data Scientist, Research, Operations Data Science

    GoogleSan Francisco, CA, United States
    Full-time
    Senior Data Scientist, Research, Operations Data Science.Be among the first 25 applicants.Applicants in San Francisco : Qualified applications with arrest or conviction records will be considered fo...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Scientist, Search and Recommender Systems- VenturaOS

    Senior Data Scientist, Search and Recommender Systems- VenturaOS

    The Trade DeskSan Jose, CA, United States
    Full-time
    We're a small, high-impact team building the next-generation TV operating system.Our flexible structure gives engineers direct influence over product direction-an opportunity rarely found in larger...Show moreLast updated: 30+ days ago
    • Promoted
    Research Scientist (diffusion)

    Research Scientist (diffusion)

    GenmoSan Francisco, CA, United States
    Full-time
    We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the bo...Show moreLast updated: 30+ days ago
    • Promoted
    Research Scientist, Low Level Vision, Level 4

    Research Scientist, Low Level Vision, Level 4

    Snap Inc.San Francisco, CA, US
    Full-time
    We believe the camera presents the greatest opportunity to improve the way people live and communicate.Snap contributes to human progress by empowering people to express themselves, live in the mom...Show moreLast updated: 30+ days ago
    • Promoted
    Research Scientist

    Research Scientist

    Menlo VenturesSan Francisco, CA, United States
    Full-time
    Behind our name : Like fire, AI holds the potential for both immense benefit and significant risk.Just as mastering fire transformed human history, we believe the safe and intentional development of...Show moreLast updated: 18 days ago
    • Promoted
    Research Engineer, Data Infrastructure

    Research Engineer, Data Infrastructure

    Halodi RoboticsPalo Alto, CA, United States
    Full-time
    Target start date : Immediately.Since its founding in 2015, 1X has been at the forefront of developing advanced humanoid robots designed for household use. Our mission is to create an abundant supply...Show moreLast updated: 6 days ago
    • Promoted
    Data Scientist

    Data Scientist

    WaymoSan Francisco, CA, United States
    Full-time
    Waymo is an autonomous driving technology company with the mission to be the most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Wa...Show moreLast updated: 9 days ago
    • Promoted
    Data Scientist

    Data Scientist

    VisaFoster City, CA, United States
    Full-time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...Show moreLast updated: 30+ days ago
    • Promoted
    Research Engineer – Synthetic Data for Vision

    Research Engineer – Synthetic Data for Vision

    SesameSan Francisco, CA, United States
    Full-time
    Sesame believes in a future where computers are lifelike - with the ability to see, hear, and collaborate with us in ways that feel natural and human. With this vision, we're designing a new kind of...Show moreLast updated: 25 days ago
    • Promoted
    Research Scientist, World Models

    Research Scientist, World Models

    WaabiSan Francisco, CA, United States
    Full-time
    Waabi, founded by AI pioneer and visionary Raquel Urtasun, is an AI company building the next generation of self-driving technology. With a world class team and an innovative approach that unleashes...Show moreLast updated: 11 days ago
    • Promoted
    Research Scientist - Applied AI

    Research Scientist - Applied AI

    P-1 AISan Francisco, CA, United States
    Full-time
    We are building an engineering AGI.We founded P-1 AI with the conviction that the greatest impact of artificial intelligence will be on the built world—helping mankind conquer nature and bend it to...Show moreLast updated: 12 days ago
    • Promoted
    Research Engineer - AI Infrastructure

    Research Engineer - AI Infrastructure

    Applied IntuitionMountain View, CA, United States
    Full-time
    Mountain View, California, United States.Applied Intuition is the vehicle intelligence company that accelerates the global adoption of safe, AI-driven machines. Founded in 2017, Applied Intuition de...Show moreLast updated: 1 day ago