Talent.com
No longer accepting applications
Senior Data Scientist - Machine Learning Data Operations (San Francisco)

Senior Data Scientist - Machine Learning Data Operations (San Francisco)

Turbine OneSan Francisco, CA, United States
12 hours ago
Job type
  • Full-time
Job description

Senior Data Scientist - Machine Learning Data Operations

Senior Data Scientist - Machine Learning Operations

About The Job

Company Intro : TurbineOne is the frontline perception company. We deliver decision advantage, better situational awareness, and stronger force protection. Our customers love how we automate the right portions of the military intelligence cycle while keeping them in the loop. The company is a small, fast-moving, and high-performance startup that is backed by the best DefenseTech venture capitalists.

Job Title : Data Scientist

  • Reporting to the Machine Learning team lead
  • Geographically flexible for home-office

Primary Responsibilities :

  • Ingesting, organizing, and maintaining large-scale training datasets from open-source resources and contract-specific artifacts
  • Creating and managing data cataloging systems to ensure datasets are findable, accessible, and ready for ML training pipelines
  • Designing and implementing data labeling workflows, including managing external labeling vendors and quality assurance processes
  • Building and maintaining YOLO-style manifests and annotation formats for custom computer vision datasets
  • Performing data cleaning, validation, and augmentation to ensure high-quality training data
  • Conducting exploratory data analysis and generating insights about dataset characteristics, biases, and coverage gaps
  • Supporting the ML research team with statistical analysis, experiment design, and model evaluation
  • Developing data pipelines and automation tools for continuous data ingestion and processing
  • Collaborating with ML engineers to optimize data loading and preprocessing for training efficiency
  • On A Typical Day You Would :

  • Process incoming datasets from various sources, performing quality checks and organizing them into our data management system
  • Create or review annotation schemas and coordinate with labeling teams to ensure consistent, high-quality labels
  • Write Python scripts to clean, transform, and validate datasets for specific ML training requirements
  • Analyze dataset statistics and create visualizations to identify potential issues or opportunities for improvement
  • Collaborate with the ML research lead to design experiments and evaluate model performance across different data splits
  • Document dataset characteristics, versioning, and lineage to maintain reproducibility and compliance
  • Desired Experience :

  • High standard of ethics, grit, integrity and moral character
  • 5+ years of experience in data science, analytics, or related field with focus on ML data preparation
  • Strong foundation in probability, statistics, and experimental design
  • Bachelor's degree in Statistics, Mathematics, Computer Science, or related quantitative field (Master's preferred)
  • Proficiency with Python data stack : Pandas, NumPy, Jupyter Notebooks, and data visualization libraries
  • Experience with ML frameworks (PyTorch, Scikit-learn) and familiarity with training workflows
  • Hands-on experience with computer vision datasets and annotation formats (COCO, YOLO, Pascal VOC)
  • Experience managing data labeling projects and working with annotation tools (Label Studio, CVAT, or similar)
  • Familiarity with open-source ML models and experience applying them to real-world problems
  • Strong SQL skills and experience with data warehousing concepts
  • Experience with version control (Git) and collaborative development practices
  • Excellent communication skills for coordinating with technical and non-technical stakeholders
  • Meticulous attention to detail and strong organizational skills for managing complex datasets
  • Willingness to embrace the Startup Culture of moving fast, being insatiably curious, celebrating often, embracing uncertainty, and having a personal desire to improve other peoples' lives
  • Nice To Have :

  • Experience with defense or security-related datasets
  • Knowledge of edge computing constraints and data optimization techniques
  • Experience with distributed data processing frameworks (Spark, Dask)
  • Familiarity with MLOps practices and tools
  • Background in specific domains relevant to perception systems (satellite imagery, sensor fusion, etc.)
  • Startup Culture Expectations :

  • We're a small, fully remote team and everything is our responsibility
  • Our team thrives on autonomy, trust and solid communication
  • Everyone on the Team needs to be very comfortable with constant change, moving fast, sharing failures, embracing grit, and building things themselves
  • Eligibility :

  • Must be eligible to obtain a clearance with the U.S. government
  • Create a job alert for this search

    Senior Data Scientist • San Francisco, CA, United States

    Related jobs
    • Promoted
    Senior Data Scientist (San Francisco Bay Area)

    Senior Data Scientist (San Francisco Bay Area)

    HarnhamSan Francisco Bay Area, US
    Part-time
    SENIOR DATA SCIENTIST - SEARCH AND RECOMMENDATIONS.SAN FRANCISCO BAY AREA (HYBRID).Were seeking a Data Scientist II to drive next-generation personalization and discovery for a fast-growing social ...Show moreLast updated: 4 days ago
    • Promoted
    Senior Applied Machine Learning Scientist

    Senior Applied Machine Learning Scientist

    SanasPalo Alto, CA, United States
    Full-time
    Sanas is revolutionizing the way we communicate with the world’s first real-time algorithm, designed to modulate accents, eliminate background noises, and magnify speech clarity.Pioneered by season...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Scientist (Alameda)

    Senior Data Scientist (Alameda)

    HarnhamAlameda, CA, US
    Part-time
    SENIOR DATA SCIENTIST - SEARCH AND RECOMMENDATIONS.SAN FRANCISCO BAY AREA (HYBRID).Were seeking a Data Scientist II to drive next-generation personalization and discovery for a fast-growing social ...Show moreLast updated: 4 days ago
    • Promoted
    Senior Data Scientist

    Senior Data Scientist

    HarnhamSan Jose, CA, US
    Full-time
    SENIOR DATA SCIENTIST - SEARCH AND RECOMMENDATIONS.SAN FRANCISCO BAY AREA (HYBRID).We’re seeking a Data Scientist II to drive next-generation personalization and discovery for a fast-growing ...Show moreLast updated: 6 days ago
    • Promoted
    Senior Machine Learning Scientist, BRAID (Clinical Sciences ML)

    Senior Machine Learning Scientist, BRAID (Clinical Sciences ML)

    GenentechSan Francisco, CA, United States
    Full-time
    It’s what drives us to innovate.To continuously advance science and ensure everyone has access to the healthcare they need today and for generations to come. Creating a world where we all have more ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Machine Learning Scientist, AI for Drug Discovery (Frontier Research)

    Senior Machine Learning Scientist, AI for Drug Discovery (Frontier Research)

    GenentechSan Francisco, CA, United States
    Full-time
    It’s what drives us to innovate.To continuously advance science and ensure everyone has access to the healthcare they need today and for generations to come. Creating a world where we all have more ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Scientist (San Francisco)

    Senior Data Scientist (San Francisco)

    HarnhamSan Francisco, CA, US
    Part-time
    SENIOR DATA SCIENTIST - SEARCH AND RECOMMENDATIONS.SAN FRANCISCO BAY AREA (HYBRID).Were seeking a Data Scientist II to drive next-generation personalization and discovery for a fast-growing social ...Show moreLast updated: 4 days ago
    • Promoted
    Senior Data Scientist, Analytics - Ads Product and GTM

    Senior Data Scientist, Analytics - Ads Product and GTM

    King River Capital GroupSan Francisco, CA, United States
    Full-time
    Senior Data Scientist, Analytics - Ads Product and GTM.Discord is used by over 200 million people every month for many different reasons, but there’s one thing that nearly everyone does on our plat...Show moreLast updated: 8 days ago
    • Promoted
    R&D Senior Data Scientist

    R&D Senior Data Scientist

    Division5Palo Alto, CA, United States
    Full-time
    In this role, you’ll own the end-to-end lifecycle of advanced ML / DL systems—from research and data strategy to deployment—while shaping our technical roadmap and mentoring peers.If you are passiona...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Data Scientist, Forecasting

    Machine Learning Data Scientist, Forecasting

    OpenAISan Francisco, CA, United States
    Full-time
    Machine Learning Data Scientist, Forecasting.The Strategic Finance team at OpenAI plays a critical role in shaping the company’s long-term trajectory. We partner closely with Product, Engineering, a...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Scientist (San Jose)

    Senior Data Scientist (San Jose)

    HarnhamSan Jose, CA, US
    Part-time
    SENIOR DATA SCIENTIST - SEARCH AND RECOMMENDATIONS.SAN FRANCISCO BAY AREA (HYBRID).Were seeking a Data Scientist II to drive next-generation personalization and discovery for a fast-growing social ...Show moreLast updated: 4 days ago
    • Promoted
    • New!
    Lead Data Scientist - Personalization & Recommendation Engines

    Lead Data Scientist - Personalization & Recommendation Engines

    HarnhamSan Jose, CA, United States
    Full-time
    Lead Data Scientist - Personalization & Recommendation Engines.SF Bay Area - Hybrid (3 days / week onsite).A leading commerce marketplace with 130M+ users and billions of daily events is hiring a Sta...Show moreLast updated: 19 hours ago
    • Promoted
    Senior Data Scientist, Search and Recommender Systems- VenturaOS

    Senior Data Scientist, Search and Recommender Systems- VenturaOS

    The Trade DeskSan Jose, CA, United States
    Full-time
    We're a small, high-impact team building the next-generation TV operating system.Our flexible structure gives engineers direct influence over product direction-an opportunity rarely found in larger...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Scientist, Analytics

    Senior Data Scientist, Analytics

    King River Capital GroupSan Francisco, CA, United States
    Full-time
    Discord is used by over 200 million people every month for many different reasons, but there’s one thing that nearly everyone does on our platform : . Over 90% of our users play games, spending a comb...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Data Scientist / Machine Learning Engineer - Retailer Growth Operations

    Staff Data Scientist / Machine Learning Engineer - Retailer Growth Operations

    StartopsSan Francisco, CA, United States
    Full-time
    Staff Data Scientist / Machine Learning Engineer - Retailer Growth.Build ML models to personalize retailer onboarding and engagement strategies. San Francisco, California, United States.Staff Data S...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    MeltwaterRedwood City, CA, United States
    Full-time
    Meltwater's Consumer Intelligence AI Team is looking for a.Natural Language Processing or Computer Vision features relying on the literature's state of the art. Those features are meant to be integr...Show moreLast updated: 19 hours ago
    • Promoted
    Senior Staff Data Scientist- Machine Learning

    Senior Staff Data Scientist- Machine Learning

    SoFiSan Francisco, CA, United States
    Full-time
    Senior Staff Data Scientist- Machine Learning.This range is provided by SoFi.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.Employee Applicant ...Show moreLast updated: 2 days ago
    • Promoted
    Senior Machine Learning Scientist

    Senior Machine Learning Scientist

    Expedia, Inc.San Jose, CA, United States
    Full-time
    Expedia Group brands power global travel for everyone, everywhere.We design cutting-edge tech to make travel smoother and more memorable, and we create groundbreaking solutions for our partners.Our...Show moreLast updated: 27 days ago
    • Promoted
    Senior Machine Learning Scientist (BRAID - ML for Clinical Sciences)

    Senior Machine Learning Scientist (BRAID - ML for Clinical Sciences)

    GenentechSan Francisco, CA, United States
    Full-time
    It’s what drives us to innovate.To continuously advance science and ensure everyone has access to the healthcare they need today and for generations to come. Creating a world where we all have more ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Scientist, Machine Learning

    Senior Scientist, Machine Learning

    dimensioncap.comBerkeley, CA, United States
    Full-time
    Kimia Therapeutics is building a next-generation drug discovery engine that combines a unique physical platform for high-throughput chemistry and screening with state-of-the-art computational appro...Show moreLast updated: 30+ days ago