Senior Machine Learning EngineerTroveo Ai • San Francisco, CA, United States

Senior Machine Learning Engineer

Troveo Ai • San Francisco, CA, United States

2 days ago

Job type

Full-time

Job description

About Troveo

Troveo is building the next-generation data platform to train AI video models. Troveo offers the world’s largest library of AI video training data, featuring millions of hours of licensed video content. Our end-to-end data pipeline connects creators, rights holders, and AI research labs, enabling scalable, compliant, and innovative uses of video across for AI application and model development.

We are an early‑stage, high-growth venture backed by forward‑thinking investors, and we are seeking an innovative strategic engineer to help us scale.

Role Overview

The Senior Machine Learning Engineer will play a central role in designing, building, and optimizing large‑scale machine learning pipelines for AI video model training. You’ll work across the full ML lifecycle, from structuring massive datasets to deploying, evaluating, and training models in production.

This is a hands‑on, high‑impact role for an engineer who thrives on scale, autonomy, and cross‑functional collaboration. You will combine deep technical expertise with strong communication and business acumen, translating models into measurable costs, performance targets, and real‑world outcomes.

Key Responsibilities

Data Curation & Indexing Pipelines – Architect and implement large‑scale pipelines for video ingestion, metadata extraction, and indexing using vector databases and embedding models to enable fast, semantic retrieval. Design annotation workflows integrating active learning, weak supervision, and human‑in‑the‑loop systems to curate high‑quality labeled datasets for video models. Contribute to optimizing data partitioning, sharding, and caching strategies to handle petabyte‑scale video corpora, ensuring low‑latency search and robust data lineage.
Model Training & Evaluation – Develop and fine‑tune multimodal models (e.g., CLIP variants, transformer‑based encoders) for video embeddings, scene segmentation, and relevance ranking using PyTorch and Hugging Face. Build evaluation frameworks with metrics like NDCG, mAP, and annotation consistency scores to iteratively improve search accuracy and annotation efficiency. Deploy models via containerized services with A / B testing and monitoring for drift detection in production search and annotation pipelines. Collaborate with Product and Operations teams to translate ML performance into business insights and cost implications.
Infrastructure & Optimization – Scale ML infrastructure on AWS, leveraging multi‑GPU clusters and distributed training to accelerate embedding computation and indexing jobs. Implement testing and deployment processes across large distributed systems. Fine‑tune OSS models. Working knowledge in training large models is a plus. Implement automated CI / CD for model versioning, hyperparameter tuning, and resource orchestration to minimize compute costs and maximize GPU utilization. Profile and tune systems for bottlenecks in vector similarity search, batch annotation, and real‑time querying.
Cross‑Functional Collaboration – Partner with product, research, and data teams to align ML outputs with business KPIs, such as search latency targets and annotation throughput. Translate technical trade‑offs (e.g., recall vs. precision in embeddings) into actionable insights for stakeholders, fostering adoption in video discovery features. Work closely with data engineers, research scientists, and product teams to align model performance with strategic business goals. Communicate technical concepts clearly to both technical and non‑technical stakeholders. Take ownership of project outcomes in a fast‑paced, startup environment.

Qualifications & Experience

6+ years in ML engineering, with a focus on information retrieval, embedding systems, or data annotation pipelines.

Proven track record building scalable indexing and search infrastructure, including vector stores and similarity search algorithms.

Expertise in Python and PyTorch for core model development; hands‑on experience with Hugging Face Transformers for multimodal embeddings and fine‑tuning.

Working experience with video, computer vision, and multi‑modal LLMs.

Hands‑on experience deploying models in production environments and measuring model accuracy.

Proficiency in ML ops tools (e.g., MLflow, Weights & Biases) for experimentation, versioning, and deployment.

Hands‑on experience with production ML deployment, evaluation metrics for retrieval / annotation tasks, and cost‑optimized scaling on cloud platforms like AWS.

Strong analytical skills for dissecting performance in large distributed systems; familiarity with multi‑GPU training and vector databases preferred.

Excellent communication to bridge technical depth with strategic priorities in collaborative settings.

Nice to Have

Prior experience training video models or working with video‑based datasets.

Demonstrated expertise in GPU optimization and large‑scale compute performance tuning.

A blend of startup agility and big tech rigor.

Contributions to open source development and projects.

Experience working with search ranking algorithms.

Location & Compensation

Location : Strong preference for candidates based in the San Francisco Bay Area.

Compensation : $200,000 – $400,000 base salary + equity.

Why Join Troveo?

Work at the cutting edge of AI, video, and large‑scale data infrastructure.

Build systems that directly power the next generation of AI video models.

Collaborate with a world‑class team of engineers, researchers, and industry experts.

High autonomy, high impact, your work will shape the foundation of our platform.

Competitive compensation with meaningful equity upside.

#J-18808-Ljbffr

Create a job alert for this search

Senior Machine Learning Engineer • San Francisco, CA, United States

Related jobs

Senior Machine Learning Engineer

Quantix Search • Fremont, CA, United States

Full-time

Senior Machine Learning Engineer.San Francisco | Hybrid, 3 days / week | $200K - $280K + equity.We are excited to partner with a rapidly growing healthtech startup that has successfully raised $40M i...Show more

Last updated: 4 days ago • Promoted

Senior Machine Learning Engineer

Scribd • San Francisco, CA, United States

Full-time

At Scribd (pronounced “scribbed”), our mission is to spark human curiosity.Join our team as we create a world of stories and knowledge, democratize the exchange of ideas and information, and empowe...Show more

Last updated: 30+ days ago • Promoted

Senior Machine Learning Engineer

Top Engineer • San Francisco, CA, United States

Full-time

Confidential Search for International Employer.Social Commerce / AI Technology.BS in Computer Science or Mathematics from Top 40 University. AI-POWERED SOCIAL COMMERCE REVOLUTION.Role : Senior Machin...Show more

Last updated: 1 day ago • Promoted

Senior Machine Learning Engineer

Harnham • San Francisco, CA, United States

Full-time

Join a fast‑paced startup environment with 10+ independent product lines, each solving unique financial use cases.Engineering teams are lean and focused on top talent—not mass hiring.We prioritize ...Show more

Last updated: 30+ days ago • Promoted

Senior / Staff Machine Learning Engineer, Perception

Agtonomy • San Francisco, CA, United States

Full-time

At Agtonomy, we’re not just building tech—we’re transforming how vital industries get work done.Our Physical AI and fleet services turn heavy machinery into intelligent, autonomous systems that tac...Show more

Last updated: 30+ days ago • Promoted

Senior Machine Learning Engineer (Recommendations)

Scribd • San Francisco, CA, United States

Full-time

Last updated: 30+ days ago • Promoted

Senior Machine Learning Engineer, ML Platform

Block • San Francisco, CA, United States

Full-time

Senior Machine Learning Engineer, ML Platform.Block is one company built from many blocks, all united by the same purpose of economic empowerment. The blocks that form our foundational teams — Peopl...Show more

Last updated: 9 days ago • Promoted

Senior Machine Learning Engineer

HRB • San Francisco, CA, United States

Full-time

Senior Machine Learning Engineer.Location : San Jose, CA (Onsite).About Mendel : Mendel AI is at the forefront of transforming healthcare through cutting-edge AI and technology.Our mission is to harn...Show more

Last updated: 30+ days ago • Promoted

Senior Machine Learning Engineer - GenAI Platform

Menlo Ventures • San Francisco, CA, United States

Full-time

Founded in late 2020 by a small group of machine learning engineers and researchers, Mosaic AI enables companies to securely fine‑tune, train and deploy custom AI models on their own data, for maxi...Show more

Last updated: 11 days ago • Promoted

Senior Machine Learning Engineer

Ontra • San Francisco, CA, United States

Full-time

Join to apply for the Senior Machine Learning Engineer role at Ontra.Ontra is the leader in AI‑powered solutions for the private markets. Powered by industry‑leading AI, data from over 2 million con...Show more

Last updated: 6 days ago • Promoted

Senior Machine Learning Engineer

RainesDev • San Francisco, CA, United States

Full-time

We're a leading partner in social commerce, collaborating with major athletic wear, footwear, and electronics brands to expand their influencer-driven sales channels. Having achieved significant rev...Show more

Last updated: 2 days ago • Promoted

Senior Machine Learning Engineer

Trucksmarter • San Francisco, CA, United States

Full-time

Truck drivers are the backbone of the American economy.They move 71% of our freight (~$800bn annually) and make up 6% of the American workforce. But the trucking industry is fragmented, opaque, and ...Show more

Last updated: 30+ days ago • Promoted

Senior Machine Learning Engineer, AI Collaboration

Snowflake Computing • Menlo Park, CA, United States

Full-time

Snowflake is about empowering enterprises to achieve their full potential - and people too.With a culture that's all in on impact, innovation, and collaboration, Snowflake is the sweet spot for bui...Show more

Last updated: 4 days ago • Promoted

Senior Machine Learning Engineer - GenAI Platform

Databricks Inc. • San Francisco, CA, United States

Full-time

Founded in late 2020 by a small group of machine learning engineers and researchers, Mosaic AI enables companies to securely fine-tune, train and deploy custom AI models on their own data, for maxi...Show more

Last updated: 30+ days ago • Promoted

Senior Staff Machine Learning Engineer

Rippling • San Francisco, CA, United States

Full-time

Rippling gives businesses one place to run HR, IT, and Finance.It brings together all of the workforce systems that are normally scattered across a company, like payroll, expenses, benefits, and co...Show more

Last updated: 30+ days ago • Promoted

Senior Machine Learning Engineer

Henpen Corporation • San Francisco, CA, United States

Full-time

Senior Machine Learning Engineer.San Francisco, California, USA.We're a leading partner in social commerce, collaborating with major athletic wear, footwear, and electronics brands to expand their ...Show more

Last updated: 3 hours ago • Promoted • New!

Senior Machine Learning Engineer, Relevance

Patreon • San Francisco, CA, United States

Full-time

Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans ...Show more

Last updated: 30+ days ago • Promoted

Senior Machine Learning Engineer

Mercury • San Francisco, CA, United States

Full-time

Locations : San Francisco, CA; New York, NY; Portland, OR; or Remote within Canada or United States.Mercury is building systems to efficiently review customers’ applications to prevent money launder...Show more

Last updated: 30+ days ago • Promoted