Talent.com
Machine Learning Data Engineer - Systems & Retrieval
Machine Learning Data Engineer - Systems & RetrievalZyphra • Palo Alto, California, United States
Machine Learning Data Engineer - Systems & Retrieval

Machine Learning Data Engineer - Systems & Retrieval

Zyphra • Palo Alto, California, United States
30+ days ago
Job type
  • Full-time
Job description

Zyphra is an artificial intelligence company based in Palo Alto, California.

The Role :

As a Machine Learning Data Engineer - Systems & Retrieval , you will build and optimize the data infrastructure that fuels our machine learning systems. This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets from raw web-scale data to enterprise document corpora. You’ll play a central role in architecting retrieval systems for LLMs and enabling scalable training and inference with clean, accessible, and secure data. You’ll have an impact across both research and product teams by shaping the foundation upon which intelligent systems are trained, retrieved, and reasoned over.

You’ll work across :

Design and implementation of distributed data ingestion and transformation pipelines

Building retrieval and indexing systems that support RAG and other LLM-based methods

Mining and organizing large unstructured datasets, both in research and production environments

Collaborating with ML engineers, systems engineers, and DevOps to scale pipelines and observability

Ensuring compliance and access control in data handling, with security and auditability in mind

Requirements :

Strong software engineering background with fluency in Python

Experience designing, building, and maintaining data pipelines in production environments

Deep understanding of data structures, storage formats, and distributed data systems

Familiarity with indexing and retrieval techniques for large-scale document corpora

Understanding of database systems (SQL and NoSQL), their internals, and performance characteristics

Strong attention to security, access controls, and compliance best practices (e.g., GDPR, SOC2)

Excellent debugging, observability, and logging practices to support reliability at scale

Strong communication skills and experience collaborating across ML, infra, and product teams

Bonus Skill Set :

Experience building or maintaining LLM-integrated retrieval systems (e.g, RAG pipelines)

Academic or industry background in data mining, search, recommendation systems, or IR literature

Experience with large-scale ETL systems and tools like Apache Beam, Spark, or similar

Familiarity with vector databases (e.g., FAISS, Weaviate, Pinecone) and embedding-based retrieval

Understanding of data validation and quality assurance in machine learning workflows

Experience working on cross-functional infra and MLOps teams

Knowledge of how data infrastructure supports training pipelines, inference serving, and feedback loops

Comfort working across raw, unstructured data, structured databases, and model-ready formats

Why Work at Zyphra :

Our research methodology is to make grounded, methodical steps toward ambitious goals. Both deep research and engineering excellence are equally valued

We strongly value new and crazy ideas and are very willing to bet big on new ideas

We move as quickly as we can; we aim to minimize the bar to impact as low as possible

We all enjoy what we do and love discussing AI

Benefits and Perks :

Comprehensive medical, dental, vision, and FSA plans

Competitive compensation and 401(k)

Relocation and immigration support on a case-by-case basis

On-site meals prepared by a dedicated culinary team; Thursday Happy Hours

In-person team in Palo Alto, CA, with a collaborative, high-energy environment

If you're excited by the challenge of high-scale, high-performance data engineering in the context of cutting-edge AI, you’ll thrive in this role. Apply Today!

Create a job alert for this search

Machine Learning Engineer • Palo Alto, California, United States

Related jobs
Machine Learning Engineer

Machine Learning Engineer

Neuralink • Fremont, CA, United States
Full-time
We are creating devices that enable a bi-directional interface with the brain.These devices allow us to restore movement to the paralyzed, restore sight to the blind, and revolutionize how humans i...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer, ML Runtime & Optimization

Machine Learning Engineer, ML Runtime & Optimization

Pony.ai • Fremont, CA, United States
Full-time
Founded in 2016 in Silicon Valley, Pony.Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony. CNBC Disruptor list of the 50 most innovative and disruptive tech comp...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer

Machine Learning Engineer

Ampcus • Pleasanton, CA, United States
Full-time
Technology and Business consulting services.We are in search of a highly motivated candidate to join our talented Team.Design, build, and maintain end-to-end MLOps pipelines for data prep, training...Show more
Last updated: 13 days ago • Promoted
Machine Learning Systems Engineer

Machine Learning Systems Engineer

Apple Inc. • Cupertino, CA, United States
Full-time
Cupertino, California, United States Machine Learning and AI.The Siri organization is looking for passionate Machine Learning Systems Engineers to join us in developing and shipping state-of-the-ar...Show more
Last updated: 9 days ago • Promoted
Senior Machine Learning Engineer

Senior Machine Learning Engineer

RIT Solutions, Inc. • Fremont, CA, United States
Full-time
Senior Machine Learning Engineer.On-site in Fremont 5x a week (MUST BE LOCAL).In-depth knowledge of Python for high-performance data-intensive applications. Familiarity with at least one modern deep...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer

Machine Learning Engineer

RADAR • Sunnyvale, CA, United States
Full-time
San Diego, CA or Sunnyvale, CA or Remote.At RADAR, we're transforming the way the world thinks about physical retail.RADAR has raised over $104M from top investors, retailers, and strategics and wo...Show more
Last updated: 11 hours ago • Promoted • New!
Machine Learning Engineer, Level 4

Machine Learning Engineer, Level 4

Minimal • Palo Alto, CA, United States
Full-time
Machine Learning Engineer, Level 4 page is loaded## Machine Learning Engineer, Level 4locations : Palo Alto, California : Santa Monica - 3250 Ocean Park Blvd : Seattle, Washington : Bellevue, W...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer

Machine Learning Engineer

NR Consulting • Fremont, CA, United States
Full-time
Develop, optimize, and deploy lightweight machine learning models for edge AI applications, particularly for audio processing. Implement and optimize ML models on embedded platforms, including FPGA ...Show more
Last updated: 30+ days ago • Promoted
AI / Machine Learning Engineer

AI / Machine Learning Engineer

4 Staffing Corp • Fremont, CA, United States
Full-time
About the job AI / Machine Learning Engineer.Our client is seeking a highly skilled and motivated AI / Machine Learning Engineer to join their team. As an AI / Machine Learning Engineer, you will play a c...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer

Machine Learning Engineer

Perfict Global, Inc. • Fremont, CA, United States
Full-time
Position : Machine Learning Engineer.Visa : GC, USC, GCEAD, H4EAD, TN.Location : Fremont, CA, onsite.Design, develop and implement critical machine learning models that operate on our factory and w...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer - Intelligent Agents & Systems

Machine Learning Engineer - Intelligent Agents & Systems

Zyphra Technologies Inc. • Palo Alto, CA, United States
Full-time
Agentic Systems and Interaction projects.You will be at the forefront of building a next-generation desktop and browser-based agent that can autonomously navigate the web, interact with filesystems...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Operations Contractor

Machine Learning Operations Contractor

Coherent • Fremont, CA, United States
Full-time
Machine Learning Operations (MLOps) Contractor.The emphasis will be on yield improvement, screening accuracy, and design optimization. The MLOps Contractor will develop and validate models based on ...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer

Machine Learning Engineer

Instrumental Inc. • Palo Alto, CA, United States
Full-time
Machine Learning Engineer (Computer Vision).We are looking for a customer-focused ML Engineer to help build and scale our end-to-end ML pipeline. You’ll balance research and productization in a fast...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer

Machine Learning Engineer

SynergisticIT • Fremont, CA, United States
Full-time
The Job Market is Challenging due to more than 150,000 Tech Layoffs in 2022 and in 2023 more than 240,000 layoffs so almost 3,90,00 tech employees have been laid off since 2022 and its still going ...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer (Naga)

Machine Learning Engineer (Naga)

Ursus Inc • Fremont, CA, United States
Full-time +1
JOB TITLE : Machine Learning Engineer.LOCATION : Onsite in Fremont, CA.DURATION : 6+ month contract to hire.Serve as a member of our client's supply chain development team. Design, develop, and impleme...Show more
Last updated: 13 days ago • Promoted
ML Ops Engineer

ML Ops Engineer

Omni Inclusive • San Leandro, CA, United States
Full-time
ML Ops Engineer to drive the full lifecycle of machine learning solutions-from data exploration and model development to scalable deployment and monitoring. This role bridges the gap between data sc...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer, ML Resources

Machine Learning Engineer, ML Resources

The Rundown AI, Inc. • Mountain View, CA, United States
Full-time
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...Show more
Last updated: 11 hours ago • Promoted • New!
Machine Learning Researcher / ML-Ops Engineer

Machine Learning Researcher / ML-Ops Engineer

Rivet Industries, Inc. • Palo Alto, CA, United States
Full-time
Machine Learning Researcher / ML-Ops Engineer.Rivet is an American company building integrated task systems — fusing hardened hardware with software, sensors, AI, and networking — for industrial wo...Show more
Last updated: 30+ days ago • Promoted