Talent.com
Machine Learning Data Engineer - Systems & Retrieval
Machine Learning Data Engineer - Systems & RetrievalZyphra • Palo Alto, CA, United States
Machine Learning Data Engineer - Systems & Retrieval

Machine Learning Data Engineer - Systems & Retrieval

Zyphra • Palo Alto, CA, United States
23 hours ago
Job type
  • Full-time
Job description

Zyphra is an artificial intelligence company based in Palo Alto, California.

The Role :

As a Machine Learning Data Engineer - Systems & Retrieval , you will build and optimize the data infrastructure that fuels our machine learning systems. This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets from raw web-scale data to enterprise document corpora. You’ll play a central role in architecting retrieval systems for LLMs and enabling scalable training and inference with clean, accessible, and secure data. You’ll have an impact across both research and product teams by shaping the foundation upon which intelligent systems are trained, retrieved, and reasoned over.

You’ll work across :

Design and implementation of distributed data ingestion and transformation pipelines

Building retrieval and indexing systems that support RAG and other LLM-based methods

Mining and organizing large unstructured datasets, both in research and production environments

Collaborating with ML engineers, systems engineers, and DevOps to scale pipelines and observability

Ensuring compliance and access control in data handling, with security and auditability in mind

Requirements :

Strong software engineering background with fluency in Python

Experience designing, building, and maintaining data pipelines in production environments

Deep understanding of data structures, storage formats, and distributed data systems

Familiarity with indexing and retrieval techniques for large-scale document corpora

Understanding of database systems (SQL and NoSQL), their internals, and performance characteristics

Strong attention to security, access controls, and compliance best practices (e.g., GDPR, SOC2)

Excellent debugging, observability, and logging practices to support reliability at scale

Strong communication skills and experience collaborating across ML, infra, and product teams

Bonus Skill Set :

Experience building or maintaining LLM-integrated retrieval systems (e.g, RAG pipelines)

Academic or industry background in data mining, search, recommendation systems, or IR literature

Experience with large-scale ETL systems and tools like Apache Beam, Spark, or similar

Familiarity with vector databases (e.g., FAISS, Weaviate, Pinecone) and embedding-based retrieval

Understanding of data validation and quality assurance in machine learning workflows

Experience working on cross-functional infra and MLOps teams

Knowledge of how data infrastructure supports training pipelines, inference serving, and feedback loops

Comfort working across raw, unstructured data, structured databases, and model-ready formats

Why Work at Zyphra :

Our research methodology is to make grounded, methodical steps toward ambitious goals. Both deep research and engineering excellence are equally valued

We strongly value new and crazy ideas and are very willing to bet big on new ideas

We move as quickly as we can; we aim to minimize the bar to impact as low as possible

We all enjoy what we do and love discussing AI

Benefits and Perks :

Comprehensive medical, dental, vision, and FSA plans

Competitive compensation and 401(k)

Relocation and immigration support on a case-by-case basis

On-site meals prepared by a dedicated culinary team; Thursday Happy Hours

In-person team in Palo Alto, CA, with a collaborative, high-energy environment

If you're excited by the challenge of high-scale, high-performance data engineering in the context of cutting-edge AI, you’ll thrive in this role. Apply Today!

#J-18808-Ljbffr

Create a job alert for this search

Machine Learning Engineer • Palo Alto, CA, United States

Related jobs
Machine Learning Engineer - Data, Productivity Apps

Machine Learning Engineer - Data, Productivity Apps

Apple Inc. • Cupertino, CA, United States
Full-time
Machine Learning Engineer - Data, Productivity Apps.Cupertino, California, United States Software and Services.At Apple, new ideas have a way of becoming phenomenal products, services, and customer...Show more
Last updated: 23 hours ago • Promoted
Machine Learning Research Engineer, Generative AI

Machine Learning Research Engineer, Generative AI

Apple Inc. • Cupertino, CA, United States
Full-time
Machine Learning Research Engineer, Generative AI.Cupertino, California, United States Software and Services.Apple is where individual imaginations gather together, committing to the values that le...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Infrastructure and Data Engineer

Machine Learning Infrastructure and Data Engineer

Apple Inc. • Sunnyvale, CA, United States
Full-time
Machine Learning Infrastructure and Data Engineer.Sunnyvale, California, United States.Want to ship amazing experiences in Apple products? Be part of the team in the Video Computer Vision (VCV) org...Show more
Last updated: 29 days ago • Promoted
Machine Learning Engineer

Machine Learning Engineer

Meta • Menlo Park, CA, United States
Full-time
Machine Learning EngineerMetaSoftware EngineeringMachine LearningMachine Learning Engineer Responsibilities • Adapt standard machine learning methods to best exploit modern parallel environments (e....Show more
Last updated: 23 hours ago • Promoted
Machine Learning Research Engineer, Text Generation

Machine Learning Research Engineer, Text Generation

Apple Inc. • Cupertino, CA, United States
Full-time
Machine Learning Research Engineer, Text Generation.Cupertino, California, United States Machine Learning and AI.Apple is where individual imaginations gather together, committing to the values tha...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Systems Engineer

Machine Learning Systems Engineer

Apple Inc. • Cupertino, CA, United States
Full-time
Cupertino, California, United States Machine Learning and AI.The Siri organization is looking for passionate Machine Learning Systems Engineers to join us in developing and shipping state-of-the-ar...Show more
Last updated: 12 days ago • Promoted
Machine Learning Engineer

Machine Learning Engineer

Clutch Canada • Palo Alto, CA, United States
Full-time
Palo Alto, CA - Engineering - Hybrid - Full-time.Building hardware is like writing software with no debugger, no logs, and only three compile attempts — before mass production.This lack of visibili...Show more
Last updated: 30+ days ago • Promoted
Senior Machine Learning Engineer

Senior Machine Learning Engineer

Harnham • Hayward, CA, US
Full-time
SENIOR MACHINE LEARNING ENGINEER - SEARCH & RECOMMENDATIONS.Hybrid – Bay Area (3 Days / Week Onsite).We're a fast-growing online marketplace backed by a major global tech player.Our platform help...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer - Intelligent Agents & Systems

Machine Learning Engineer - Intelligent Agents & Systems

Zyphra Technologies Inc. • Palo Alto, CA, United States
Full-time
Agentic Systems and Interaction projects.You will be at the forefront of building a next-generation desktop and browser-based agent that can autonomously navigate the web, interact with filesystems...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Research Engineer

Machine Learning Research Engineer

Decagon AI, Inc. • San Francisco, CA, United States
Full-time
Decagon is the leading conversational AI platform empowering every brand to deliver concierge customer experience.Our AI agents provide intelligent, human-like responses across chat, email, and voi...Show more
Last updated: 30+ days ago • Promoted
Defense Machine Learning Engineer - Remote

Defense Machine Learning Engineer - Remote

iO Associates • Fremont, CA, United States
Remote
Full-time
Job Title : Uncleared Machine Learning Engineer - Remote.Our Client is at the forefront of Intelligent Exploration and Enterprise AI with their cutting-edge AI Platform. Operating in a dynamic indust...Show more
Last updated: 15 days ago • Promoted
Machine Learning Engineer, Training Infrastructure

Machine Learning Engineer, Training Infrastructure

Hedra, Inc • San Francisco, CA, United States
Full-time
Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures.We're building Hedra Studio, a multimodal creation platform capable of control, emotion,...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer 2

Machine Learning Engineer 2

Intuit Inc. • Mountain View, CA, United States
Full-time
In this role, you’ll be embedded inside a vibrant team of data scientists.You’ll be expected to help conceive, code, and deploy data science models at scale using the latest industry tools.Importan...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer, Training Infrastructure

Machine Learning Engineer, Training Infrastructure

Ipro Networks Pte. Ltd. • San Francisco, CA, United States
Full-time
Job Title : Machine Learning Engineer, Training Infrastructure | Position Type : Full time | Location : San Francisco, CA, USA | Salary Range : $150,000 - $250,000 (USD) | Job ID# : 158135.Design, imple...Show more
Last updated: 30+ days ago • Promoted
Senior Machine Learning Engineer, Relevance

Senior Machine Learning Engineer, Relevance

Patreon • San Francisco, CA, United States
Full-time
Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans ...Show more
Last updated: 17 days ago • Promoted
Machine Learning Engineer

Machine Learning Engineer

Adobe Systems GmbH • San Jose, CA, United States
Full-time
Changing the world through digital experiences is what Adobe’s all about.We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital exper...Show more
Last updated: 30+ days ago • Promoted
Research Engineer - Machine Learning & Systems

Research Engineer - Machine Learning & Systems

World Labs • San Francisco, CA, United States
Full-time
We are looking for a versatile Research Engineer with a strong background in machine learning or 3D, software development, and systems design. This role is ideal for someone excited about bridging c...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Researcher / ML-Ops Engineer

Machine Learning Researcher / ML-Ops Engineer

Rivet Industries, Inc. • Palo Alto, CA, United States
Full-time
Machine Learning Researcher / ML-Ops Engineer.Rivet is an American company building integrated task systems — fusing hardened hardware with software, sensors, AI, and networking — for industrial wo...Show more
Last updated: 11 days ago • Promoted