Machine Learning Data Engineer - Systems & Retrieval

Zyphra Technologies Inc., Palo Alto, CA, United States
30+ days ago
Job type
  • Full-time
Job description

Zyphra is an artificial intelligence company based in Palo Alto, California.

The Role:

As a Machine Learning Data Engineer - Systems & Retrieval, you will build and optimize the data infrastructure that fuels our machine learning systems. This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets, from raw web-scale data to enterprise document corpora. You’ll play a central role in architecting retrieval systems for LLMs and enabling scalable training and inference with clean, accessible, and secure data. You’ll have an impact across both research and product teams by shaping the data foundation on which intelligent systems train, retrieve, and reason.

You’ll work across:

Design and implementation of distributed data ingestion and transformation pipelines

Building retrieval and indexing systems that support RAG and other LLM-based methods

Mining and organizing large unstructured datasets, both in research and production environments

Collaborating with ML engineers, systems engineers, and DevOps to scale pipelines and observability

Ensuring compliance and access control in data handling, with security and auditability in mind

Requirements:

Strong software engineering background with fluency in Python

Experience designing, building, and maintaining data pipelines in production environments

Deep understanding of data structures, storage formats, and distributed data systems

Familiarity with indexing and retrieval techniques for large-scale document corpora

Understanding of database systems (SQL and NoSQL), their internals, and performance characteristics

Strong attention to security, access controls, and compliance best practices (e.g., GDPR, SOC2)

Excellent debugging, observability, and logging practices to support reliability at scale

Strong communication skills and experience collaborating across ML, infra, and product teams

Bonus Skill Set:

Experience building or maintaining LLM-integrated retrieval systems (e.g., RAG pipelines)

Academic or industry background in data mining, search, recommendation systems, or IR literature

Experience with large-scale ETL systems and tools like Apache Beam, Spark, or similar

Familiarity with vector databases (e.g., FAISS, Weaviate, Pinecone) and embedding-based retrieval

Understanding of data validation and quality assurance in machine learning workflows

Experience working on cross-functional infra and MLOps teams

Knowledge of how data infrastructure supports training pipelines, inference serving, and feedback loops

Comfort working across raw, unstructured data, structured databases, and model-ready formats

Why Work at Zyphra:

Our research methodology is to take grounded, methodical steps toward ambitious goals. Deep research and engineering excellence are equally valued

We strongly value new and crazy ideas and are very willing to bet big on them

We move as quickly as we can; we aim to keep the bar to impact as low as possible

We all enjoy what we do and love discussing AI

Benefits and Perks:

Comprehensive medical, dental, vision, and FSA plans

Competitive compensation and 401(k)

Relocation and immigration support on a case-by-case basis

On-site meals prepared by a dedicated culinary team; Thursday Happy Hours

In-person team in Palo Alto, CA, with a collaborative, high-energy environment

If you’re excited by the challenge of high-scale, high-performance data engineering in the context of cutting-edge AI, you’ll thrive in this role. Apply today!
