Talent.com
Machine Learning Data Engineer - Systems & Retrieval

Machine Learning Data Engineer - Systems & Retrieval

Zyphra Technologies Inc.Palo Alto, CA, United States
16 hours ago
Job type
  • Full-time
Job description

Zyphra is an artificial intelligence company based in Palo Alto, California.

The Role :

As a Machine Learning Data Engineer - Systems & Retrieval , you will build and optimize the data infrastructure that fuels our machine learning systems. This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets from raw web-scale data to enterprise document corpora. You’ll play a central role in architecting retrieval systems for LLMs and enabling scalable training and inference with clean, accessible, and secure data. You’ll have an impact across both research and product teams by shaping the foundation upon which intelligent systems are trained, retrieved, and reasoned over.

You’ll work across :

Design and implementation of distributed data ingestion and transformation pipelines

Building retrieval and indexing systems that support RAG and other LLM-based methods

Mining and organizing large unstructured datasets, both in research and production environments

Collaborating with ML engineers, systems engineers, and DevOps to scale pipelines and observability

Ensuring compliance and access control in data handling, with security and auditability in mind

Requirements :

Strong software engineering background with fluency in Python

Experience designing, building, and maintaining data pipelines in production environments

Deep understanding of data structures, storage formats, and distributed data systems

Familiarity with indexing and retrieval techniques for large-scale document corpora

Understanding of database systems (SQL and NoSQL), their internals, and performance characteristics

Strong attention to security, access controls, and compliance best practices (e.g., GDPR, SOC2)

Excellent debugging, observability, and logging practices to support reliability at scale

Strong communication skills and experience collaborating across ML, infra, and product teams

Bonus Skill Set :

Experience building or maintaining LLM-integrated retrieval systems (e.g, RAG pipelines)

Academic or industry background in data mining, search, recommendation systems, or IR literature

Experience with large-scale ETL systems and tools like Apache Beam, Spark, or similar

Familiarity with vector databases (e.g., FAISS, Weaviate, Pinecone) and embedding-based retrieval

Understanding of data validation and quality assurance in machine learning workflows

Experience working on cross-functional infra and MLOps teams

Knowledge of how data infrastructure supports training pipelines, inference serving, and feedback loops

Comfort working across raw, unstructured data, structured databases, and model-ready formats

Why Work at Zyphra :

Our research methodology is to make grounded, methodical steps toward ambitious goals. Both deep research and engineering excellence are equally valued

We strongly value new and crazy ideas and are very willing to bet big on new ideas

We move as quickly as we can; we aim to minimize the bar to impact as low as possible

We all enjoy what we do and love discussing AI

Benefits and Perks :

Comprehensive medical, dental, vision, and FSA plans

Competitive compensation and 401(k)

Relocation and immigration support on a case-by-case basis

On-site meals prepared by a dedicated culinary team; Thursday Happy Hours

In-person team in Palo Alto, CA, with a collaborative, high-energy environment

If you're excited by the challenge of high-scale, high-performance data engineering in the context of cutting-edge AI, you’ll thrive in this role. Apply Today!

#J-18808-Ljbffr

Create a job alert for this search

Machine Learning Engineer • Palo Alto, CA, United States

Related jobs
  • Promoted
  • New!
Machine Learning Systems Engineer, RL Engineering

Machine Learning Systems Engineer, RL Engineering

AnthropicSan Francisco, CA, United States
Full-time
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...Show moreLast updated: 16 hours ago
  • Promoted
  • New!
Machine Learning Engineer – Ads Signals Intelligence & Information Retrieval

Machine Learning Engineer – Ads Signals Intelligence & Information Retrieval

Apple Inc.Cupertino, CA, United States
Full-time
Cupertino, California, United States.Design, implement, and scale ML systems that extract high-value semantic signals from structured and unstructured content. Contribute to retrieval and ranking pi...Show moreLast updated: 16 hours ago
  • Promoted
Machine Learning Engineer

Machine Learning Engineer

Conviva Inc.Foster City, CA, United States
Full-time
Conviva is the first and best place to understand and optimize digital customer experiences.Our Operational Data Platform harnesses full-census, comprehensive client-side telemetry—capturing every ...Show moreLast updated: 30+ days ago
  • Promoted
Machine Learning Systems Engineer, RL Engineering

Machine Learning Systems Engineer, RL Engineering

Menlo VenturesSan Francisco, CA, United States
Full-time
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...Show moreLast updated: 30+ days ago
  • Promoted
Machine Learning Systems Engineer, Encodings and Tokenization

Machine Learning Systems Engineer, Encodings and Tokenization

AnthropicSan Francisco, CA, United States
Full-time
Machine Learning Systems Engineer, Research Tools.Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for socie...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
Machine Learning Engineer

Machine Learning Engineer

Crossing HurdlesSan Francisco, CA, United States
Full-time
Crossing Hurdles is a recruitment firm that assists our clients to hire best talent.It is a Cognitive AI platform that automates low-level cognitive tasks involving clinical data and knowledge - ta...Show moreLast updated: 16 hours ago
  • Promoted
  • New!
Machine Learning Research Engineer

Machine Learning Research Engineer

ProphecySan Francisco, CA, United States
Full-time
Prophecy is the world’s most advanced data integration platform, designed to make complex data work simple and powerful.Designed natively for modern cloud data platforms, Prophecy uniquely serves t...Show moreLast updated: 16 hours ago
  • Promoted
  • New!
Machine Learning Data Engineer - Systems & Retrieval

Machine Learning Data Engineer - Systems & Retrieval

ZipRecruiterPalo Alto, CA, United States
Full-time
Machine Learning Data Engineer - Systems & Retrieval.This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets from raw ...Show moreLast updated: 16 hours ago
  • Promoted
Machine Learning Engineer

Machine Learning Engineer

Metric BioHayward, CA, US
Full-time
Metric Bio is recruiting on behalf of a San Francisco–based digital health company that is building an AI-powered platform to transform patient care and healthcare delivery.ML techniques to s...Show moreLast updated: 4 days ago
  • Promoted
  • New!
Research Engineer - Machine Learning & Systems

Research Engineer - Machine Learning & Systems

World LabsSan Francisco, CA, United States
Full-time
We are looking for a versatile Research Engineer with a strong background in machine learning or 3D, software development, and systems design. This role is ideal for someone excited about bridging c...Show moreLast updated: 16 hours ago
  • Promoted
  • New!
Machine Learning Systems Engineer

Machine Learning Systems Engineer

SubstackSan Francisco, CA, United States
Full-time
Substack is building a new economic engine for culture, giving the brightest, most interesting, and most creative people on the internet the power of their own publishing platform.The terms of our ...Show moreLast updated: 16 hours ago
  • Promoted
  • New!
Machine Learning Engineer, Training Infrastructure

Machine Learning Engineer, Training Infrastructure

Ipro Networks Pte. Ltd.San Francisco, CA, United States
Full-time
Job Title : Machine Learning Engineer, Training Infrastructure | Position Type : Full time | Location : San Francisco, CA, USA | Salary Range : $150,000 - $250,000 (USD) | Job ID# : 158135.Design, imple...Show moreLast updated: 16 hours ago
  • Promoted
  • New!
Machine Learning Engineer, Synthetic Data

Machine Learning Engineer, Synthetic Data

AuroraMountain View, CA, United States
Full-time
At Aurora, you will tackle massively complex problems alongside other passionate, intelligent individuals, growing as an expert while expanding your knowledge. For the latest news from Aurora, visit...Show moreLast updated: 16 hours ago
  • Promoted
Machine Learning Systems Engineer, Safeguards Research

Machine Learning Systems Engineer, Safeguards Research

AnthropicSan Francisco, CA, United States
Full-time
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
Machine Learning Engineer, Synthetic Data

Machine Learning Engineer, Synthetic Data

Australian Competition and Consumer CommissionMountain View, CA, United States
Full-time
Software Autonomy Sensing Mountain View, California.Aurora’s mission is to deliver the benefits of self-driving technology safely, quickly, and broadly. The Aurora Driver will create a new era in mo...Show moreLast updated: 16 hours ago
  • Promoted
  • New!
Machine Learning Systems Platform Engineer

Machine Learning Systems Platform Engineer

Blue SignalSan Francisco, CA, United States
Full-time
Confidential Opening : Machine Learning Systems Platform Engineer.San Francisco, CA (Hybrid Preferred).A stealth-mode innovator at the forefront of AI infrastructure is seeking a dynamic Machine Lea...Show moreLast updated: 16 hours ago
  • Promoted
Machine Learning Systems Engineer, Research Tools

Machine Learning Systems Engineer, Research Tools

AnthropicSan Francisco, CA, United States
Full-time
Machine Learning Systems Engineer, Research Tools.Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for socie...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
Machine Learning Engineer, Training Infrastructure

Machine Learning Engineer, Training Infrastructure

ZipRecruiterSan Francisco, CA, United States
Full-time
Machine Learning Engineer, Training Infrastructure.We are looking for an ML Engineer with 3+ years of experience in high-performance computing systems to manage and optimize our computational infra...Show moreLast updated: 16 hours ago