AI Systems & Data Engineer

HyperFiSan Francisco, CA, United States

1 day ago

Job type

Full-time

Job description

We're building the kind of platform we always wanted to use : fast, flexible, and built for making sense of real-world complexity. Behind the scenes is a robust, event-driven architecture that connects systems, abstracts messy workflows, and leaves room for smart automation. The surface is clean and simple. The interactions are seamless and intuitive. The machinery underneath is anything but. That’s where you come in.

We’re a well-networked founding team with strong execution roots and a clear roadmap. We’re backed, focused, and delivering fast.

We are seeking an AI Systems & Data Engineer to join our team. We are building a fast, flexible, and complex platform with a robust, event-driven architecture. This role requires expertise in building data pipelines within the Databricks environment, specifically for ingesting unstructured data, and leveraging that data to build AI agents.

💥 What You’ll Do

Design and build data pipelines in Databricks for ingesting unstructured data.
Construct retrieval-augmented generation (RAG) systems from scratch using ingested data.
Build agentic LLM pipelines utilizing frameworks like LangChain, LangGraph, and LangSmith.
Own orchestration of PySpark and Databricks workflows to prepare inputs and track outputs for AI models.
Instrument evaluation metrics and telemetry to guide the evolution of prompt strategies.
Work alongside product, frontend, and backend engineers to tightly integrate AI into user-facing flows.
Leverage Databricks features such as Auto Loader for automatic detection of new files on cloud storage and schema changes.
Utilize Delta Lake for reliability, security, and performance on the data lake for streaming and batch operations.
Apply Databricks Workflows for orchestrating tasks to integrate data.
Implement Delta Live Tables for building reliable data pipelines with a declarative approach.
Python (primary language for all LLM + orchestration work)
LangChain + LangGraph + LangSmith
Databricks + PySpark for processing, labeling, and training context
Postgres, and custom orchestration via MCP
GitHub Actions, GCP

You’ll be a crucial member of rolling out products that will have immediate impact.

Engineers come first : your time, focus, and judgment are respected

Deep work >

chaos : fixed cycles & cooldowns protect focus and keep context switching low

Autonomy is the default : trusted builders who own outcomes, no babysitters

Ship daily, safely : merge early, integrate vertically, ship often, use feature flags, and keep momentum

Outcomes over optics : solve real problems, not ticket soup

Voice matters : from week one, contribute, improve something, and shape how we build

Senior peers, no ego : collaborate in a high-trust, async-friendly environment

Bold problems, cool tech : work on complex challenges that actually move the needle

Fun is part of it : we move fast, but we also celebrate wins and laugh together

✅ What We’re Looking For

5-7 years of experience building production-grade ML, data, or AI systems.

Strong grasp of prompt engineering, context construction, and retrieval design.

Comfortable working in LangChain and building agents.

Experience with PySpark and Databricks to handle real-world data scale.

Ability to write testable, maintainable Python with clear structure.

Understanding of model evaluation, observability, and feedback loops.

Familiarity with Databricks Data Intelligence Platform which unifies data warehousing and AI use cases on a single platform.

Knowledge of Unity Catalog for open and unified governance of data, analytics, and AI on the lakehouse.

Understanding of data security concerns related to AI and how to mitigate them using the Databricks AI Security Framework (DASF).

Confident English skills to collaborate clearly and effectively with teammates

Have built scalable agent-like workflows on the Databricks platform.

Have worked on semantic chunking, vector search, or hybrid retrieval strategies.

Can walk us through a real-world prompt failure and how you fixed it.

Have contributed to OSS tools or internal AI platforms.

Think of yourself as both an engineer and a systems designer.

Are familiar with the concept of a data lakehouse architecture.

Must be based in San Francisco, Las Vegas, or Tel Aviv

Full-time role with competitive comp

Flexible hours, async-friendly culture, engineering-led environment

#J-18808-Ljbffr

Create a job alert for this search

Ai Data Engineer • San Francisco, CA, United States

Related jobs

Promoted

Systems Engineer, Hardware Architecture

WaymoMountain View, CA, United States

Full-time

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...Show moreLast updated: 30+ days ago

Promoted

Senior Director, Data and AI Architecture Leader

Dynavax TechnologiesEmeryville, CA, United States

Full-time

This position can be 100% remote, but must be located in the United States.Dynavax is a commercial-stage biopharmaceutical company developing and commercializing novel vaccines to help protect the ...Show moreLast updated: 19 days ago

Promoted

Senior Applied AI Engineer – ML for Systems & Infrastructure

Databricks Inc.San Francisco, CA, United States

Full-time

Senior Applied AI Engineer – ML for Systems & Infrastructure.The Applied AI team at Databricks sits at the forefront of advancing GenAI-powered products. Over the past years, we’ve launched Databric...Show moreLast updated: 26 days ago

Promoted

Engineer – Conversational AI

Voiceplug Inc.San Francisco, CA, United States

Full-time

Overview : Exciting job role that involves designing and building the next generation of Conversational AI / NLP products. Build a long-term career in Conversational AI / NLP and Deep Learning.Opportunit...Show moreLast updated: 3 days ago

Promoted

Principal AI Engineer

SynopsysMountain View, CA, United States

Full-time

You are a passionate and driven individual with a degree in Computer Science, Computer Engineering, or Electrical Engineering. With a strong foundation in Artificial Intelligence algorithms and expe...Show moreLast updated: 30+ days ago

Promoted

System Engineer

SupermicroSan Jose, CA, United States

Full-time

Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 30+ days ago

Promoted

AI Research Engineer, Lead

Menlo VenturesSan Francisco, CA, United States

Full-time

The Technical Lead will drive AI research in one or more of the following areas : structure prediction, protein design, and lead optimization. PhD in AI, Machine Learning, Bioinformatics, or related ...Show moreLast updated: 30+ days ago

Promoted

Machine Learning Data Engineer - Systems & Retrieval

ZyphraPalo Alto, CA, United States

Full-time

Machine Learning Data Engineer - Systems & Retrieval.Machine Learning Data Engineer - Systems & Retrieval.This includes designing high-performance pipelines for collecting, transforming, indexing, ...Show moreLast updated: 30+ days ago

Promoted

Systems Engineer, Behavioral Requirements

WaymoSan Francisco, CA, United States

Full-time

Promoted

Distinguished AI Engineer

Capital OneRichmond, CA, United States

Part-time

Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 8 years of experience developing AI and ML algorithms or technologies, or a ...Show moreLast updated: 24 days ago

Promoted

AI Engineer

CEF AISan Francisco, CA, United States

Full-time

Are you excited by the opportunity to become a part of a badass team leading the AI-first data revolution?.Do you thrive on solving hard problems (individually and collectively), yet humble and hun...Show moreLast updated: 30+ days ago

Promoted

AI / Generative AI Engineer

AndiamoSan Francisco, CA, United States

Permanent

Get AI-powered advice on this job and more exclusive features.Generative AI Engineer - Pre-IPO Tech Company.AI-powered applications at a high-growth, pre-IPO technology company.You will work on app...Show moreLast updated: 7 days ago

Promoted

Data Engineer, Analytics

OpenAISan Francisco, CA, United States

Full-time

The Applied team works across research, engineering, product, and design to bring OpenAI’s technology to consumers and businesses. We seek to learn from deployment and distribute the benefits of AI,...Show moreLast updated: 1 day ago

Promoted

Senior AI Engineer (Gen AI Platform Services, Agentic Systems)

Capital OneSan Francisco, CA, United States

Part-time

You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing.You want to work on problems that will help change banking for good.Passion for s...Show moreLast updated: 24 days ago

Promoted

Principal, DAA Programs (Data, Analytics, and AI)

SnowflakeMenlo Park, CA, US

Full-time

We're looking for a sharp, strategic operator to join our team as Principal, DAA Programsa critical role at the heart of Snowflake's data, analytics, and AI transformation.Reporting directly to the...Show moreLast updated: 30+ days ago

Promoted

AI Solutions Engineer

ButterflyMX, Inc.San Francisco, CA, United States

Full-time

ButterflyMX is on a mission to empower people to open and manage doors & gates from a smartphone.Our products are installed in more than 20,000+ multifamily, commercial, gated communities, and stud...Show moreLast updated: 21 days ago

Promoted

Machine Learning Data Engineer - Systems & Retrieval

Zyphra Technologies Inc.Palo Alto, CA, United States

Full-time

Machine Learning Data Engineer - Systems & Retrieval.This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets from raw ...Show moreLast updated: 30+ days ago

Promoted

Fullstack Engineer, Intelligence Systems

OpenAISan Francisco, CA, United States

Full-time

As an Intelligence Systems Engineer, you’ll be focused on advancing our Intelligence & Investigations efforts at OpenAI, ensuring the safe and responsible use of AI across our products and services...Show moreLast updated: 30+ days ago