Talent.com
AI Systems & Data Engineer
HyperFi • San Francisco, CA, United States
11 days ago
Job type
  • Full-time
Job description

We're building the kind of platform we always wanted to use: fast, flexible, and built for making sense of real-world complexity. Behind the scenes is a robust, event-driven architecture that connects systems, abstracts messy workflows, and leaves room for smart automation. The surface is clean and simple. The interactions are seamless and intuitive. The machinery underneath is anything but. That’s where you come in.

We’re a well-networked founding team with strong execution roots and a clear roadmap. We’re backed, focused, and delivering fast.

We are seeking an AI Systems & Data Engineer to join our team. This role requires expertise in building data pipelines in the Databricks environment, specifically for ingesting unstructured data, and in using that data to build AI agents.

💥 What You’ll Do

  • Design and build data pipelines in Databricks for ingesting unstructured data.
  • Construct retrieval-augmented generation (RAG) systems from scratch using ingested data.
  • Build agentic LLM pipelines utilizing frameworks like LangChain, LangGraph, and LangSmith.
  • Own orchestration of PySpark and Databricks workflows to prepare inputs and track outputs for AI models.
  • Instrument evaluation metrics and telemetry to guide the evolution of prompt strategies.
  • Work alongside product, frontend, and backend engineers to tightly integrate AI into user-facing flows.
  • Leverage Databricks features such as Auto Loader for automatic detection of new files on cloud storage and schema changes.
  • Utilize Delta Lake for reliability, security, and performance on the data lake for streaming and batch operations.
  • Apply Databricks Workflows for orchestrating tasks to integrate data.
  • Implement Delta Live Tables for building reliable data pipelines with a declarative approach.

What You’ll Work With

  • Python (primary language for all LLM + orchestration work)
  • LangChain + LangGraph + LangSmith
  • Databricks + PySpark for processing, labeling, and training context
  • Postgres and custom orchestration via MCP
  • GitHub Actions, GCP

You’ll play a crucial role in rolling out products that have immediate impact.

  • Engineers come first: your time, focus, and judgment are respected
  • Deep work > chaos: fixed cycles & cooldowns protect focus and keep context switching low
  • Autonomy is the default: trusted builders who own outcomes, no babysitters
  • Ship daily, safely: merge early, integrate vertically, ship often, use feature flags, and keep momentum
  • Outcomes over optics: solve real problems, not ticket soup
  • Voice matters: from week one, contribute, improve something, and shape how we build
  • Senior peers, no ego: collaborate in a high-trust, async-friendly environment
  • Bold problems, cool tech: work on complex challenges that actually move the needle
  • Fun is part of it: we move fast, but we also celebrate wins and laugh together

✅ What We’re Looking For

  • 5-7 years of experience building production-grade ML, data, or AI systems.
  • Strong grasp of prompt engineering, context construction, and retrieval design.
  • Comfortable working in LangChain and building agents.
  • Experience with PySpark and Databricks to handle real-world data scale.
  • Ability to write testable, maintainable Python with clear structure.
  • Understanding of model evaluation, observability, and feedback loops.
  • Familiarity with the Databricks Data Intelligence Platform, which unifies data warehousing and AI use cases on a single platform.
  • Knowledge of Unity Catalog for open and unified governance of data, analytics, and AI on the lakehouse.
  • Understanding of data security concerns related to AI and how to mitigate them using the Databricks AI Security Framework (DASF).
  • Confident English skills to collaborate clearly and effectively with teammates.

Ideally, you:

  • Have built scalable agent-like workflows on the Databricks platform.
  • Have worked on semantic chunking, vector search, or hybrid retrieval strategies.
  • Can walk us through a real-world prompt failure and how you fixed it.
  • Have contributed to OSS tools or internal AI platforms.
  • Think of yourself as both an engineer and a systems designer.
  • Are familiar with the concept of a data lakehouse architecture.

  • Must be based in San Francisco, Las Vegas, or Tel Aviv
  • Full-time role with competitive comp
  • Flexible hours, async-friendly culture, engineering-led environment