Talent.com
AI Systems & Data Engineer
AI Systems & Data EngineerHyperFi • San Francisco, California, United States, 94102
AI Systems & Data Engineer

AI Systems & Data Engineer

HyperFi • San Francisco, California, United States, 94102
30+ days ago
Job type
  • Full-time
Job description

About HyperFi

We're building the kind of platform we always wanted to use : fast, flexible, and built for making sense of real-world complexity. Behind the scenes is a robust, event-driven architecture that connects systems, abstracts messy workflows, and leaves room for smart automation. The surface is clean and simple. The interactions are seamless and intuitive. The machinery underneath is anything but. That's where you come in.

We're a well-networked founding team with strong execution roots and a clear roadmap. We're backed, focused, and delivering fast.

We are seeking an AI Systems & Data Engineer to join our team. We are building a fast, flexible, and complex platform with a robust, event-driven architecture. This role requires expertise in building data pipelines within the Databricks environment, specifically for ingesting unstructured data, and leveraging that data to build AI agents.

What You'll Do

  • Design and build data pipelines in Databricks for ingesting unstructured data.
  • Construct retrieval-augmented generation (RAG) systems from scratch using ingested data.
  • Build agentic LLM pipelines utilizing frameworks like LangChain, LangGraph, and LangSmith.
  • Own orchestration of PySpark and Databricks workflows to prepare inputs and track outputs for AI models.
  • Instrument evaluation metrics and telemetry to guide the evolution of prompt strategies.
  • Work alongside product, frontend, and backend engineers to tightly integrate AI into user-facing flows.
  • Leverage Databricks features such as Auto Loader for automatic detection of new files on cloud storage and schema changes.
  • Utilize Delta Lake for reliability, security, and performance on the data lake for streaming and batch operations.
  • Apply Databricks Workflows for orchestrating tasks to integrate data.
  • Implement Delta Live Tables for building reliable data pipelines with a declarative approach.

Tech Stack (So Far)

  • Python (primary language for all LLM + orchestration work)
  • LangChain + LangGraph + LangSmith
  • Databricks + PySpark for processing, labeling, and training context
  • Gemini + model routing logic
  • Postgres, and custom orchestration via MCP
  • GitHub Actions, GCP
  • You'll be a crucial member of rolling out products that will have immediate impact.

    How We Build

  • Engineers come first : your time, focus, and judgment are respected
  • Deep work >
  • chaos : fixed cycles & cooldowns protect focus and keep context switching low

  • Autonomy is the default : trusted builders who own outcomes, no babysitters
  • Ship daily, safely : merge early, integrate vertically, ship often, use feature flags, and keep momentum
  • Outcomes over optics : solve real problems, not ticket soup
  • Voice matters : from week one, contribute, improve something, and shape how we build
  • Senior peers, no ego : collaborate in a high-trust, async-friendly environment
  • Bold problems, cool tech : work on complex challenges that actually move the needle
  • Fun is part of it : we move fast, but we also celebrate wins and laugh together
  • What We're Looking For

  • 5-7 years of experience building production-grade ML, data, or AI systems.
  • Strong grasp of prompt engineering, context construction, and retrieval design.
  • Comfortable working in LangChain and building agents.
  • Experience with PySpark and Databricks to handle real-world data scale.
  • Ability to write testable, maintainable Python with clear structure.
  • Understanding of model evaluation, observability, and feedback loops.
  • Excited to push from prototype production iteration.
  • Familiarity with Databricks Data Intelligence Platform which unifies data warehousing and AI use cases on a single platform.
  • Knowledge of Unity Catalog for open and unified governance of data, analytics, and AI on the lakehouse.
  • Understanding of data security concerns related to AI and how to mitigate them using the Databricks AI Security Framework (DASF).
  • Confident English skills to collaborate clearly and effectively with teammates
  • Bonus If You :

  • Have built scalable agent-like workflows on the Databricks platform.
  • Have worked on semantic chunking, vector search, or hybrid retrieval strategies.
  • Can walk us through a real-world prompt failure and how you fixed it.
  • Have contributed to OSS tools or internal AI platforms.
  • Think of yourself as both an engineer and a systems designer.
  • Are familiar with the concept of a data lakehouse architecture.
  • Location & Compensation

  • Must be based in San Francisco, Las Vegas, or Tel Aviv
  • Full-time role with competitive comp
  • Flexible hours, async-friendly culture, engineering-led environment
  • PI0c7bd322b09c-30511-38844592

    Create a job alert for this search

    Ai Data Engineer • San Francisco, California, United States, 94102

    Related jobs
    Artificial Intelligence Engineer

    Artificial Intelligence Engineer

    Harnham • Oakland, CA, US
    Full-time
    An elite AI engineering team within a global organization is hiring a.These systems will power internal automation and cost-efficiency efforts, touching millions of transactions and users across a ...Show more
    Last updated: 19 days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Harnham • Fremont, CA, US
    Full-time
    STAFF AI / ML ENGINEER – HEALTHTECH.HYBRID – SUNNYVALE, CA (MON–THU ONSITE, FRI WFH).Are you passionate about applying AI to improve real-world health outcomes and quality of life f...Show more
    Last updated: 20 hours ago • Promoted • New!
    AI / ML Software Engineer

    AI / ML Software Engineer

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for an AI / ML Software Engineer.Key Responsibilities Rapidly prototype product features to explore new AI capabilities while writing clean, understandable code Deliver genera...Show more
    Last updated: 30+ days ago • Promoted
    AI / ML Architect

    AI / ML Architect

    Cooley LLP • San Francisco, CA, United States
    Full-time
    Cooley is seeking an AI / ML Architect to join the Practice Engineering team within the Innovation department.As a leading technology law firm, Cooley is determined to become a leader in the digital ...Show more
    Last updated: 20 days ago • Promoted
    Software / API Engineer

    Software / API Engineer

    Lawrence Berkeley National Laboratory • Berkeley, CA, United States
    Full-time
    The National Energy Research Scientific Computing Center (NERSC) is seeking a versatile Software / API Engineer to join our team building software systems that integrate scientific workflows and su...Show more
    Last updated: 30+ days ago • Promoted
    AI Solutions Architect

    AI Solutions Architect

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for an AI Solutions Architect to design, implement, and optimize intelligent solutions within the Microsoft ecosystem. Key Responsibilities Design end-to-end AI and microservi...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, Enterprise AI

    Software Engineer, Enterprise AI

    Scale AI, Inc. • San Francisco, CA, United States
    Full-time
    Scale GP (Scale Generative AI Platform) is an enterprise-grade Generative AI platform that provides APIs for knowledge retrieval, inference, evaluation, and more. We are looking for a strong enginee...Show more
    Last updated: 30+ days ago • Promoted
    Lecturer - Data Science Undergraduate Studies - College of Computing, Data Science, and Society

    Lecturer - Data Science Undergraduate Studies - College of Computing, Data Science, and Society

    University of California-Berkeley • Berkeley, CA, United States
    Full-time
    The UC academic salary scales set the minimum pay at appointment.See the following table for the current salary scale for this position : https : / / www. The current full-time salary range for this posi...Show more
    Last updated: 30+ days ago • Promoted
    Registry Data Systems Analyst- Remote - 136400

    Registry Data Systems Analyst- Remote - 136400

    UC San Diego Health • Richmond, CA, United States
    Remote
    Full-time
    This position is limited to California Residents and may require travel to Richmond and / or Sacramento, California.UCSD Layoff from Career Appointment. Apply by 8 / 27 / 2025 for consideration with prefe...Show more
    Last updated: 8 hours ago • Promoted • New!
    AI Engineer

    AI Engineer

    VirtualVocations • Santa Clara, California, United States
    Full-time
    A company is looking for an AI Engineer.Key Responsibilities Collaborate with data scientists and ML engineers to containerize, deploy, and monitor AI / ML models Design, build, and manage cloud i...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Data Engineer

    Machine Learning Data Engineer

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for a Machine Learning Data Engineer, Research - 3D Data Focus.Key Responsibilities Ingest, clean, normalize, and preprocess data for machine learning model training pipeline...Show more
    Last updated: 4 days ago • Promoted
    HPC / AI Data Performance Engineer

    HPC / AI Data Performance Engineer

    Lawrence Berkeley National Laboratory • Berkeley, CA, United States
    Full-time +1
    In this exciting role, you will serve as a Data Performance Engineer in NERSC's Application Performance Group, architecting HPC and AI data services that advance fundamental science.You'll optimize...Show more
    Last updated: 17 days ago • Promoted
    Senior Data Engineer

    Senior Data Engineer

    PG Forsta • Emeryville, CA, United States
    Full-time
    PG Forsta is the leading experience measurement, data analytics, and insights provider for complex industries-a status we earned over decades of deep partnership with clients to help them understan...Show more
    Last updated: 4 days ago • Promoted
    Data Scientist

    Data Scientist

    Lawrence Berkeley National Laboratory • Berkeley, CA, United States
    Full-time
    Lawrence Berkeley National Lab's (.Energy Geosciences Division has an opening for a Data Scientist to join the team.In this role, you will be primarily responsible for the development, implementati...Show more
    Last updated: 21 days ago • Promoted
    Applied AI Engineer Consultant

    Applied AI Engineer Consultant

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for an Applied AI Engineer Consultant.Key Responsibilities Build production-ready Intelligent Automations and Agentic AI solutions for enterprise clients Guide clients in de...Show more
    Last updated: 30+ days ago • Promoted
    AI Research Lead

    AI Research Lead

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for an AI Research Lead - Pre-training to own and scale pre-training efforts in a remote setting.Key Responsibilities Lead strategy and execution of pre-training activities, ...Show more
    Last updated: 15 days ago • Promoted
    Data Engineer

    Data Engineer

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for a Data Engineer, either based in Massachusetts or remote.Key Responsibilities Confer with business partners on data and reporting requests Work on complex projects with ...Show more
    Last updated: 30+ days ago • Promoted
    AI Developer Relations

    AI Developer Relations

    Signify Technology • Hayward, CA, US
    Full-time
    AI company pioneering high-quality data and research workflows for next-generation foundation models.Their platform empowers leading AI labs and Fortune 250 enterprises to build more accurate, reli...Show more
    Last updated: 20 hours ago • Promoted • New!