Talent.com
AI Systems & Data Engineer

AI Systems & Data Engineer

HyperFiSan Francisco, CA, United States
15 hours ago
Job type
  • Full-time
Job description

We're building the kind of platform we always wanted to use : fast, flexible, and built for making sense of real-world complexity. Behind the scenes is a robust, event-driven architecture that connects systems, abstracts messy workflows, and leaves room for smart automation. The surface is clean and simple. The interactions are seamless and intuitive. The machinery underneath is anything but. Thats where you come in.

Were a well-networked founding team with strong execution roots and a clear roadmap. Were backed, focused, and delivering fast.

We are seeking an AI Systems & Data Engineer to join our team. We are building a fast, flexible, and complex platform with a robust, event-driven architecture. This role requires expertise in building data pipelines within the Databricks environment, specifically for ingesting unstructured data, and leveraging that data to build AI agents.

  • ???? What Youll Do
  • Design and build data pipelines in Databricks for ingesting unstructured data.
  • Construct retrieval-augmented generation (RAG) systems from scratch using ingested data.
  • Build agentic LLM pipelines utilizing frameworks like LangChain, LangGraph, and LangSmith.
  • Own orchestration of PySpark and Databricks workflows to prepare inputs and track outputs for AI models.
  • Instrument evaluation metrics and telemetry to guide the evolution of prompt strategies.
  • Work alongside product, frontend, and backend engineers to tightly integrate AI into user-facing flows.
  • Leverage Databricks features such as Auto Loader for automatic detection of new files on cloud storage and schema changes.
  • Utilize Delta Lake for reliability, security, and performance on the data lake for streaming and batch operations.
  • Apply Databricks Workflows for orchestrating tasks to integrate data.
  • Implement Delta Live Tables for building reliable data pipelines with a declarative approach.
  • Python (primary language for all LLM + orchestration work)
  • LangChain + LangGraph + LangSmith
  • Databricks + PySpark for processing, labeling, and training context
  • Postgres, and custom orchestration via MCP
  • GitHub Actions, GCP

Youll be a crucial member of rolling out products that will have immediate impact.

  • Engineers come first : your time, focus, and judgment are respected
  • Deep work >
  • chaos : fixed cycles & cooldowns protect focus and keep context switching low

  • Autonomy is the default : trusted builders who own outcomes, no babysitters
  • Ship daily, safely : merge early, integrate vertically, ship often, use feature flags, and keep momentum
  • Outcomes over optics : solve real problems, not ticket soup
  • Voice matters : from week one, contribute, improve something, and shape how we build
  • Senior peers, no ego : collaborate in a high-trust, async-friendly environment
  • Bold problems, cool tech : work on complex challenges that actually move the needle
  • Fun is part of it : we move fast, but we also celebrate wins and laugh together
  • ? What Were Looking For
  • 5-7 years of experience building production-grade ML, data, or AI systems.
  • Strong grasp of prompt engineering, context construction, and retrieval design.
  • Comfortable working in LangChain and building agents.
  • Experience with PySpark and Databricks to handle real-world data scale.
  • Ability to write testable, maintainable Python with clear structure.
  • Understanding of model evaluation, observability, and feedback loops.
  • Familiarity with Databricks Data Intelligence Platform which unifies data warehousing and AI use cases on a single platform.
  • Knowledge of Unity Catalog for open and unified governance of data, analytics, and AI on the lakehouse.
  • Understanding of data security concerns related to AI and how to mitigate them using the Databricks AI Security Framework (DASF).
  • Confident English skills to collaborate clearly and effectively with teammates
  • Have built scalable agent-like workflows on the Databricks platform.
  • Have worked on semantic chunking, vector search, or hybrid retrieval strategies.
  • Can walk us through a real-world prompt failure and how you fixed it.
  • Have contributed to OSS tools or internal AI platforms.
  • Think of yourself as both an engineer and a systems designer.
  • Are familiar with the concept of a data lakehouse architecture.
  • Must be based in San Francisco, Las Vegas, or Tel Aviv
  • Full-time role with competitive comp
  • Flexible hours, async-friendly culture, engineering-led environment
  • #J-18808-Ljbffr

    Create a job alert for this search

    Ai Data Engineer • San Francisco, CA, United States

    Related jobs
    • Promoted
    Senior Director, Data and AI Architecture Leader

    Senior Director, Data and AI Architecture Leader

    Dynavax TechnologiesEmeryville, CA, United States
    Full-time
    This position can be 100% remote, but must be located in the United States.Dynavax is a commercial-stage biopharmaceutical company developing and commercializing novel vaccines to help protect the ...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior / Principal Systems Engineer, Workers AI (AI / ML)

    Senior / Principal Systems Engineer, Workers AI (AI / ML)

    Cloudflare, Inc.San Francisco, California, United States
    Full-time
    About Us At Cloudflare, we are on a mission to help build a better Internet.Today the company runs one of the world's largest networks that powers millions of websites and other Internet properties...Show moreLast updated: 18 hours ago
    • Promoted
    • New!
    AI Engineer - Relational Foundation Models & Agentic Systems

    AI Engineer - Relational Foundation Models & Agentic Systems

    KumoMountain View, CA, United States
    Full-time
    Kumo Relational Foundation Model (RFM).This is your chance to be part of that momentum.This is an engineering role at its core, but you'll also. We're looking for someone who thrives in that hybrid ...Show moreLast updated: 15 hours ago
    • Promoted
    • New!
    Senior Technical Systems AI Architect - Agentic AI

    Senior Technical Systems AI Architect - Agentic AI

    NVIDIASanta Clara, CA, United States
    Full-time
    Join the AI & Automation Supply Chain Business Transformations team in NVIDIA Operations to architect the future of agentic AI at scale. We Implement AI agents to transform employee productivity and...Show moreLast updated: 15 hours ago
    • Promoted
    • New!
    Senior Data Visualization Engineer - AI Systems Performance

    Senior Data Visualization Engineer - AI Systems Performance

    NVIDIASanta Clara, CA, United States
    Full-time
    NVIDIA is at the forefront of AI innovation, powering breakthroughs from advanced research applications to real-time generative AI systems. The DGX Cloud AI Efficiency team develops and delivers the...Show moreLast updated: 16 hours ago
    • Promoted
    Systems Design Engineer, AI Client and Gaming Devices

    Systems Design Engineer, AI Client and Gaming Devices

    Advanced Micro Devices, Inc.San Jose, CA, United States
    Full-time
    WHAT YOU DO AT AMD CHANGES EVERYTHING.At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded syst...Show moreLast updated: 8 days ago
    • Promoted
    AI Data Engineer - Manager

    AI Data Engineer - Manager

    PwCSan Francisco, CA, United States
    Full-time
    At PwC, our people in business application consulting specialise in consulting services for a variety of business applications, helping clients optimise operational efficiency.These individuals ana...Show moreLast updated: 30+ days ago
    • Promoted
    Systems Engineer - Analytics

    Systems Engineer - Analytics

    Stanford Medicine Children's HealthPalo Alto, CA, United States
    Full-time
    At Lucile Packard Children's Hospital Stanford, we know world-renowned care begins with world-class caring.That's why we combine advanced technologies and breakthrough discoveries with family-cente...Show moreLast updated: 6 days ago
    • Promoted
    Senior AI Engineer (Gen AI Platform Services, Agentic Systems)

    Senior AI Engineer (Gen AI Platform Services, Agentic Systems)

    Capital OneSan Francisco, CA, United States
    Full-time +1
    Senior AI Engineer (Gen AI Platform Services, Agentic Systems).At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an indu...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Engineer - Generative AI Solutions

    Principal Engineer - Generative AI Solutions

    Wells FargoSan Francisco, CA, United States
    Full-time
    The COO Technology group provides technology services for the Chief Operating Office.This includes operations, control executives, strategic execution, business continuity and resiliency, data solu...Show moreLast updated: 3 days ago
    • Promoted
    • New!
    Data Platform Engineer Lead (AI)

    Data Platform Engineer Lead (AI)

    Evolutionary ScaleSan Francisco, CA, United States
    Full-time
    EvolutionaryScales mission is to develop artificial intelligence to understand biology for the benefit of human health and society, through open, safe, and responsible research, and in partnership ...Show moreLast updated: 16 hours ago
    • Promoted
    AI Data Engineer - Senior Manager

    AI Data Engineer - Senior Manager

    PwCSan Francisco, CA, United States
    Full-time
    At PwC, our people in business application consulting specialise in consulting services for a variety of business applications, helping clients optimise operational efficiency.These individuals ana...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Lead Data Engineer (Generative AI)

    Lead Data Engineer (Generative AI)

    Capital OneSan Francisco, CA, United States
    Full-time +1
    Lead Data Engineer (Generative AI).Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborative, inclusive, and iterati...Show moreLast updated: 16 hours ago
    • Promoted
    Machine Learning Data Engineer - Systems & Retrieval

    Machine Learning Data Engineer - Systems & Retrieval

    ZyphraPalo Alto, CA, United States
    Full-time
    Machine Learning Data Engineer - Systems & Retrieval.This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets from raw ...Show moreLast updated: 12 days ago
    • Promoted
    • New!
    Distributed Systems Engineer - Data Platform - Analytics and Alerts

    Distributed Systems Engineer - Data Platform - Analytics and Alerts

    Cloudflare IncSan Francisco, CA, United States
    Full-time
    At Cloudflare, we are on a mission to help build a better Internet.Today the company runs one of the world's largest networks that powers millions of websites and other Internet properties for cust...Show moreLast updated: 16 hours ago
    • Promoted
    Software Engineer, Distributed Data Systems

    Software Engineer, Distributed Data Systems

    OpenAISan Francisco, CA, United States
    Full-time
    The Sora team is pioneering multimodal capabilities for OpenAI's foundation models.We're a hybrid research and product team focused on integrating multimodal functionalities into our AI products, e...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Applied AI Engineer - ML for Systems & Infrastructure

    Senior Applied AI Engineer - ML for Systems & Infrastructure

    DatabricksSan Francisco, CA, United States
    Full-time
    As a Senior Applied AI Engineer at Databricks, you will apply machine learning, scheduling and optimization algorithms to improve the efficiency and performance of our engineering systems and infra...Show moreLast updated: 30+ days ago
    • Promoted
    Applied AI Engineer

    Applied AI Engineer

    Wordware IncSan Francisco, CA, United States
    Full-time
    Wordware - from architecture to evals to deployment.We care about what works in production : fast response times, predictable behavior, traceability, and uptime. You'll work across the stack - with i...Show moreLast updated: 30+ days ago