AI Infrastructure Engineer - PlayerZero

HireOTS - San Francisco, CA, United States

30+ days ago

Job type

Full-time

Job description

A stealth-stage AI infrastructure company is building a self-healing system for software that automates defect resolution and development. The platform is used by engineering and support teams to:

Autonomously debug problems in production software
Fix issues directly in the codebase
Prevent recurring issues through intelligent root-cause automation

The company is backed by top-tier investors such as Foundation Capital, WndrCo, and Green Bay Ventures, as well as prominent operators including Matei Zaharia, Drew Houston, Dylan Field, Guillermo Rauch, and others.

We believe that as software development accelerates, the burden of maintaining quality and reliability shifts heavily onto engineering and support teams. This challenge creates a rare opportunity to reimagine how software is supported and sustained, with AI-powered systems that respond autonomously.

About the Role

We're looking for an experienced backend / infrastructure engineer who thrives at the intersection of systems and AI - and who loves turning research prototypes into rock-solid production services. You'll design and scale the core backend that powers our AI inference stack - from ingestion pipelines and feature stores to GPU orchestration and vector search.

If you care deeply about performance, correctness, observability, and fast iteration, you'll fit right in.

What You'll Do

Own mission-critical services end-to-end - from architecture and design reviews to deployment, observability, and service-level objectives.

Scale LLM-driven systems: build RAG pipelines, vector indexes, and evaluation frameworks handling billions of events per day.

Design data-heavy backends: streaming ETL, columnar storage, time-series analytics - all fueling the self-healing loop.

Optimize for cost and latency across compute types (CPUs, GPUs, serverless); profile hot paths and squeeze out milliseconds.

Drive reliability: implement automated testing, chaos engineering, and progressive rollout strategies for new models.

Work cross-functionally with ML researchers, product engineers, and real customers to build infrastructure that actually matters.

You Might Thrive in This Role If You:

Have 2-5+ years of experience building scalable backend or infra systems in production environments

Bring a builder mindset - you like owning projects end-to-end and thinking deeply about data, scale, and maintainability

Have transitioned ML or data-heavy prototypes to production, balancing speed and robustness

Are comfortable with data engineering workflows: parsing, transforming, indexing, and querying structured or unstructured data

Have some exposure to search infrastructure or LLM-backed systems (e.g., document retrieval, RAG, semantic search)

Bonus Points

Experience with vector databases (e.g., pgvector, Pinecone, Weaviate) or inverted-index search (e.g., Elasticsearch, Lucene)

Hands-on experience with GPU orchestration (Kubernetes, Ray, KServe) or model-parallel inference tuning

Familiarity with Go / Rust (primary stack), with some TypeScript for light full-stack tasks

Deep knowledge of observability tooling (OpenTelemetry, Grafana, Datadog) and profiling distributed systems

Contributions to open-source ML or systems infrastructure projects


