AI Infrastructure Engineer

PlayerZeroSan Francisco, CA, United States

2 days ago

Job type

Full-time

Job description

PlayerZero is building a self‑healing system for software—automating defect detection, diagnosis, and remediation so developers ship with confidence. Teams use PlayerZero to spot issues before customers do, pinpoint root causes fast, and close the loop from incident to fix.

About the role

We’re looking for an experienced backend / infrastructure engineer who loves turning research prototypes into rock‑solid production systems. You’ll design and scale the core services that power our AI inference stack—from data ingestion and feature stores to retrieval pipelines and GPU orchestration. If you’re obsessed with performance, correctness, and shipping fast, you’ll feel at home here.

What You’ll Do

Own critical services end‑to‑end —from architecture and design reviews through deployment, observability, and SLOs.

Scale LLM‑driven workloads : build retrieval‑augmented generation pipelines, vector indexes, and evaluation harnesses that handle billions of events per day.

Design data‑intensive systems : streaming ETL, columnar storage, and time‑series analytics that feed our self‑healing algorithms.

Optimize for cost & latency across CPUs, GPUs, and serverless runtimes; profile hot paths and squeeze every millisecond.

Champion reliability : automate testing, chaos drills, and progressive delivery so new models roll out safely.

Collaborate cross‑functionally with ML researchers, product engineers, and customers to ship features that matter.

You might thrive in this role if

2–5+ years of experience building scalable backend or infrastructure systems in a production setting.

Builder mindset —you like owning projects end‑to‑end and are thoughtful about data models, performance, and long‑term maintainability.

Experience transitioning prototypes to production with an understanding of tradeoffs in reliability and scale.

Comfort with data engineering workflows—parsing, transforming, indexing, and querying structured or unstructured data.

Exposure to search infrastructure or LLM‑backed systems (e.g. document retrieval, semantic search, evaluation, or prompt engineering).

Bonus Points

Hands‑on with vector databases (e.g., pgvector, Pinecone, Weaviate) or inverted‑index search (Elasticsearch, Lucene).

Experience operating GPU clusters (Kubernetes, Ray, KServe) or tuning model‑parallel inference.

Familiarity with Go / Rust (our primary stack) and TypeScript for the occasional full‑stack tweak.

Deep knowledge of observability (OpenTelemetry, Grafana, Datadog) and performance profiling.

Contributions to open‑source ML or infrastructure projects.

Our Supporters

Foundation Capital (Ashu Garg, Jaya Gupta)

WndrCo (Sujay Jaswa, ChenLi Wang)

Green Bay Ventures (Anthony Schiller, Dick Kramlich)

Matei Zaharia (Founder & CTO, Databricks)

Guillermo Rauch (CEO, Vercel)

Dylan Field (Founder & CEO, Figma)

Drew Houston (Founder & CEO, Dropbox)

Peter Bailis (CTO, Workday)

Oliver Jay (MD International, OpenAI)

John Lilly (ex CEO, Mozilla)

Bernard Kim (CEO, Match Group)

Others

Job Details

Seniority level : Mid‑Senior level

Employment type : Full‑time

Job function : Information Technology

Industries : Software Development

#J-18808-Ljbffr

Create a job alert for this search

Infrastructure Engineer • San Francisco, CA, United States

Related jobs

Promoted

Flight Software Infrastructure Engineer

Reliable RoboticsMountain View, CA, United States

Permanent

We're building safety-enhancing technology for aviation that will save lives.Automated aviation systems will enable a future where air transportation is safer, more convenient and fundamentally tra...Show moreLast updated: 30+ days ago

Promoted

AI Infrastructure Engineer, Core Infrastructure

Scale AI, Inc.San Francisco, CA, United States

Full-time

AI Infrastructure Engineer, Core Infrastructure.Join the team shaping the future of AI at Scale.As a Software Engineer on the ML Infrastructure team, you will design and build the next generation o...Show moreLast updated: 23 hours ago

Promoted

AI Infrastructure Architect

Okta for DevelopersSan Francisco, CA, United States

Full-time

We are looking for a smart and versatile AI Infrastructure Architect to build and evolve the AI infrastructure and platform that powers our identity security solutions. Your work will enable interna...Show moreLast updated: 4 days ago

Promoted

Senior AI Infrastructure Engineer

kadenceSan Francisco, CA, United States

Full-time

This range is provided by kadence.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Direct message the job poster from kadence.Senior Consultant |...Show moreLast updated: 4 days ago

Promoted

AI Infrastructure Architect

OktaSan Francisco, CA, United States

Full-time

AI Infrastructure Architect – We are looking for a smart and versatile AI Infrastructure Architect to build and evolve the AI infrastructure and platform that powers our identity security solutions...Show moreLast updated: 30+ days ago

Promoted

Staff Infrastructure Engineer, AI Scientist Team

AnthropicSan Francisco, CA, United States

Full-time

Staff Infrastructure Engineer, Discovery Team.Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society a...Show moreLast updated: 30+ days ago

Promoted
New!

Platform Engineer AI Infra Specialist

PolySan Francisco, CA, United States

Full-time

Platform Engineer AI Infra Specialist.Platform Engineer AI Infra Specialist.This range is provided by Poly.Your actual pay will be based on your skills and experience talk with your recruiter to le...Show moreLast updated: 5 hours ago

Promoted

Infrastructure Engineer, Data Platform

Together AISan Francisco, CA, United States

Full-time

Together AI is hiring a Lead Cloud Infrastructure Engineer to own and operate the cloud foundation that powers our rapidly scaling data platforms. In this role, you will be the primary engineer resp...Show moreLast updated: 4 days ago

Promoted

AI Infrastructure Engineer, Model Serving Platform

Scale AI, Inc.San Francisco, CA, United States

Full-time

As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting-edge research and product...Show moreLast updated: 30+ days ago

Promoted

AI Infrastructure Engineer, Core Infrastructure

Scale AISan Francisco, CA, United States

Full-time

AI Infrastructure Engineer, Core Infrastructure.As a Software Engineer on the ML Infrastructure team, you will design and build the next generation of foundational systems that power all ML Infrast...Show moreLast updated: 4 days ago

Promoted

AI Infrastructure Engineer - PlayerZero

HireOTSSan Francisco, CA, United States

Full-time

The platform is used by engineering and support teams to : .Autonomously debug problems in production software.Fix issues directly in the codebase. Prevent recurring issues through intelligent root-ca...Show moreLast updated: 30+ days ago

Promoted

AI Infra Engineer

Perplexity AI Inc.San Francisco, CA, United States

Full-time

We are looking for an AI Infra engineer to join our growing team.We work with Kubernetes, Slurm, Python, C++, PyTorch, and primarily on AWS. As an AI Infrastructure Engineer, you will be partnering ...Show moreLast updated: 15 days ago

Promoted

Platform Engineer — AI Infra Specialist

PolySan Francisco, CA, United States

Full-time

Platform Engineer — AI Infra Specialist.Platform Engineer — AI Infra Specialist.This range is provided by Poly.Your actual pay will be based on your skills and experience — talk with your recruiter...Show moreLast updated: 30+ days ago

Promoted
New!

Senior AI Infrastructure Engineer

Menlo VenturesSan Francisco, CA, United States

Full-time

A leading AI research firm in San Francisco seeks a Staff Infrastructure Engineer to identify and resolve infrastructure bottlenecks and design large-scale systems for AI training.The ideal candida...Show moreLast updated: 4 hours ago

Promoted
New!

Infrastructure Engineer, AI & LLM Platform (Hybrid)

IvoSan Francisco, CA, United States

Full-time

A forward-thinking tech company in San Francisco is seeking an Infrastructure Engineer to design and manage complex distributed systems. As part of the engineering team, you will own the future of t...Show moreLast updated: 4 hours ago

Promoted

Infrastructure Software Engineer, Public Sector

Scale AI, Inc.San Francisco, CA, United States

Full-time

Scale AI is seeking a highly skilled and motivated.Software Engineer, AI Infrastructure & Security.Public Sector Engineering team. As a part of this team, you will play a critical role in delivering...Show moreLast updated: 30+ days ago

AI agent Infrastructure Engineers

MercorSan Francisco, California, United States

Remote

Full-time

Quick Apply

AI Agent Infrastructure Engineers.This is a unique opportunity to work with world-class AI researchers and engineers, building the infrastructure that enables advanced reasoning, multi-agent coordi...Show moreLast updated: 1 day ago

Promoted

Distinguished AI Engineer (Agentic AI Platform Infrastructure)

Capital OneSan Francisco, CA, United States

Full-time +1

Distinguished AI Engineer (Agentic AI Platform Infrastructure).At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an indu...Show moreLast updated: 30+ days ago