Talent.com
AI Infrastructure Engineer

AI Infrastructure Engineer

StackAISan Francisco, CA, United States
5 days ago
Job type
  • Full-time
Job description

About the Role

We’re hiring an AI Infrastructure Engineer to shape and scale the backend systems that power our AI platform. As a Series A company, your work will be foundational, enabling safe, efficient, and reliable AI workflows from end to end.

What You’ll Do

Design and implement scalable backend architectures for AI workloads (inference, orchestration, monitoring).

Own distributed job orchestration with Temporal and related systems.

Improve data pipeline performance by designing smarter caching strategies (e.g., file deduplication, hot / cold storage, Redis caching layers) to reduce redundant compute and API calls.

Build observability, monitoring, retries, and fault tolerance into all workflows.

Manage infrastructure reliability, incident response, and performance.

Develop tooling and platform infrastructure to support rapid growth.

Partner with ML engineers to bring models to production at scale.

What We’re Looking For

4+ years of backend engineering (Python is a must).

Strong background in distributed systems, job orchestration, and task queues.

Deep knowledge of concurrency, parallelism, and multithreading—including async / await, event loops, thread pools, synchronization primitives, deadlocks, and race conditions—is a must. You should know how to design systems that maximize throughput without sacrificing correctness or safety.

Hands-on experience with Temporal, Redis, Airflow, Celery, RabbitMQ (or similar).

Experience with LLM serving and routing fundamentals (rate limiting, streaming, load balancing, budgets).

Comfortable with containers & orchestration : Docker, Kubernetes.

Familiarity with cloud platforms (AWS / GCP) and IaC (Terraform).

Experience with multiple storage systems : S3, Postgres, MongoDB, Redis, and Elasticsearch.

Track record scaling systems in startups or fast-paced environments.

Understanding of deploying, monitoring, and optimizing AI / ML systems in production with strong CI / CD practices.

Why You’ll Love Working Here

Play a foundational role at a fast-growing Series A startup that is shaping the future of AI in enterprise workflows.

Collaborate across Product, ML, and Platform teams, being the bridge between AI logic and scalable execution.

Build infrastructure that enables real value for large enterprises : low-code, secure, and scalable AI workflows.

Join a company that’s scaling thoughtfully and values developer experience.

#J-18808-Ljbffr

Create a job alert for this search

Infrastructure Engineer • San Francisco, CA, United States

Related jobs
  • Promoted
Flight Software Infrastructure Engineer

Flight Software Infrastructure Engineer

Reliable RoboticsMountain View, CA, United States
Permanent
We're building safety-enhancing technology for aviation that will save lives.Automated aviation systems will enable a future where air transportation is safer, more convenient and fundamentally tra...Show moreLast updated: 30+ days ago
  • Promoted
Senior Infrastructure Security Engineer

Senior Infrastructure Security Engineer

VirtualVocationsFremont, California, United States
Full-time
A company is looking for a Senior Infrastructure Security Engineer - DGX Cloud.Key Responsibilities Implement, manage, and troubleshoot firewalls within on-premise and cloud network infrastructur...Show moreLast updated: 30+ days ago
  • Promoted
Senior Infrastructure Engineer

Senior Infrastructure Engineer

VirtualVocationsConcord, California, United States
Full-time
A company is looking for a Senior Software Engineer, Infrastructure.Key Responsibilities Independently execute large DevOps projects, including migrations and infrastructure enhancements Drive r...Show moreLast updated: 30+ days ago
  • Promoted
Senior DGX Cloud AI Infrastructure Software Engineer

Senior DGX Cloud AI Infrastructure Software Engineer

NVIDIASanta Clara, CA, United States
Full-time
Joining NVIDIA's DGX Cloud AI Efficiency Team means contributing to the infrastructure that powers our innovative AI research. This team focuses on optimizing efficiency and resiliency of AI workloa...Show moreLast updated: 3 days ago
  • Promoted
Sr. AI Infrastructure Software Engineer

Sr. AI Infrastructure Software Engineer

KLAMilpitas, CA, United States
Full-time
KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem.Virtually every electronic device in the world is produced using our technologies.No laptop, smartpho...Show moreLast updated: 30+ days ago
  • Promoted
AI Engineer

AI Engineer

VirtualVocationsHayward, California, United States
Full-time
A company is looking for an AI Engineer who has experience architecting and shipping robust multi-agent systems in production. Key Responsibilities Design, develop, and deploy AI Coach capabilitie...Show moreLast updated: 30+ days ago
  • Promoted
AI Infrastructure Engineer, Model Serving Platform

AI Infrastructure Engineer, Model Serving Platform

Scale AI, Inc.San Francisco, CA, United States
Full-time
As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting-edge research and product...Show moreLast updated: 30+ days ago
  • Promoted
Support Infrastructure Engineer

Support Infrastructure Engineer

VirtualVocationsSan Jose, California, United States
Full-time
A company is looking for a Support Infrastructure Engineer III.Key Responsibilities Lead the design and deployment of DDI solutions, managing Infoblox appliances for enterprise network infrastruc...Show moreLast updated: 1 day ago
  • Promoted
AI Infrastructure Engineer

AI Infrastructure Engineer

SpellbrushSan Francisco, CA, US
Full-time
Spellbrush, the world’s leading generative AI studio behind.AI Infrastructure Engineer to join us in building out end-to-end ML infrastructure to run our models on all platforms.Design, imple...Show moreLast updated: 30+ days ago
  • Promoted
AI Infrastructure Engineer, ML Data Platform

AI Infrastructure Engineer, ML Data Platform

Scale AI, Inc.San Francisco, CA, United States
Full-time
Scale's AI Infrastructure team supports both R&D and applied Generative AI initiatives, driving breakthroughs in areas of post-training research such as AI safety, agents, and evaluating state-of-t...Show moreLast updated: 30+ days ago
  • Promoted
Senior Infrastructure Software Engineer, Enterprise AI

Senior Infrastructure Software Engineer, Enterprise AI

Scale AI, Inc.San Francisco, CA, United States
Full-time
Scale GP is building the next generation of enterprise-grade Generative AI products.Our platform provides APIs for knowledge retrieval, inference, and evaluation, enabling customers to build and de...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
Azure Cloud Governance Engineer

Azure Cloud Governance Engineer

VirtualVocationsConcord, California, United States
Full-time
A company is looking for a Customer Engineer specializing in Azure Architecture and Cloud Governance.Key Responsibilities Lead architectural discussions and implementation planning using Microsof...Show moreLast updated: 13 hours ago
  • Promoted
Infrastructure Software Engineer, Public Sector

Infrastructure Software Engineer, Public Sector

Scale AI, Inc.San Francisco, CA, United States
Full-time
Scale AI is seeking a highly skilled and motivated.Software Engineer, AI Infrastructure & Security.Public Sector Engineering team. As a part of this team, you will play a critical role in delivering...Show moreLast updated: 30+ days ago
  • Promoted
AI Infrastructure Software Architect

AI Infrastructure Software Architect

KLAMilpitas, CA, United States
Full-time
KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem.Virtually every electronic device in the world is produced using our technologies.No laptop, smartpho...Show moreLast updated: 30+ days ago
  • Promoted
Staff Engineer, Ads Infrastructure

Staff Engineer, Ads Infrastructure

VirtualVocationsConcord, California, United States
Full-time
A company is looking for a Staff Engineer, Ads Development Infra.Key Responsibilities Lead the evolution of the Ads tech stack to enhance scalability and performance Collaborate with engineers t...Show moreLast updated: 2 days ago
  • Promoted
AI Infrastructure Engineer, Agents

AI Infrastructure Engineer, Agents

Scale AI, Inc.San Francisco, CA, United States
Full-time
As a Software Engineer on the ML Infrastructure team, you will design and build the platform for our agent sandboxing platform : the secure, high-performance code execution layer powering our agenti...Show moreLast updated: 30+ days ago
  • Promoted
  • New!
Terraform and IaC Engineer

Terraform and IaC Engineer

VirtualVocationsFremont, California, United States
Full-time
A company is looking for a Terraform and IaC Engineer to support a migration project.Key Responsibilities Design, author, and maintain Terraform modules / stacks for various AWS constructs and serv...Show moreLast updated: 13 hours ago
  • Promoted
Engineering Manager, Data Infrastructure

Engineering Manager, Data Infrastructure

VirtualVocationsConcord, California, United States
Full-time
A company is looking for an Engineering Manager, Data Infrastructure & Serving Layer.Key Responsibilities Lead and manage a high-performing engineering team focused on the serving layer Design a...Show moreLast updated: 2 days ago