A company is looking for a Lead Observability Engineer to own and rebuild their observability strategy across agentic systems and SaaS infrastructure.
Key Responsibilities
Own the end-to-end observability strategy, defining standards, tools, and patterns for reliable visibility
Design and implement correlation models linking agent behavior, LLM interactions, and SaaS telemetry
Unify observability tooling across teams, ensuring metrics, logs, and traces flow into a central platform
Required Qualifications
6+ years of experience in SRE, DevOps, or Observability Engineering roles, with 2+ years in a leadership capacity
Deep knowledge of observability tooling such as OpenTelemetry, Prometheus, and Datadog
Experience with agentic / LLM-based systems and orchestration frameworks
Strong understanding of instrumenting and tracing AI / LLM workflows with infrastructure telemetry
Proven ability to define cross-team standards and influence engineering culture
Observability Engineer • Sacramento, California, United States