A company is looking for a Senior Site Reliability Engineer to enhance system reliability and operational efficiency.
Key Responsibilities
Lead initiatives to improve system reliability, scalability, and performance across services
Define and implement SLIs, SLOs, and error budgets to guide engineering priorities
Design observability systems that produce actionable alerts and data with minimal noise
Required Qualifications
6 to 10+ years of experience in SRE, infrastructure, or backend systems engineering
Proven experience owning reliability outcomes for complex, distributed systems
Strong experience with cloud infrastructure (AWS, GCP, or Azure) and production-scale systems
Deep understanding of observability, incident management, and system performance
Proficiency in at least one programming language, with a focus on automation and tooling