A company is looking for a Lead Site Reliability Engineer (SRE).
Key Responsibilities
Drive incident response best practices, lead postmortems, and define SLAs / SLOs across platform services
Collaborate with product & app teams to implement reliability reviews
Oversee evolution of the observability stack and build frameworks that empower engineering teams
Required Qualifications
7+ years in SRE, platform, systems, or infrastructure engineering
Strong background in AWS and Kubernetes operations
Experience establishing SLOs / SLA frameworks and driving org-wide adoption
Strong track record leading incident response and postmortem culture
Programming / scripting skills in Python, Go, or similar
Site Reliability Engineer • Charlotte, North Carolina, United States