A company is looking for a Staff Systems Reliability Engineer.
Key Responsibilities
Design and implement scalable, fault-tolerant AWS-based infrastructure
Develop and maintain CI / CD pipelines and automation tools for operations
Lead incident response efforts and mentor junior SREs
Required Qualifications
Minimum of 12 years of related experience with a Bachelor's degree; or 8 years with a Master's degree; or a PhD with 5 years of experience
Expert-level knowledge of AWS services and infrastructure automation tools
Strong proficiency in Python and / or Go for automation and tooling
Experience managing observability and alerting systems at scale
Familiarity with regulatory requirements related to infrastructure and DevOps
Staff Reliability Engineer • Fremont, California, United States