A company is looking for a Staff Software Engineer specializing in Reliability and Performance.

Key Responsibilities
Design and execute long-lived workloads that simulate production environments
Develop metrics and dashboards to monitor system performance and guide improvements
Conduct chaos and fault injection experiments to validate system resilience and correctness

Required Qualifications
Strong background in systems engineering, performance testing, or site reliability engineering
Proficiency in Python and Linux fundamentals; Rust experience is a plus
Experience with distributed systems and database concepts
Familiarity with CI/CD pipeline engineering (GitHub Actions, Docker, Kubernetes)
Hands-on experience with large-scale workloads in a cloud-native environment

Reliability Engineer • Vancouver, Washington, United States