We are seeking a Site Reliability Engineer to join our team in Atlanta and play in enhancing the stability, performance, and reliability of our production systems. You’ll work closely with development, DevOps, and security teams to improve observability, optimize system performance, and ensure production readiness. From monitoring to automation, you’ll make a direct impact on our cloud infrastructure and service reliability.
In this role, you will work hand-in-hand with our development, operations, and security teams worldwide to implement best practices, automate deployments, and ensure our platforms are reliable, secure, and scalable. Troubleshooting in Kubernetes is required which will involve you having a deep understanding of pods, nodes, networking, scaling, logs, and service-to-service communication.
This role requires a deep understanding of SRE best practices and a strong ability to troubleshoot complex issues.
Your responsibilities in this role will include :
We are looking for you to have the following skills and experience :
Nice to Have
This is a full-time role and unfortunately we can't sponsor so you must be a USC or be a green card holder. You must currently live in the Atlanta area as you will need to come into our Atlanta office one or two times each month for key meetings with our team.
If you thrive on solving complex technical challenges, have a passion for automation, and want to influence how enterprise platforms evolve and modernize, this is an ideal opportunity for you.
Engineer Sre • Alpharetta, GA, United States