Site Reliability Engineer
Easy Dynamics
McLean, VA
Full-time
We are looking for a Site Reliability Engineer to join our team and develop software systems and automated solutions that support the organization's operations.
Responsibilities include monitoring computer systems and building alerts for the operational issues those systems can experience.
Ultimately, you will work with our IT team to ensure our organization can continue to deliver products and services across our computing environment.
Responsibilities:
- Automate the server provisioning process to reduce manual work for our network engineering and datacenter operations teams
- Perform deep dives into both systemic and latent reliability issues
- Drive standardization efforts across multiple disciplines and services in conjunction with embedded SREs throughout the organization
- Build and maintain low latency, high performance, scalable systems in a polyglot architecture
- Ensure the team can support massive, global user growth while achieving rigorous SLAs
- Enforce best practices for metrics gathering, monitoring, and alarming
- Accelerate the team by making service / application deployment and builds fast, simple, and reliable
- Provide basic data administration, optimization, and monitoring
- Provide basic network administration and troubleshooting
- Assist in the technology selection for core computing and software infrastructure along with data persistence
Qualifications:
- Proven work experience as a Site Reliability Engineer or similar role within a Microsoft Azure or AWS cloud / hybrid-cloud environment
- Experience with infrastructure management tools such as Terraform
- Experience with web server configuration, monitoring, trending, network design, and high availability
- Proficiency in a scripting language
- Practical, solid knowledge of shell scripting (Bash or PowerShell preferred) and at least one higher-level language
- Comfortable configuring DNS, DHCP, and LAN / WAN technologies
- Ability to collaborate and communicate asynchronously
- A habit of documenting all the things so you don’t need to learn the same thing twice
- An enthusiastic, go-for-it attitude
- Relevant training and / or certifications as a Site Reliability Engineer
- U.S. citizenship required
- Ability to obtain a U.S. Government clearance