Site Reliability Engineer
2 days onsite in Jacksonville FL
MUST HAVE : AWS Ansible Datadog
We are seeking a Site Reliability Engineering (SRE) contractor to support their Global Command Center (GCC) which has historically been network-focused since its launch in 2018 but is now evolving to operate with the same agility and innovation as engineering and infrastructure teams. The ideal candidate will help accelerate this transformation by driving automation observability and operational excellence .
Key Responsibilities
- Observability & Monitoring : Build optimize and maintain dashboards in tools like Datadog Splunk and AppDynamics to deliver actionable insights and improve visibility into systems and services.
- Automation & Tooling : Develop and maintain automation solutions using Ansible Python ServiceNow JIRA and other platforms to reduce manual work streamline processes and improve efficiency.
- Operational Agility : Help modernize how the GCC works-enabling faster change proactive problem detection and measurable value delivery.
- Collaboration & Communication : Clearly present ideas dashboards and proposed improvements to technical teams leadership and stakeholders.
- Continuous Improvement : Support the shift from a traditionally slower network-centric model toward a more software-driven automation-first approach.
Preferred Skills & Tools
Automation : Ansible Python ServiceNow JIRAObservability & Monitoring : Datadog Splunk AppDynamics ZenossCollaboration & Presentation : Strong ability to explain technical concepts to peers and leadershipMindset : Comfortable in a transforming environment where change is expected and speed mattersKey Skills
Kubernetes,FMEA,Continuous Improvement,Elasticsearch,Go,Root cause Analysis,Maximo,CMMS,Maintenance,Mechanical Engineering,Manufacturing,Troubleshooting
Employment Type : Full Time
Experience : years
Vacancy : 1