Job Title : Site Reliability Engineer
Location : Florida City (FL)
ZIP Code : 32004
Experience Required : 12 Years
Employment Type : Contract
About the Role
We are seeking an experienced Site Reliability Engineer (SRE) with more than 12 years of hands-on expertise in building maintaining and improving large-scale highly available systems. The ideal candidate will have strong skills in automation cloud infrastructure performance optimization monitoring and incident response.
Key Responsibilities
Design implement and maintain highly reliable scalable and secure systems.
Develop automation to reduce manual operational tasks and eliminate repeated issues.
Build and maintain CI / CD pipelines to support continuous delivery and deployment.
Manage cloud infrastructure (AWS Azure or GCP) including networking security and scaling.
Create and maintain monitoring logging and alerting systems using modern tooling.
Lead incident response root-cause analysis and post-incident reviews.
Improve system performance and reliability through capacity planning and performance tuning.
Work closely with software engineering teams to ensure smooth production operations.
Implement infrastructure-as-code using Terraform Ansible or similar tools.
Ensure compliance with security and operational standards.
Required Skills and Experience
12 years of experience in Site Reliability DevOps or Production Engineering roles.
Strong hands-on experience with cloud platforms (AWS Azure or GCP).
Expertise in CI / CD pipelines and automation tools such as Jenkins GitLab or GitHub Actions.
Proficiency with containerization and orchestration (Docker Kubernetes).
Experience with monitoring tools such as Prometheus Grafana ELK Splunk or Datadog.
Strong scripting / programming skills (Python Bash Go or similar).
Familiarity with networking concepts load balancing and distributed systems.
Solid understanding of security best practices and infrastructure governance.
Experience managing high-availability systems in production environments.
Preferred Qualifications
Experience working in Agile environments.
Background with database operations (SQL / NoSQL).
Knowledge of infrastructure cost optimization.
Key Skills
Kubernetes,FMEA,Continuous Improvement,Elasticsearch,Go,Root cause Analysis,Maximo,CMMS,Maintenance,Mechanical Engineering,Manufacturing,Troubleshooting
Experience : years
Vacancy : 1
Site Reliability Engineer • Florida City, Florida, USA