Talent.com
Site Reliability Engineer
No longer accepting applications

Utility Associates, Inc. • Decatur, GA, US
30+ days ago
Job type
  • Full-time
Job description

Position Summary

We are seeking a highly experienced Site Reliability Engineer (SRE) to help design, operate, and continuously improve a highly available, secure, and cost-efficient multi-cloud platform. This role is deeply hands-on and execution-focused, with a strong bias toward action, measurable outcomes, and incremental progress.

The ideal candidate has deep expertise in AWS and Azure, strong instincts for efficient architecture, and a passion for observability, automation, and cost optimization (FinOps). You will work in a high-compliance environment, partnering closely with engineering, product, and operations teams to ensure platform health, reliability, and scalability.

This role also requires leadership through influence—guiding teams, setting technical direction, and raising operational maturity—while remaining directly engaged in delivery.

Essential Duties and Responsibilities

Reliability & Platform Health

  • Design, implement, and operate reliable, scalable, and resilient systems across AWS and Azure
  • Establish and improve SLOs, SLIs, error budgets, and incident response practices
  • Lead root cause analysis and drive corrective actions to prevent recurrence
  • Continuously improve platform uptime, performance, and operational maturity

Observability & Operational Excellence

  • Design and maintain best-in-class observability (metrics, logs, traces, alerting)
  • Ensure actionable alerts with low noise and high signal
  • Use data to identify reliability risks, performance bottlenecks, and efficiency opportunities
  • Drive incremental improvements using clear goals and measurable outcomes

FinOps & Cost Optimization (Top Priority)

  • Own and drive cloud cost optimization initiatives across AWS and Azure
  • Partner with engineering and leadership to align cost with business value
  • Implement cost visibility, forecasting, and accountability practices
  • Identify architectural and operational improvements that reduce waste without sacrificing reliability or security

Security & Compliance

  • Operate within highly regulated environments with strong security controls
  • Support compliance efforts (SOC 2, CJIS preferred, NIST-aligned practices)
  • Embed reliability and compliance requirements into platform design and operations
  • Partner with security teams to ensure secure-by-default systems

Automation & AI-Enabled Efficiency

  • Automate operational workflows using Infrastructure as Code, CI/CD, and tooling
  • Leverage AI tools for analysis, incident investigation, cost insights, capacity planning, and operational efficiency
  • Continuously seek opportunities to move faster and smarter through automation and intelligent tooling

Leadership & Collaboration

  • Act as a technical leader and trusted partner to engineering teams
  • Guide and mentor others on reliability, observability, and cost-efficient design
  • Influence architecture and operational decisions through data and collaboration
  • Drive initiatives end-to-end with accountability and ownership

Required Qualifications

  • Bachelor's Degree in Computer Science or Engineering
  • 5+ years of experience in Site Reliability Engineering, DevOps, or Platform Engineering
  • Deep hands-on experience operating production systems in AWS and Azure
  • Strong background in cloud architecture, reliability engineering, and automation
  • Proven experience with observability platforms (metrics, logging, tracing, alerting)
  • Demonstrated success driving cloud cost optimization / FinOps initiatives
  • Experience working in high-compliance environments (SOC 2 required; CJIS a plus)
  • Strong scripting or programming skills (e.g., Python, Go, Bash, or similar)
  • Experience with Infrastructure as Code (e.g., Terraform, CloudFormation, ARM/Bicep)

Preferred Qualifications

  • CJIS compliance experience
  • Experience supporting SaaS platforms serving public sector or regulated customers
  • Exposure to multi-region, high-availability architectures
  • Experience implementing or maturing FinOps practices at scale
  • Prior experience mentoring or leading cross-functional technical initiatives

What Success Looks Like

  • Platform reliability and performance improve measurably over time
  • Cloud costs are visible, controlled, and optimized without compromising outcomes
  • Incidents are fewer, better managed, and result in lasting improvements
  • Teams adopt better operational practices through your guidance and example
  • Incremental progress toward clear goals is made consistently and transparently

Why This Role Matters

This role sits at the intersection of reliability, cost, security, and speed. You will have the opportunity to materially impact how the platform scales, how efficiently we operate, and how quickly we adapt—using modern cloud practices, strong engineering judgment, and AI-enabled insights.

Physical Demands and Work Environment

This role requires the employee to remain in a stationary, upright position for extended periods. Employees must be able to move frequently within an office environment to use office machinery and other resources. The employee must be able to communicate information and concepts clearly and consistently, including conveying precise details. Accurate task execution requires specific vision abilities, especially the ability to discern close-up details within a few feet. The role seldom involves moving items weighing up to 15 pounds.

Note

This job description in no way states or implies that these are the only duties to be performed by the employee(s) incumbent in this position. Employees will be required to follow any other job-related instructions and to perform any other job-related duties requested by any person authorized to give instructions or assignments. All duties and responsibilities are essential functions and requirements and are subject to possible modification to reasonably accommodate individuals with disabilities. To perform this job successfully, the incumbents will possess the skills, aptitudes, and abilities to perform each duty proficiently. Some requirements may exclude individuals who pose a direct threat or significant risk to the health or safety of themselves or others. The requirements listed in this document are the minimum levels of knowledge, skills, or abilities. This document does not create an employment contract, implied or otherwise, other than an “at-will” relationship. COREFORCE is an Equal Opportunity Employer and a drug-free workplace, and complies with ADA regulations as applicable.

The companies in the COREFORCE organization are innovative technology leaders, delivering groundbreaking digital systems tailored for frontline professionals who rely on speed, accuracy, easy-to-access data, and transparency in their work. COREFORCE is an equal-opportunity employer that promotes justice, advances equity, values diversity, and fosters inclusion. COREFORCE is committed to hiring the best talent – regardless of race, creed, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship status, marital status, disability, gender identity, genetic information, veteran status, or any other characteristic protected by applicable laws, regulations, and ordinances. If you have a disability or special need that requires assistance or accommodation, please email recruiting@coreforcetech.com.
