Talent.com
Site Reliability Engineering

TechniPros • Atlanta, Georgia, USA
10 days ago
Job type
  • Full-time
Job description

Job Title : Site Reliability Engineering (SRE) Architect

Location : Atlanta, Georgia (Hybrid)

Long Term Contract

Looking for W2 Candidates. No C2C

Role Summary :

As an SRE Architect, you will be a pivotal technical leader responsible for designing, building, and evolving the foundational systems and practices that ensure the reliability, scalability, performance, and efficiency of our critical services. Moving beyond day-to-day operations, you will focus on the strategic architectural direction of the SRE function, defining standards, blueprints, and frameworks that enable development teams and the SRE operations team to build and operate highly resilient systems. You will leverage deep expertise in software engineering, distributed systems, cloud infrastructure, and SRE principles to influence technology choices, establish best practices, and foster a proactive culture of reliability across the organization, well beyond the observability pillar.

Key Responsibilities :

Reliability Strategy & Design :

  • Architect and design highly available, scalable, secure, and cost-effective infrastructure and application patterns on AWS
  • Define and evangelize SRE best practices, standards, and blueprints for service design, deployment, monitoring, and operational readiness across the engineering organization
  • Review the current observability implementation to identify gaps, and define the steps needed to reach the next level of observability maturity, providing deep insight into system health and behaviour
  • As overall maturity improves, lead the definition and implementation strategy for Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Error Budgets for critical services

Platform Architecture & Automation :

  • Design solutions to systematically reduce operational toil through automation and improved system design
  • Evaluate current SRE tools and automation frameworks (e.g. CI/CD pipelines, Infrastructure as Code modules, automated incident remediation, chaos engineering platforms) and suggest enhancements that strengthen overall capability
  • Evaluate, prototype, and recommend new technologies, tools, and methodologies to enhance system reliability, developer productivity, and operational efficiency

Technical Leadership & Consultation :

  • Act as a senior technical advisor and subject matter expert on reliability, scalability, and performance for development and platform teams
  • Provide architectural guidance during the design phase of new services and features to ensure reliability principles are embedded early (shift-left)
  • Mentor and coach other SREs and engineers, fostering technical excellence and adherence to SRE principles
  • Lead architectural reviews and production readiness assessments for critical systems

Resilience :

  • Lead blameless postmortems for significant incidents, ensuring root causes are identified and systemic architectural improvements are prioritized and implemented
  • Architect and advocate for resilience patterns (e.g. circuit breaking, rate limiting, graceful degradation, chaos engineering) within applications and infrastructure

Required Qualifications :

  • Proven experience in an architectural role designing solutions for reliability, scalability, and performance
  • Deep understanding and practical application of SRE principles (SLIs/SLOs, error budgets, toil reduction, automation, incident management, postmortems)
  • Expertise in cloud computing platforms (e.g. AWS), including infrastructure, networking, and security services
  • Strong experience with containerization and orchestration technologies (Kubernetes, Docker, serverless computing)
  • Solid experience designing and implementing observability solutions (e.g. Dynatrace, Prometheus, Grafana, ELK/EFK Stack, Jaeger, OpenTelemetry)
  • Strong programming/scripting skills (e.g. Python, Go, Bash) for automation and tool development
  • Excellent analytical, problem-solving, and strategic thinking skills
  • Strong communication, collaboration, and leadership skills, with the ability to influence technical direction across teams

Preferred Qualifications :

  • Experience designing and implementing chaos engineering practices and platforms

Best Regards : Jahnavi G

Phone : 1-

Email :

Key Skills

Kubernetes, FMEA, Continuous Improvement, Elasticsearch, Go, Root Cause Analysis, Maximo, CMMS, Maintenance, Mechanical Engineering, Manufacturing, Troubleshooting

Employment Type : Full Time

Experience : years

Vacancy : 1
