Lead SREOpenkyber • GA, United States

Lead SRE

Openkyber • GA, United States

10 hours ago

Job type

Full-time

Quick Apply

Job description

Job Summary We are seeking an experienced Site Reliability Engineer (SRE) to join the Applied AI and Data Science program. This role focuses on deploying, monitoring, and optimizing cloud-based applications and infrastructure to ensure high availability and performance. The ideal candidate will have strong expertise in AWS, containerized microservices, infrastructure automation, and monitoring tools.

Key Responsibilities :

Release Management : Build and deploy application, service, and infrastructure releases; validate system integrity post-deployment; document release notes.
Production Support : Maintain 99.999% availability of critical systems; monitor infrastructure and applications; perform root cause analysis for outages; respond to incidents.
Monitoring & Alerting : Implement monitoring policies; build dashboards; track system efficiency and resource consumption; alert stakeholders for SLA deviations.
Optimization : Manage resource scaling; optimize system performance and resource utilization.
Team Collaboration : Assist with user support; coordinate with onshore / offshore teams; develop bug fixes; become an expert in system architecture and deployment pipelines.

Required Qualifications :

6+ years of DevOps or SRE experience in large, complex environments.

Strong background in software development (OOP) and ability to read / debug code.

Expertise in AWS services (EKS, S3, DocumentDB) and Terraform for Infrastructure as Code.

Experience with Kubernetes, containerized microservices, and cloud deployments.

Proficiency with GitLab or similar CI / CD tools for pipeline management.

Hands-on experience with monitoring tools such as Datadog or Splunk.

Bachelors degree in a related field or equivalent experience.

Preferred Qualifications :

Familiarity with Python, Node.js, React, TypeScript, and GraphQL.

Exposure to relational (SQL) and NoSQL databases.

Experience with Docker, Redis, and ORM frameworks.

Knowledge of experimentation, statistical testing, and data analysis.

Masters degree in a related field is a plus.

Create a job alert for this search

Sre • GA, United States

Related jobs

Manager, Learning & Development

Georgia Staffing • Nicholls, GA, US

Full-time

Manager, Learning And Development.At CoreCivic, our employees are driven by a deep sense of service, high standards of professionalism and a responsibility to better the public good.We are currentl...Show more

Last updated: 1 day ago • Promoted

Plant People & Culture Specialist

Southwire Company, LLC • Douglas, GA, United States

Full-time

A leader in technology and innovation, Southwire Company, LLC is one of North America's largest wire and cable producers. Southwire and its subsidiaries manufacture building wire and cable, utility ...Show more

Last updated: 12 days ago • Promoted

SRE Architect

Openkyber • GA, United States

Full-time

Quick Apply

RESPONSIBILITIES : Kforce has a client that is seeking a Remote Senior Site Reliability Engineer (SRE) to join a dynamic, collaborative team supporting enterprise-level experiment...Show more

Last updated: 1 day ago

Sr. Director - Infrastructure Mergers and Acquisitions

McKesson Corporation • GA, United States

Full-time

McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare.We are known for delivering insights, products, and services that make quality care more accessibl...Show more

Last updated: 3 days ago • Promoted

SAP Architect

Openkyber • GA, United States

Full-time

Quick Apply

Job Title : SAP Architect Location : Onsite, Normal, IllinoisShow more

Last updated: 1 day ago

Team Lead de Landscaping

The Grounds Guys • EASTMAN, GA, US

Aquí está el trato : somos increíbles cuidando a nuestros clientes y brindando un servicio excepcional de cuidado del césped y paisajismo residencial y comercial. Pero eso solo sucede porque tenemos ...Show more

Last updated: 30+ days ago

Sr. QA Lead - Specialized

Unicorn Technologies LLC • GA, United States

Full-time

Quick Apply

QA Lead - Specialized Location : Atlanta, GA (onsite) Qualifications : • •Verify P...Show more

Last updated: 8 days ago

ML Deployment Specialist

Openkyber • GA, United States

Temporary

Quick Apply

Looking for an AI Engineer for a 7 month contract with possible long term extension.AI Engineer : The AI Engineer will play a pivotal role in designing, developing, and deploying ...Show more

Last updated: 16 hours ago • New!

Store Shift Lead

Murphy USA • Eastman, GA, US

Full-time +1

As one of the largest national gasoline and convenience retailers with more than 1,700 stores in 27 states, we know that without our committed team, we are simply another retailer.Ready to be empow...Show more

Last updated: 8 hours ago • Promoted • New!

SOC Lead / Team Lead

Openkyber • GA, United States

Full-time

Quick Apply

Job Title : SOC Lead / Team Lead Location : Alpharetta, GA (Onsite In-Person Interview Required)Show more

Last updated: 30+ days ago

Foreperson - NON-UNION

Asplundh Tree Expert • Eastman, GA, US

Full-time

This position ensures the productivity of daily operations, working closely with management to determine recruiting / hiring needs, deadlines, and safety protocols to enforce among the crew.The Forep...Show more

Last updated: 30+ days ago

AI Delivery Leader

Celersoft • Georgia, USA

Full-time

Essential Duties and Responsibilities : We are seeking a highly motivated and experienced Delivery Leader to oversee the execution of strategic Analytics and ...Show more

Last updated: 22 days ago

Team Lead de Mantenimiento

The Grounds Guys • EASTMAN, GA, US

Last updated: 30+ days ago

Regional Sales Director

Vyve Broadband • Douglas, GA, US

Full-time

Quick Apply

Vyve is a leading broadband Internet provider serving largely non-urban communities in 16 states.A technology leader in the cable and broadband sectors, Vyve Broadband offers an extensive range of ...Show more

Last updated: 30+ days ago