Talent.com
Sr. Site Reliability Engineer
KēSTA I.T. • Culver City, CA, US
7 days ago
Job type
  • Full-time
  • Permanent
Job description

Come build, innovate, disrupt, and thrive!

KēSTA I.T. is actively seeking a Sr. Site Reliability Engineer for an immediate full-time opportunity with our industry-leading client.

Are you on the lookout for a unique career opportunity that offers leadership, responsibility, and the chance to make a significant impact? If you're eager to contribute to a thriving and stable organization while maintaining your confidentiality, continue reading.

Overview

A leading technology company in the immersive content space is searching for a Senior Site Reliability Engineer to help scale the global platform that delivers its product. This role is ideal for someone who thrives in fast-paced environments, enjoys automating at scale, and is passionate about building fault-tolerant systems that serve millions of users reliably.

In this position, you’ll design and refine the backbone of our cloud infrastructure, ensuring uptime, observability, and security across multiple tenants and delivery pipelines. You’ll collaborate closely with software and DevOps teams to define reliability goals, establish best practices, and develop the monitoring and automation that keep our systems healthy around the clock.

Key Responsibilities

  • Architect and automate scalable cloud infrastructure leveraging Terraform and modern container technologies such as Kubernetes.
  • Optimize global CDN performance and end-to-end content delivery pipelines to improve streaming quality and latency.
  • Build and maintain observability frameworks that include defining SLIs/SLOs, implementing alerting systems, and ensuring actionable insights through Grafana and Prometheus dashboards.
  • Establish proactive capacity planning and load testing strategies to ensure reliability during rapid growth and high traffic periods.
  • Drive continuous improvement through incident management, root cause analysis, and post-incident reviews that strengthen system resiliency.
  • Participate in on-call rotations and help define escalation workflows that uphold 24/7 service availability.
  • Collaborate with engineering teams to embed reliability and security principles into every stage of the deployment lifecycle.
  • Mentor team members on operational readiness, reliability patterns, and scalable system design.
  • Contribute to internal standards for compliance and data protection (SOC 2, GDPR, ISO 27001, open-source licensing).

Qualifications

  • 7+ years of hands-on experience in SRE or DevOps, focused on building reliable, distributed systems at scale.
  • Deep technical knowledge of AWS, CoreWeave, or other major cloud environments, with strong experience in container orchestration and Terraform-based automation.
  • Proven success managing multi-tenant architectures and applying best practices for data isolation, access control, and system hardening.
  • Skilled in monitoring, metrics, and tracing tools (e.g., Prometheus, Grafana) and experienced in using data-driven insights to enhance system performance.
  • Familiarity with security frameworks and automated auditing for compliance (SOC 2, GDPR, ISO 27001).
  • Strong leadership and mentorship capabilities; able to influence engineering culture and champion best practices around uptime, scalability, and operational excellence.

About KēSTA I.T.

Our name says it all: KēSTA I.T. (Keys-to-I.T.) and our people are the keys to our success! KēSTA I.T. is a premier Utah-based technical staffing and consulting services firm. We specialize in temporary and permanent placement of Software, Hardware, Network, Cloud, CRM/ERP, Data, End-User Support, Web, and Executive/leadership-based positions on a full-time and consulting basis. If you're interested in a role where top performance is rewarded, personal time is valued, and excellence is demanded at every level, we want to talk to you today!

Where do you want to go? We've got the keys! ~ KēSTA I.T.

WWW.KeSTAIT.COM
