Talent.com
Site Reliability Engineer

Red Hat, Boston, MA, United States
4 days ago
Job type
  • Full-time
  • Permanent
Job description

Red Hat is looking for a Platform Engineer to join its Platform Engineering team! In this role, you will help architect, implement, improve, and support the OpenShift-based platform that runs many of Red Hat’s most important multi-tenant Software-as-a-Service (SaaS) and Managed-service offerings. Using your expertise in SRE principles, you will help create an environment where reliability, scalability, and security come first, and are not treated as an afterthought.

In this role, you will spend a portion of your time working across teams to define and iterate upon processes for onboarding new managed services at Red Hat and demonstrate good judgment in employing onboarding methods and techniques that can be repeated and iterated upon. You will also contribute to the codebase of command-and-control software that automates the building, deployment, monitoring, and alerting of Red Hat managed services. The remainder will be spent on various other tasks, such as diagnosing issues, planning, documenting, and mentoring.

What You Will Do

  • Design, write, and maintain software (primarily in Python and Golang) that automates the deployment, monitoring, and maintenance of Red Hat managed services.
  • Onboard new services onto our OpenShift-based platform.
  • Adhere to cloud-native design principles and best practices to ensure reliability, scalability, and security.
  • Contribute to documents, like standard operating procedures (SOPs) and playbooks, that assist in issue resolution and new-service onboarding.
  • Proactively utilize AI-assisted development tools (e.g., GitHub Copilot, Cursor, Claude Code) for code generation, auto‑completion, and intelligent suggestions to accelerate development cycles and enhance code quality.
  • Participate in an Agile Scrum team that scopes, prioritizes, and allocates work items.
  • Participate in an on‑call rotation that is responsible for responding to service incidents.

What You Will Bring

  • Background writing object‑oriented automation software in Python; experience with Golang is a plus
  • Background administering production cloud‑native services, preferably containerized and deployed via a container‑orchestration system like Kubernetes or OpenShift
  • Experience diagnosing service failures and carrying out incident response procedures
  • Familiarity with the Linux operating system and its configuration
  • Ability to effectively work in a globally distributed team
  • Understanding of computer networking and protocols, including TCP/IP and DNS
  • Understanding of computer security and cryptography basics, including certificates, TLS, and credential‑storage systems like Vault is a plus
  • Familiarity with CI/CD pipeline concepts and systems, like Jenkins and Tekton/Argo, is a plus
  • Familiarity with observability tools like Prometheus and Grafana, and how to define metrics that can be used to measure service health and reliability is a plus

    The salary range for this position is $94,550.00 - $151,170.00. The actual offer will be based on your qualifications.

    Pay Transparency

    Red Hat determines compensation based on several factors including but not limited to job location, experience, applicable skills and training, external market value, and internal pay equity. Annual salary is one component of Red Hat’s compensation package. This position may also be eligible for bonus, commission, and/or equity. For positions with Remote-US locations, the actual salary range for the position may differ based on location but will be commensurate with job duties and relevant work experience.

    About Red Hat

    Red Hat is the world’s leading provider of enterprise open source software solutions, using a community‑powered approach to deliver high‑performing Linux, cloud, container, and Kubernetes technologies. Spread across 40+ countries, our associates work flexibly across work environments, from in‑office, to office‑flex, to fully remote, depending on the requirements of their role. Red Hatters are encouraged to bring their best ideas, no matter their title or tenure. We’re a leader in open source because of our open and inclusive environment. We hire creative, passionate people ready to contribute their ideas, help solve complex problems, and make an impact.

    Benefits

  • Comprehensive medical, dental, and vision coverage
  • Flexible Spending Account - healthcare and dependent care
  • Health Savings Account - high deductible medical plan
  • Retirement 401(k) with employer match
  • Paid time off and holidays
  • Paid parental leave plans for all new parents
  • Leave benefits including disability, paid family medical leave, and paid military leave
  • Additional benefits including employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, employee assistance program, and more!

    Note: These benefits are only applicable to full-time, permanent associates at Red Hat located in the United States.

    Inclusion at Red Hat

    Red Hat’s culture is built on the open source principles of transparency, collaboration, and inclusion, where the best ideas can come from anywhere and anyone. When this is realized, it empowers people from different backgrounds, perspectives, and experiences to come together to share ideas, challenge the status quo, and drive innovation. Our aspiration is that everyone experiences this culture with equal opportunity and access, and that all voices are not only heard but also celebrated. We hope you will join our celebration, and we welcome and encourage applicants from all the beautiful dimensions that compose our global village.

    Equal Opportunity Policy (EEO)

    Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.

    Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.

    Red Hat supports individuals with disabilities and provides reasonable accommodations to job applicants. If you need assistance completing our online job application, email application-assistance@redhat.com. General inquiries, such as those regarding the status of a job application, will not receive a reply.
