Talent.com
Site Reliability Engineer

Canonical
Washington, DC, United States
4 days ago
Job type
  • Full-time
Job description

Overview

Canonical is a leading provider of open source software and operating systems. Our platform, Ubuntu, is used across enterprise initiatives in public cloud, data science, AI, engineering innovation, and IoT. Canonical has 1200+ colleagues in 75+ countries and very few office-based roles. Teams meet two to four times yearly in person in various locations around the world. The company is founder-led, profitable, and growing.

We are hiring a Site Reliability Engineer to enhance enterprise infrastructure DevOps practices with a model-driven approach, whether on-premises or on public clouds.

We run hundreds of private cloud environments, Kubernetes clusters, and applications for customers across physical and public cloud estates. The role focuses on identifying and addressing incidents, monitoring and observing applications, anticipating issues, and enabling product refinement to achieve high-quality standards in our open source portfolio.

Location: Globally remote role.

Responsibilities

  • Deploy and run OpenStack, Kubernetes, storage solutions, and open source applications using DevOps practices.
  • Identify and address incidents, monitor and observe applications, and enable product refinement to achieve high-quality standards in our open source portfolio.
  • Work across the full stack, from bare-metal networking and the kernel to Kubernetes and open source applications.
  • Collaborate on automation as a software engineering problem, driven by metrics and code, to operate at scale.

What we are looking for

  • Software engineer fluent in Python with genuine interest in the full open source infrastructure stack from bare metal to containers.
  • Strong background in Linux, Python, networking, and knowledge of how clouds work.
  • Ability to work in operations with mission-critical services for global brand-name customers.
  • Degree in software engineering or computer science.
  • Operational experience in Linux environments.
  • Experience with Kubernetes deployment or operations.
  • Excellent interpersonal skills, curiosity, flexibility, and accountability.
  • Ability to travel internationally twice a year, for company events up to two weeks long.

Bonus skills

  • Familiarity with OpenStack deployment or operations.
  • Familiarity with public cloud deployment or operations.
  • Familiarity with private cloud management.

What we offer

Compensation is based on geographical location, experience, and performance. We adjust compensation every 6 months and offer annual bonuses in addition to base pay. We provide a range of benefits aligned to local needs and global fairness.

  • Distributed work environment with twice-yearly team sprints in person
  • Personal learning and development budget of USD 2,000 per year
  • Compensation review every 6 months
  • Recognition rewards
  • Annual holiday leave
  • Maternity and paternity leave
  • Employee Assistance Programs
  • Opportunity to travel to new locations to meet colleagues
  • Priority Pass and travel upgrades for long-haul company events

About Canonical

Canonical is a pioneering tech firm at the forefront of the move to open source. As the publisher of Ubuntu, we recruit globally and hold high standards for new hires. Most colleagues have worked from home since 2004. Working at Canonical challenges you to think differently, work smarter, learn new skills, and raise your game.

Equal opportunity

Canonical is an equal opportunity employer. We foster a workplace free from discrimination. We value diversity of experience, perspectives, and background, and we will give every applicant fair consideration.
