Talent.com
Site Reliability Engineer

Canonical • Washington, DC, United States
13 days ago
Job type
  • Full-time
Job description

Overview

Canonical is a leading provider of open source software and operating systems. Our platform, Ubuntu, is used across enterprise initiatives in public cloud, data science, AI, engineering innovation, and IoT. Canonical has 1200+ colleagues in 75+ countries and very few office-based roles. Teams meet two to four times yearly in person in various locations around the world. The company is founder-led, profitable, and growing.

We are hiring a Site Reliability Engineer to enhance enterprise infrastructure DevOps practices with a model-driven approach, whether on-premise or on public clouds.

We run hundreds of private cloud environments, Kubernetes clusters, and applications for customers across physical and public cloud estates. The role focuses on identifying and addressing incidents, monitoring and observing applications, anticipating issues, and enabling product refinement to achieve high-quality standards in our open source portfolio.

Location: Globally remote role.

Responsibilities

  • Deploy and run OpenStack, Kubernetes, storage solutions, and open source applications using DevOps practices.
  • Identify and address incidents, monitor and observe applications, and enable product refinement to achieve high-quality standards in our open source portfolio.
  • Work across the full stack—from bare-metal networking and kernel to Kubernetes and open source applications.
  • Collaborate on automation as a software engineering problem, driven by metrics and code, to operate at scale.

What we are looking for

  • Software engineer fluent in Python with genuine interest in the full open source infrastructure stack from bare metal to containers.
  • Strong background in Linux, Python, networking, and knowledge of how clouds work.
  • Ability to work in operations with mission-critical services for global brand-name customers.
  • Degree in software engineering or computer science.
  • Operational experience in Linux environments.
  • Experience with Kubernetes deployment or operations.
  • Excellent interpersonal skills, curiosity, flexibility, and accountability.
  • Ability to travel internationally twice a year, for company events up to two weeks long.

Bonus skills

  • Familiarity with OpenStack deployment or operations.
  • Familiarity with public cloud deployment or operations.
  • Familiarity with private cloud management.

What we offer

Compensation is based on geographical location, experience, and performance. We adjust compensation every 6 months and offer annual bonuses in addition to base pay. We provide a range of benefits aligned to local needs and global fairness.

  • Distributed work environment with twice-yearly team sprints in person
  • Personal learning and development budget of USD 2,000 per year
  • Compensation review every 6 months
  • Recognition rewards
  • Annual holiday leave
  • Maternity and paternity leave
  • Employee Assistance Programs
  • Opportunity to travel to new locations to meet colleagues
  • Priority Pass and travel upgrades for long-haul company events

About Canonical

Canonical is a pioneering tech firm at the forefront of the move to open source. As the publisher of Ubuntu, we recruit globally and hold high standards for new hires. Most colleagues have worked from home since 2004. Working at Canonical challenges you to think differently, work smarter, learn new skills, and raise your game.

Equal opportunity

Canonical is an equal opportunity employer. We foster a workplace free from discrimination. We value diversity of experience, perspectives, and background, and we will give every applicant fair consideration.
