Talent.com
Senior Manager, Site Reliability Engineering
Plume • Palo Alto, CA, US
3 days ago
Job type
  • Full-time
Job description

Life at Plume

At Plume, we believe that technology isn't about moving faster, it's about making life's moments better. Which is why we've built the world's first, and only, open and hardware-independent service delivery platform for smart homes, small businesses, enterprises, and beyond. Our SaaS platform uses WiFi, advanced AI, and machine learning to create the future of connected spaces and human experiences at massive scale.

We now deliver services to over 60 million locations globally and have managed over 3 billion devices on our platform. We're expanding rapidly, pioneering a new category, and we achieved our Series F funding in just four years. Our customers include many of the world's largest Internet Service Providers (ISPs) who look to Plume to help them evolve their smart home offerings while gleaning insights from their own data.

With a bias for action and a love for being trailblazers, the team at Plume embodies a combination of relentless curiosity and imaginative innovation. We challenge ourselves to think in ways that other companies don't, work to do what should be done (rather than what can), and if we can't do it exceptionally well, we don't do it. It's how we've assembled a team of world-class builders, thinkers, and doers. And it's how we're reinventing what's possible every day.

Senior Manager, Site Reliability Engineering (SRE)

We are seeking a highly experienced and visionary Director of Site Reliability Engineering (SRE) to lead our growing global SRE organization. This critical role requires a strong people leader who can manage managers, set the strategic direction for the SRE function, and ensure the reliability, scalability, and performance of our systems.

What You'll Do:

  • People Leadership and Management:
      • Lead, mentor, and develop a team of SRE managers and individual contributors across multiple geographical locations.
      • Foster a culture of continuous learning, collaboration, and operational excellence within the SRE organization.
      • Conduct regular 1:1s, provide constructive feedback, and support career development for all team members.
      • Mediate conflicts and facilitate effective communication within the team and with other departments.
  • Organizational Strategy and Direction:
      • Define and articulate the strategic vision and roadmap for the global SRE organization, aligning with overall business objectives.
      • Establish and enforce best practices for incident response, problem management, change management, and disaster recovery.
      • Drive the adoption of SRE principles and methodologies across engineering teams.
      • Stay abreast of industry trends and emerging technologies to continuously improve our SRE capabilities.
  • Hiring and Team Growth:
      • Lead the recruitment efforts for SRE roles, including defining job requirements, interviewing candidates, and making hiring decisions.
      • Develop and implement onboarding programs to ensure new hires are successfully integrated into the team.
      • Identify skill gaps and implement training programs to enhance the capabilities of the SRE team.
  • Enabling and Empowering the Team:
      • Provide the necessary tools, resources, and support to enable SRE teams to effectively monitor, troubleshoot, and optimize system performance.
      • Empower teams to take ownership of system reliability and drive continuous improvement initiatives.
      • Remove roadblocks and facilitate cross-functional collaboration to ensure the success of SRE projects.
  • Delivery and Results:
      • Ensure the successful delivery of SRE initiatives, projects, and goals, meeting defined SLAs and KPIs.
      • Drive efforts to reduce operational toil, improve system availability, and enhance overall system stability.
      • Report on key SRE metrics and progress to senior leadership.

What You'll Bring:

  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • 10+ years of experience in Site Reliability Engineering or a similar role.
  • 3-5 years of experience managing and leading engineering teams, including managing managers, in a global environment.
  • Proven track record of building and scaling high-performing SRE organizations.
  • Deep understanding of SRE principles, methodologies, and best practices.
  • Experience with large-scale distributed systems and cloud platforms.
  • Strong communication, interpersonal, and leadership skills.
  • Ability to think strategically and execute tactically.
  • Experience with budgeting and resource allocation.

About Plume

As the creator of the only open, hardware-independent, cloud-controlled experience platform for ISPs and their subscribers, Plume partners with over 400 ISP customers, including some of the world's largest such as Comcast, Charter, Liberty Global, and J:COM.

Using OpenSync, the most widely supported open-source, silicon-to-cloud framework for smart spaces, Plume's software-defined network allows ISPs to decouple their service offerings from hardware and rapidly curate and deliver new services over a multi-vendor, open-platform architecture.

Plume is an equal opportunity workplace that maintains a continuing policy of nondiscrimination in all employment practices and decisions, ensuring equal employment opportunities for all qualified individuals without regard to race, color, creed, religion, sex, national origin, age, physical or mental disability, sexual orientation, gender identity, marital status, pregnancy, childbirth or related individual conditions, medical conditions (as defined by state law), military or veteran status, or any other characteristic protected by federal, state or local law.
