Talent.com
Director of Site Reliability Engineering

Crypto Pro Network
San Francisco, California, US
1 day ago
Job type
  • Full-time
Job description

Interested in working on cutting-edge blockchain technology and creating equitable access to the global financial system? Since 2014, the mission-driven team at the Stellar Development Foundation (SDF) has helped fuel the tremendous growth of the Stellar blockchain network, an open-source platform that operates at high scale today. Developers and companies around the world build on it, and the SDF team is expanding to support the rapidly growing and changing Stellar ecosystem.

You will lead an experienced Site Reliability Engineering team, ensuring our services and tooling are available, building infrastructure to make our team's production and testing environments available, and greasing the rails of our systems and processes to ensure they're robust, efficient, and easy to deploy.

SDF has a robust career path for both individual contributors and managers.

In this role, you will:

  • Establish a clear vision and mandate for the Site Reliability Engineering team
  • Define the SRE team's quarterly OKRs to best align with the company's goals
  • Define processes of collaboration between SREs and development teams throughout the software development lifecycle
  • Define a career growth path for the SRE team, as well as coach and mentor individual contributors on the team
  • Define and track metrics across engineering and help hold engineering teams accountable for their KPIs
  • Coordinate priorities with other teams and areas of the organization
  • Participate in sprint planning and execution, track progress and oversee day-to-day tactical decisions
  • Design and build reliable systems and infrastructure that software engineers find easy to use
  • Monitor and troubleshoot systems in production
  • Define and participate in 24/7 on-call rotations alongside the team
  • Mediate technical discussions and review PRs
  • Jump in as needed with code fixes, troubleshooting and hands-on contributions
  • Collaborate across the Stellar ecosystem, engaging with key partners and advising on their integration to set them up for success

You have:

  • 3+ years of experience working as a Site Reliability Engineer
  • 3+ years of experience managing an SRE team
  • Site Reliability Engineering experience:
      • Strong track record of collaborating with dev teams at all stages of product development (design, development/CI, beta testing, production)
      • Strong track record of collaborating on defining, measuring, and driving improvements in KPIs
      • Strong track record of assisting teams during root cause analysis and postmortems
  • Infrastructure and Operations experience:
      • Designing and building out the infrastructure for large distributed systems
      • Maintaining highly available infrastructure
      • Troubleshooting and understanding complex technical problems
      • Using configuration management or IaC tooling such as Terraform, Ansible, or Puppet
      • Building and maintaining infrastructure using Kubernetes
  • Highly autonomous; able to find clarity in ambiguous circumstances
  • Excellent communicator; comfortable working with remote team members

Bonus points if (optional):

  • You have 3+ years of experience writing code in a major programming language
  • You have worked on an open source project
  • You have managed a distributed team
  • You build things for fun in your spare time

We offer competitive pay with a base salary range for this position of $210,000 - $310,000 depending on job-related knowledge, skills, experience, and location. In addition, we offer lumen-denominated grants along with the following perks and benefits:

  • Competitive health, dental & vision coverage with most plans covered at 100% for the employee + any dependents
  • Flexible time off + 15 company holidays including a company-wide holiday break
  • Up to 12 weeks of paid parental leave for both non-birthing and birthing parents, as well as up to 14 weeks of paid pregnancy leave for birthing parents
  • Gym reimbursement ($80 per month)
  • Life & AD&D insurance (up to $50K)
  • Short- and long-term disability
  • 401K with 4% match
  • Health & Dependent Care FSA Accounts
  • Commuter benefits with $250/month employer contribution
  • Health Savings Account (HSA) with monthly employer contribution
  • Family building benefits through Kindbody
  • L&D budget of $1,500/year
  • Daily lunch and snacks in office
  • Company retreats

About Stellar

Stellar is more than a blockchain. Powered by a decentralized, fast, scalable, and uniquely sustainable network made for financial products and services and a thriving and passionate ecosystem that includes a non-profit organization driven by a mission, Stellar is paving the path to unlock the world’s economic potential through blockchain technology. Built with speed and low costs in mind, the Stellar network provides builders and financial institutions worldwide a platform to issue assets, and to send and convert currencies in real time, creating real world utility. Founded in 2014, the Stellar Development Foundation (SDF) supports the continued development and growth of the Stellar network and also serves the ecosystem of NGOs, corporations, universities, small businesses, governments, and solo entrepreneurs building on the Stellar network through tooling, funding and strategic collaborations. Together, Stellar is where blockchain meets the real world.

About the Stellar Development Foundation

The Stellar Development Foundation (SDF) is a non-profit organization focused on working with and supporting changemakers to create equitable access to the global financial system through blockchain technology. SDF provides grants, investments, funding, and other awards to builders and organizations. SDF also develops resources and tooling on the Stellar network to help unlock real world utility. As a nonprofit foundation, SDF puts the health of the Stellar network and the Stellar ecosystem and its mission above all else.

We look forward to hearing from you!

Privacy Policy

By submitting your application, you are agreeing to our use and processing of your data in accordance with our Privacy Policy.

SDF is committed to diversity in its workforce and is proud to be an equal opportunity employer. SDF does not make hiring or employment decisions on the basis of race, color, religion, creed, gender, national origin, age, disability, veteran status, marital status, pregnancy, sex, gender expression or identity, sexual orientation, citizenship, or any other basis protected by applicable local, state or federal law.
