Talent.com
Senior Site Reliability Engineer
Senior Site Reliability EngineerStellar Development Foundation • New York, NY, United States
No longer accepting applications
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Stellar Development Foundation • New York, NY, United States
30+ days ago
Job type
  • Full-time
Job description

Interested in working on cutting-edge blockchain technology and creating equitable access to the global financial system? Since 2014, the mission-driven team at the Stellar Development Foundation (SDF) has helped fuel the tremendous growth of the Stellar blockchain network, an open-source platform that operates at high-scale today. Developers and companies around the world build on it, and the SDF team is expanding to support the rapidly growing and changing Stellar ecosystem.

SDF is looking for a Senior Site Reliability Engineer to help build and operate the foundation that powers our engineering teams. You’ll ensure the reliability and scalability of our systems, design and improve the infrastructure behind our production environments, and automate operational work so developers can focus on building great products.

In this role, you will:

  • Maintain, improve, scale and secure our AWS/GCP infrastructure and Linux systems.

  • Assist our development teams in running, packaging, deploying and troubleshooting applications

  • Work with developers on streamlining deployment processes with Jenkins and other CI/CD tooling.

  • Build, maintain, monitor and improve our Kubernetes clusters.

  • Work with development teams on migrating applications to Kubernetes.

  • Be responsible for maintenance and improvements to multiple internal services, for example Kubernetes, Prometheus, ELK.

  • Monitor, triage and respond to alerts in our high availability environments.

  • Participate in design and code reviews, and ensure that the foundation for our services is best in class.

  • Evaluate new technologies, design and implement as appropriate.

  • Identify automation opportunities and implement by creating custom or by using off the shelf solutions.

You have:

  • 5+ years of experience of working in cloud-based systems operations, as a SRE or DevOps engineer.

  • First-hand experience with configuration management and infrastructure as code (Ansible, Puppet, Terraform).

  • Proficient in utilizing SRE methodologies like capacity planning and disaster recovery testing to ensure the scalability, resilience, and availability of critical services.

  • A strong understanding of computer networking, TCP/UDP, load balancing, distributed computing, web services, and the fundamental protocols used by the internet (HTTP, HTTPS, DNS, etc.).

  • Experienced in managing production workloads and skilled in using monitoring tools to detect issues early.

  • Comfortable with participating in on-call rotations and conducting thorough root cause analyses to keep systems running smoothly.

  • Proficiency in at least one programming language.

  • Committed to supporting teammates, especially during challenging times, and excited about working in a close-knit, growing team. Approachable, empathetic, and proactive in promoting collaboration and innovation.

  • Excels in working independently, demonstrating the ability to accomplish tasks without constant monitoring.

  • Production experience building and maintaining Kubernetes clusters.

Bonus Points if:

  • Ability to understand Go, Rust, C++ and TypeScript source code

  • Experience experimenting with AI-driven approaches to operations

We offer competitive pay with a base salary range for this position of $165,000 - $225,000 depending on job-related knowledge, skills, experience, and location. In addition, we offer lumen-denominated grants along with the following perks and benefits:

USA Benefits/Perks:

  • Competitive health, dental & vision coverage with most plans covered at 100% for the employee + any dependents

  • Flexible time off + 15 company holidays including a company-wide holiday break

  • Up to 12 weeks of paid parental leave for both non-birthing and birthing parents, as well as up to 14 weeks of paid pregnancy leave for birthing parents

  • Gym reimbursement ($80 per month)

  • Life & ADD (up to $50K)

  • Short & Long term disability

  • 401K with 4% match

  • Health & Dependent Care FSA Accounts

  • Commuter benefits with $250/month employer contribution

  • Health Savings Account (HSA) with monthly employer contribution

  • Family building benefits through Kindbody

  • Wellbeing benefits (One Medical, Rightway, Headspace)

  • L&D budget of $1,500/year

  • Daily lunch and snacks in office

  • Company retreats

About Stellar

Stellar is more than a blockchain. Powered by a decentralized, fast, scalable, and uniquely sustainable network made for financial products and services and a thriving and passionate ecosystem that includes a non-profit organization driven by a mission, Stellar is paving the path to unlock the world’s economic potential through blockchain technology. Built with speed and low costs in mind, the Stellar network provides builders and financial institutions worldwide a platform to issue assets, and to send and convert currencies in real time creating real world utility. Founded in 2014, the Stellar Development Foundation (SDF) supports the continued development and growth of the Stellar network and also serves the ecosystem of NGOs, corporations, universities, small businesses, governments, and solo entrepreneurs building on the Stellar network through tooling, funding and strategic collaborations. Together, Stellar is where blockchain meets the real world.

About the Stellar Development Foundation

The Stellar Development Foundation (SDF) is a non-profit organization focused on working with and supporting change-makers to create equitable access to the global financial system through blockchain technology. SDF provides grants, investments, funding, and other awards to builders and organizations. SDF also develops resources and tooling on the Stellar network to help unlock real world utility. As a nonprofit foundation, SDF puts the health of the Stellar network and the Stellar ecosystem and its mission above all else.

We look forward to hearing from you!

Privacy Policy

By submitting your application, you are agreeing to our use and processing of your data in accordance with our .

SDF is committed to diversity in its workforce and is proud to be an equal opportunity employer. SDF does not make hiring or employment decisions on the basis of race, color, religion, creed, gender, national origin, age, disability, veteran status, marital status, pregnancy, sex, gender expression or identity, sexual orientation, citizenship, or any other basis protected by applicable local, state or federal law.

Create a job alert for this search

Senior Site Reliability Engineer • New York, NY, United States

Similar jobs

Senior DevOps and Site Reliability Engineer, remote

CherreNew York City, NY, United States
Remote
Full-time

Cherre is the real estate industry's leading data management platform, powering more than $3 trillion AUM globally.Our end-to-end platform helps clients connect, transform, analyze, and act on trus...Show more

 • Promoted

Senior Site Civil Engineer

HDRWoodcliff Lake, NJ, United States
Full-time +1

At HDR, our employee-owners are fully engaged in creating a welcoming environment where each of us is valued and respected, a place where everyone is empowered to bring their authentic selves and n...Show more

 • Promoted

Reliability Engineer

Mini-CircuitsNew York, NY, United States
Full-time

Mini-Circuits designs, manufactures and distributes integrated circuits, modules, and sub-systems for high-performance radio frequency (RF) and microwave applications.With design, sales and manufac...Show more

 • Promoted

DevOps & Site Reliability Engineer - AWS / Terraform / Laravel - Remote

SportsRecruitsNew York City, NY, United States
Remote
Full-time

OverviewDevOps / Site Reliability Engineer (Remote)Location :Remote (US-based)Reports to :CTO, SportsRecruitsAbout SportsRecruitsSportsRecruits is the leading sports recruiting network, connecting ...Show more

 • Promoted

Lead Site Reliability Engineer

JPMorgan Chase Bank, N.A.New York, NY, United States
Full-time

As a Site Reliability Engineering at JPMorgan Chase within the Enterprise technology, liquidity risk team, you are the non-functional requirement owner and champion for the applications in your rem...Show more

 • Promoted

Team Lead, Site Reliability Engineering - Storage Layer Service

MongoDBNew York, NY, United States
Full-time

MongoDB's Storage Layer Services (SLS) team is re-architecting the MongoDB cloud storage layer and sits at the heart of our next-generation cloud storage architecture.This relatively new team is bu...Show more

 • Promoted

Technical Support Engineer/Site Reliability Engineer (experienced) - New York, NY

FDM GroupNew York, NY, United States
Full-time

This position requires the successful candidate to work on a W2 directly with FDM.We cannot accept C2C, 1099 or employment sponsorship (e.FDM is a global business and technology consultancy deliver...Show more

 • Promoted

Senior Engineer - Commodity Index Platform Lead

Bank of AmericaNew York, NY, United States
Full-time

Senior Engineer - Commodity Index Platform Lead.To proceed with your application, you must be at least 18 years of age.Lateral-US/job/New-York/Senior-Engineer---Commodity-Index-Platform-Lead_260071...Show more

 • Promoted

Site Reliability Engineer, Commodities Technology

Point72New York, NY, United States
Full-time

Site Reliability Engineer, Commodities Technology.A Career with point72's technology team.As Point72 reimagines the future of investing, our Technology group is constantly improving our company's I...Show more

 • Promoted

Product Reliability Engineer - Defense

Palantir TechnologiesNew York, NY, United States
Permanent

Palantir builds the world's leading software for data-driven decisions and operations.By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving ...Show more

 • Promoted

Site Reliability Engineer - (Linux & Python/Go)

Elliot PartnershipNew York, NY, United States
Full-time

Site Reliability Engineer - (Linux & Python/Go).New York, NY (Hybrid, 3 days in office).Highly competitive compensation package.Join an elite technology and research group at the forefront of globa...Show more

 • Promoted

Senior Forward Deployed Engineer

Recruiting from ScratchNew York, NY, United States
Full-time

Who is Recruiting from Scratch:.Recruiting from Scratch is a talent firm that focuses on placing the best candidate for our clients.Our team is 100% remote and we work with teams across North Ameri...Show more

 • Promoted

Site Reliability Engineer

PicoNew York, NY, United States
Full-time

Pico fuels the global capital markets community by providing exceptional market data services and customized managed infrastructure solutions.As financial industry experts at the center of markets ...Show more

 • Promoted

Senior DevOps / Site Reliability Engineer (Terraform)

Purple DriveNew York, NY, United States
Full-time

Job Title: Senior DevOps / Site Reliability Engineer (Terraform).We are seeking a highly skilled DevOps / Site Reliability Engineer (SRE) to join our team in New York.The ideal candidate will have ...Show more

 • Promoted

AVP, Reliability Engineer

Synchrony FinancialNEW YORK, New York, United States
Full-time

The AVP, Reliability Engineer craves working in a hands-on system design and architecture environment, and leads by example to make sure time sensitive projects get done on time and to specificatio...Show more

 • Promoted

Platform Reliability Engineer

TWG Global AINew York, NY, United States
Full-time

At TWG Group Holdings, LLC (“TWG Global”), we drive innovation and business transformation across a range of industries—including financial services, insurance, technology, media, and sports—by lev...Show more

 • Promoted

Lead Site Reliability Engineer (Remote)

LivepeerNew York City, NY, United States
Remote
Full-time

Location :RemoteHours :North America working hoursAbout LivepeerLivepeer is on a mission to build the world's open video infrastructure.Founded in 2017, it is the world's first open-source protocol...Show more

 • Promoted

Bilingual Mandarin Site Reliability Engineer - SRE

ComriseNew York, NY, United States
Full-time

Site Reliability Engineer(SRE).Global Architecture & Disaster Recovery.Participate in the design and implementation of the company's global infrastructure architecture.Responsible for cross-region ...Show more