Search jobs > Los Angeles, CA > Senior site reliability

Senior Site Reliability Engineer - Edge Delivery

Fastly
Los Angeles, CA, United States
$167.8K-$209.7K a year
Full-time

Fastly helps people stay better connected with the things they love. Fastly's edge cloud platform enables customers to create great digital experiences quickly, securely, and reliably by processing, serving, and securing our customers' applications as close to their end-users as possible - at the edge of the Internet.

The platform is designed to take advantage of the modern internet, to be programmable, and to support agile software development.

Fastly's customers include many of the world's most prominent companies, including Vimeo, Pinterest, The New York Times, and GitHub.

We're building a more trustworthy Internet. Come join us.

We are looking for a Senior SRE who is passionate about building, scaling, and automating our internal platforms and providing global visibility to the health and performance of our hosting infrastructure.

You will be working alongside other engineering and support teams to provide insights and tooling to make our services more observable as well as assisting in designing automated build and test infrastructure.

Additionally, you will assist in troubleshooting and debugging production services as well as collaborating with engineering and support teams to improve documentation and debugging tools.

What You'll Do :

  • Assist operations and customer support teams in diagnosing and debugging production services
  • Analyze and improve telemetry collection, instrumentation, monitoring systems, and dashboards
  • Participate in incident reviews to build improved alerts for both detection and potential proactive mitigations
  • Improve operational processes (including builds, automated tests, and deployments) with the goal of making them as uneventful as possible
  • Identify gaps in testing and debugging tools and assist in their development and improvement

What We're Looking For :

  • In order to be successful in this role, you must have 5 years of experience operating highly available distributed systems and supporting mission critical infrastructure
  • You have a minimum of 2 years of experience with build and packaging tools, CI systems (either Jenkins or Github Actions), and release engineering practices
  • You must have at least 2 years of experience with tools like Grafana, Prometheus, Splunk or BigQuery
  • You not only know how to write shell scripts, but know when it's time to replace those shell scripts with something more complex
  • You have at least 1 year of software development experience with Rust, Go, Python or another programming language
  • You can remain calm under pressure while coordinating incident response and mitigation efforts

We'll be super impressed if you have experience in any of these :

  • Understanding of the open source software community and the contribution process
  • Background in technical writing and desire to document and share knowledge
  • Knowledge of HTTP protocol internals, with bonus points if you can spell "referrer" correctly

Work Hours :

  • This position will require you to be available during core business hours.
  • You will participate in a on-call rotation to support platform availability.

Work Locations & Travel Requirements :

This position is open to the following preferred office locations :

  • San Francisco, CA
  • New York City, NY
  • Denver, CO
  • Los Angeles, CA

Fastly currently embraces a largely hybrid model for most roles which allows employees flexibility to split their time between the office and home.

Salary :

The estimated salary range for this position is $167,790 to $209,740.

Starting salary may vary based on permissible, non-discriminatory factors such as experience, skills, qualifications, and location.

This role may be eligible to participate in Fastly's equity and discretionary bonus programs.

Benefits :

We care about you. Fastly works hard to create a positive environment for our employees, and we think your life outside of work is important too.

We support our teams with great benefits that start on the first day of your employment with Fastly. Curious about our offerings?

We offer a comprehensive benefits package including medical, dental, and vision insurance. Family planning, mental health support along with Employee Assistance Program, Insurance (Life, Disability, and Accident), a Flexible Vacation policy and up to 18 days of accrued paid sick leave are there to help support our employees.

We also offer 401(k) (including company match) and an Employee Stock Purchase Program. For 2024, we offer 10 paid local holidays, 11 paid company wellness days.

Why Fastly?

We have a huge impact. Fastly is a small company with a big reach. Not only do our customers have a tremendous user base, but we also support a growing number of open source projects and initiatives.

Outside of code, employees are encouraged to share causes close to their heart with others so we can help lend a supportive hand.

We love distributed teams. Fastly's home-base is in San Francisco, but we have multiple offices and employees sprinkled around the globe.

As a new hire, you will be able to attend our IN-PERSON new hire orientation in our San Francisco office! It is an exciting week-long experience that we offer to new employees to build connections with colleagues across Fastly, participate in hands-on learning opportunities, and immerse yourself in our culture firsthand.

  • We value diversity. Growing and maintaining our inclusive and diverse team matters to us. We are committed to being a company where our employees feel comfortable bringing their authentic selves to work and have the ability to be successfulevery day.
  • We are passionate. Fastly is chock full of passionate people and we're not 'one size fits all'. Fastly employs authors, pilots, skiers, parents (of humans and animals), makeup geeks, coffee connoisseurs, and more.

We love employees for who they are and what they are passionate about.

We're always looking for humble, sharp, and creative folks to join the Fastly team. If you think you might be a fit please apply! A fully completed application and resume or CV are required when applying.

Fastly is committed to ensuring equal employment opportunity and to providing employees with a safe and welcoming work environment free of discrimination and harassment.

Our employment decisions are based on business needs, job requirements and individual qualifications. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, family or parental status, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances.

Consistent with the Americans with Disabilities Act (ADA) and federal or state disability laws, Fastly will provide reasonable accommodations for applicants and employees with disabilities.

If reasonable accommodation is needed to participate in the job application or interview process, to perform essential job functions, and / or to receive other benefits and privileges of employment, please contact your Recruiter, or the Fastly Employee Relations team at [email protected] or 501-287-4901.

Fastly collects and processes personal data submitted by job applicants in accordance with our Privacy Policy. Please see our privacy notice for job applicants.

16 days ago
Related jobs
Promoted
VirtualVocations
Burbank, California

A company is looking for a Senior Site Reliability Engineer. ...

Promoted
Robert Half
CA, United States
Remote

Immediate hiring for a Sr Site Reliability Engineer for a fully remote opportunity to join a growing company operating in a new market category -they are helping customers globally with digitalizing their data with high portable trust assurance. Strong knowledge in at least one (scripting-preferrabl...

Promoted
SpaceX
Hawthorne, California

SITE RELIABILITY ENGINEER), DATA (APPLICATION SOFTWARE). Aerospace experience is not required to be successful here - rather we look for smart, motivated, collaborative site reliability engineers who love solving problems and want to make an impact on a super inspiring mission. Bachelor's degree in ...

Promoted
VirtualVocations
Bell Gardens, California

A company is looking for a Site Reliability Engineer in Developer Enablement. ...

Promoted
SpaceX
Hawthorne, California

Bachelor's degree in computer science, information systems/IT, or an engineering discipline; OR 2+ years of professional experience in software, DevOps, or site reliability engineering in lieu of a degree. As a Site Reliability Engineer, you will design, develop, and test key aspects of an in-house ...

Promoted
VirtualVocations
Inglewood, California

A company is looking for a Site Reliability Engineer - Observability in the United States. ...

Tencent
California, US
Remote

Are you passionate about gaming and skilled in managing distributed online systems? Uncapped Games is looking for a Site Reliability Engineer like you! Join us in our quest to revolutionize the Real-Time Strategy (RTS) genre with our groundbreaking new game. Knowledge of tools such as Jenkins, Argo ...

SpaceX
Hawthorne, California

As a Senior Site Reliability Engineer, you will architect, develop, and test key aspects of the infrastructure for an in-house solution for analysis, simulation, prototyping, and operation of software in support of all SpaceX flight systems. Bachelor’s degree in computer science, information systems...

GEICO
Los Angeles, California

Our Senior Manager is an engineering leader who works with the engineering staff to innovate and build new engineering solutions, improveand enhance existing solutions as well as leverage engineering solutions to solve critical operational problems. Senior Manager, Site Reliability Engineering – Dat...

NBCUniversal
Universal City, California

Establish and reinforce best software engineering practices – including: code quality, continuous delivery, and automated testing. Partner with other managerial leads involving cloud, networking, product and engineering stakeholders to proactively identify operational needs and deliver solutions. Ba...