Talent.com
Staff Engineer - Observability & Performance
Staff Engineer - Observability & PerformanceFastly • New York City, NY
No longer accepting applications
Staff Engineer - Observability & Performance

Staff Engineer - Observability & Performance

Fastly • New York City, NY
30+ days ago
Job type
  • Full-time
Job description

What You'll Do

:

  • This role is approximately 50% Cross-functional Operations, 40% Data Analysis / Traffic Insights, and 10% Site Reliability Engineering, balancing technical expertise with collaboration and strategic impact.
  • Drive the development of automation and observability tooling that improves operational efficiency and platform reliability, including traffic monitoring, alerting, and surveillance tools.
  • Partner with observability teams to implement and improve existing dashboards (Grafana, Prometheus) and metrics pipelines that provide meaningful visibility into traffic patterns, surges, and seasonal trends.
  • Help define SLIs/SLOs, and improve monitoring frameworks, ensuring alerts and dashboards reflect operational reality and proactively surface issues before customer impact.
  • Collaborate with data/analytics teams to leverage data pipelines (e.g., SQL, BigQuery or other large-scale data stores) for trend analysis, capacity planning, traffic pattern recognition
  • Step in to run daily operational standups or coordination meetings as needed. Ensuring priorities are clear, follow ups are tracked, and cross functional execution maintains momentum.
  • Facilitate cross-team communication during high-impact initiatives or incident reviews, surfacing blockers early and maintaining execution momentum
  • Assist in root-cause investigations of performance, scalability or traffic anomalies, translate learnings into improvements in tooling and architecture
  • Act as a technical liaison, helping contextualize traffic behavior, system performance, and support escalations with clear insight
  • Help define and evolve run-books, incident response processes, post-mortems, knowledge base, ensuring that repeated issues are proactively surfaced and addressed via automation or tooling rather than reactive firefighting
  • Monitor seasonal patterns, major events, and global traffic distribution, helping ensure the platform remains resilient during shifts in demand

What We're Looking For:

  • 8+ years of progressive technical experience in Content Delivery Networks (CDN), Streaming Media, Security, or other high-volume internet traffic environments.
  • Deep understanding of network/distributed/cloud systems: TCP/IP, DNS, HTTP/S, TLS, caching/proxy/CDN technologies; direct experience in CDN, Web Application and API Security a plus
  • Demonstrated ability to build automation, tooling, and observability systems: e.g., dashboards, alerts, instrumentation, data pipelines. Experience with Prometheus, Grafana, BigQuery/SQL, etc
  • Hands-on experience with scripting or programming (e.g., Python, Go, Shell) and comfortable building tooling rather than just consuming.
  • Experience working cross-functionally with engineering, infrastructure, operations, analytics, and customer/account teams. Strong communication skills, ability to translate technical findings to non-technical stakeholders.
  • Demonstrated ability to coordinate complex technical work across multiple teams, facilitate daily standups or working sessions, and maintain operational momentum in complex, fast-moving environments.
  • Proven track record of driving mission-critical reliability and performance improvements in production systems. Strong sense of ownership and accountability
  • Experience with monitoring/alerting systems and incident response. Bonus for experience with live streaming, high-variability traffic, or global seasonality at scale.

We’ll be super impressed if you have experience in any of these:

  • Experience with large-scale data analytics systems (BigQuery, Spark, Presto) to derive operational insights from traffic telemetry
  • Familiarity with cloud platforms (AWS, GCP, Azure), infrastructure as code, or container orchestration (Terraform, Kubernetes)
  • Experience evaluating build‐vs‐buy decisions and driving platform wide tooling improvements
  • Background in media, live events, or streaming operations in a high throughput, latency sensitive environment a plus

Work Hours:

This position will require you to be available during core business hours and occasional nights and weekends as required for on call and incident response

Work Location(s) & Travel Requirements:

The preferred locations for this position are:

  • San Francisco, CA
  • New York, NY
  • Denver, CO

Fastly currently embraces a largely hybrid model for most roles which allows employees flexibility to split their time between the office and home.

There is a strong preference for Hybrid near a local office. However, we may be willing to consider exceptionally qualified remote candidates within the US.

This position will require travel as required by your role or requested by your manager.

SF / LA Fair Chance Ordinance Statement

Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Salary

The estimated salary range for this position is $195,720 to $234,864. Starting salary may vary based on permissible, non-discriminatory factors such as experience, skills, qualifications, and location.

This role may be eligible to participate in Fastly’s equity and discretionary bonus programs.

Benefits

We care about you. Fastly works hard to create a positive environment for our employees, and we think your life outside of work is important too. We support our teams with great benefits that start on the first day of your employment with Fastly. Curious about our offerings?

We offer a comprehensive benefits package including medical, dental, and vision insurance. Family planning, mental health support along with Employee Assistance Program, Insurance (Life, Disability, and Accident), a Flexible Vacation policy and up to 18 days of accrued paid sick leave are there to help support our employees. We also offer 401(k) (including company match) and an Employee Stock Purchase Program. For 2026, we offer 12 paid local holidays, 12 paid company wellness days.

Why Fastly?

  • We have a huge impact. Fastly is a small company with a big reach. Not only do have a tremendous user base, but we also support a growing number of. Outside of code, employees are encouraged to share causes close to their heart with others so we can help lend a supportive hand.

  • We love distributed teams. Fastly’s home-base is in San Francisco, but we have multiple offices and employees sprinkled around the globe. As a new hire, you will be able to attend our IN-PERSON new hire orientation in our San Francisco office! It is an exciting week-long experience that we offer to new employees to build connections with colleagues across Fastly, participate in hands-on learning opportunities, and immerse yourself in our culture firsthand.

  • We value diversity. Growing and maintaining our inclusive and diverse team matters to us. We are committed to being a company where our employees feel comfortable bringing their authentic selves to work and have the ability to be successful -- every day.

  • We are passionate. Fastly is chock full of passionate people and we’re not ‘one size fits all’. Fastly employs authors, pilots, skiers, parents (of humans and animals), makeup geeks, coffee connoisseurs, and more. We love employees for who they are and what they are passionate about.

Create a job alert for this search

Staff Engineer - Observability & Performance • New York City, NY

Similar jobs
Staff Software Engineer, Enterprise GenAI

Staff Software Engineer, Enterprise GenAI

Scale AI • New York, NY, United States
Full-time
Staff Software Engineer, Enterprise GenAI.Scale GP (Scale Generative AI Platform) is an enterprise-grade Generative AI platform that provides APIs for knowledge retrieval, inference, evaluation, an...Show more
Last updated: 30+ days ago • Promoted
Staff Software Engineer - Remote

Staff Software Engineer - Remote

TradeJobsWorkForce • 10704 Yonkers, NY, US
Remote
Full-time
Staff Software Engineer Remote Job Duties: • Implement and evolve a Data Lake storage system with low latency and high throughput for bulk data ingestion and query • Implement metadata, data govern...Show more
Last updated: 30+ days ago • Promoted
Senior Full-Stack Engineer

Senior Full-Stack Engineer

Mercury • New York, NY, United States
Full-time
San Francisco, CA, New York, NY, Portland, OR, or Remote within Canada or United States.Since the dawn of aviation, pilots have faced the constant challenge of air resistance, much like sailors nav...Show more
Last updated: 30+ days ago • Promoted
Senior Staff Engineer, AI‑Native Benefits Platform

Senior Staff Engineer, AI‑Native Benefits Platform

Gusto • New York, NY, United States
Full-time
A leading tech company in Seattle is looking for a Senior Staff Software Engineer to advance their benefits technology platform.In this role, you will architect scalable systems and implement AI-dr...Show more
Last updated: 30+ days ago • Promoted
Senior Staff Fullstack Engineer - Time Products

Senior Staff Fullstack Engineer - Time Products

Rippling • New York, New York, United States, 10007
Full-time
Quick Apply
Rippling gives businesses one place to run HR, IT, and Finance.It brings together all of the workforce systems that are normally scattered across a company, like payroll, expenses, benefits, and co...Show more
Last updated: 6 days ago
STATIONARY ENGINEER

STATIONARY ENGINEER

NYC Jobs • New York, NY, United States
Full-time
If you are hired provisionally in this title, you must take and pass the Civil Service Exam, when it becomes available, to be eligible for continued employment.The Department of Social Services (DS...Show more
Last updated: 12 days ago • Promoted
Senior / Staff Software Engineer

Senior / Staff Software Engineer

DRH Search • New York, NY, United States
Full-time
We're assisting a well-funded startup in the healthtech space with their search for Senior or Staff level engineers.Their product helps hospitals manage patient nutrition safely and efficiently.The...Show more
Last updated: 30+ days ago • Promoted
Staff AI Product Engineer, Code

Staff AI Product Engineer, Code

Semgrep • New York, NY, United States
Full-time
As an AI product engineer on Semgrep’s Code team, you’ll apply cutting-edge AI/ML tools from industry leaders to build user-facing security tools that help developers write and ship secure software...Show more
Last updated: 30+ days ago • Promoted
Senior Full-Stack Engineer

Senior Full-Stack Engineer

Uncountable Inc. • New York, NY, United States
Full-time
Thank you for your interest in Uncountable Engineering!.Uncountable is seeking experienced platform engineers who are passionate about user experience and scaling web applications.Our goal is to re...Show more
Last updated: 30+ days ago • Promoted
Staff Engineer

Staff Engineer

Flora © • New York, NY, United States
Full-time
FLORA is Cursor for Creatives: the first agentic creative tool built for creative professional teams to 10x their creative output by automating entire creative workflows.FLORA has all the best text...Show more
Last updated: 30+ days ago • Promoted
Staff Software Engineer, Platform

Staff Software Engineer, Platform

Scale • New York, NY, United States
Full-time
Software is eating the world, but AI is eating software.We live in unprecedented times – AI has the potential to exponentially augment human intelligence.Every person will have a personal tutor, co...Show more
Last updated: 30+ days ago • Promoted
Staff Site Reliability Engineer

Staff Site Reliability Engineer

Ro • New York, New York, US
Full-time
Job Description Job Description Ro is a direct-to-patient healthcare company with a mission of helping patients achieve their health goals by delivering the easiest, most effective care possible.Ro...Show more
Last updated: 14 days ago • Promoted
Staff Software Engineer, Machine Learning

Staff Software Engineer, Machine Learning

Headspace Sourcing • New York, NY, United States
Full-time
Staff Software Engineer, Machine Learning.About the Staff Software Engineer, Machine Learning at Headspace: Machine Learning at Headspace is a dynamic and innovative group whose mission is to impro...Show more
Last updated: 30+ days ago • Promoted
Staff Software Engineer,Fullstack

Staff Software Engineer,Fullstack

Headway • New York, NY, United States
Full-time
Headway’s mission is to build a new mental health care system everyone can access.We’ve built technology that helps people find great therapists with the first software-enabled national network of ...Show more
Last updated: 30+ days ago • Promoted
RF Engineer, Staff

RF Engineer, Staff

Lockheed Martin Corporation • Brooklyn, NY, United States
Full-time +1
Job ID: 709535BR Date posted: Feb.Description: At Lockheed Martin Rotary and Mission Systems, we are driven by innovation and integrity.We believe that by applying the highest standards of business...Show more
Last updated: 15 hours ago • Promoted • New!
Staff Site Reliability Engineer

Staff Site Reliability Engineer

Gradle Technologies • New York, New York, US
Full-time
Job Description Job Description Who We Are Develocity is a first-of-its-kind toolchain observability and acceleration platform that helps software teams adopt and improve DORA capabilities (includi...Show more
Last updated: 28 days ago • Promoted
Staff Site Reliability Engineer

Staff Site Reliability Engineer

Garner Health • New York, New York, US
Full-time
Job Description Job Description Garner's mission is to transform the healthcare economy, delivering high-quality and affordable care for all.We are fundamentally reimagining how healthcare works in...Show more
Last updated: 28 days ago • Promoted
Senior I&C Engineer - Kiewit Power Engineering

Senior I&C Engineer - Kiewit Power Engineering

Kiewit Corporation • Woodcliff Lake, NJ, United States
Full-time
Kiewit Power Engineering Group is seeking an experienced-level Instrumentation and Controls Engineer to be a part of our growing team.Looking for ideal candidates that can provide instrumentation &...Show more
Last updated: 9 days ago • Promoted