Site Reliability Engineer L4, Netflix Technology Services

Netflix, Remote, United States
30+ days ago
Job type
  • Full-time
Job description

Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.

The N-Tech SRE (Site Reliability Engineering) team aims to improve the reliability and resilience of the many services that Netflix employees leverage daily. Collaborating with cross-functional teams, you will focus on implementing best practices, automation, and proactive measures to maintain and enhance the reliability of our systems. You will join a team of SREs working to improve the availability, reliability, and observability of internal Netflix services and reduce the burden of human toil with tooling and automation.

This role is rewarding for people who have a passion for leveraging technology to solve business problems. Our team is seeking individuals with a broad set of technical skills who have seen a lot of distributed systems break! If this excites you, we invite you to bring your unique career and life experiences to enrich the culture and diversity of our team.

RESPONSIBILITIES

Design, implement, and maintain scalable and reliable infrastructure to support our services.

Collaborate with engineering and product teams to integrate observability, reliability, and security considerations into the entire software development lifecycle.

Develop and implement automation tools for monitoring, deployment, and incident response to ensure efficient and reliable operations.

Conduct or participate in capacity planning, performance analysis, and system tuning to optimize system reliability.

Participate in on-call rotations and contribute to incident response, diagnosis, and resolution.

Implement and improve monitoring and alerting systems to proactively identify and address potential issues.

Implement and maintain robust disaster recovery and business continuity plans.

Continuously evaluate and recommend improvements to enhance system observability and reliability.

Proactively identify sources of instability in distributed systems and analyze how complex systems fail from a reliability and resilience perspective.

Engage with product teams to diagnose operational surprises and drive improvements.

Implement and maintain a robust incident response framework, including blame-aware incident reviews to learn from operational surprises.

Champion a growth mindset and continuous learning culture, encouraging proactive innovation and ongoing skill development.

SKILLS AND EXPERIENCE

3+ years of experience as a Site Reliability Engineer or in a similar role

Strong scripting and programming skills (Python, Go, Java or JavaScript / Node.js)

Experience with complex sociotechnical systems and their successful operations at scale

Experience with incident management and response

Experience with infrastructure-as-code tools such as Terraform and with container and orchestration tools such as Docker and Kubernetes

Experience with cloud platforms like AWS, microservices architecture, and enterprise software solutions like Slack & GSuite

Excellent communication & collaboration skills and a continuous improvement mindset

Proven ability to cultivate relationships through influence

Proven ability to troubleshoot complex issues and implement effective solutions

Familiarity with Human Factors Engineering

Ability to grow expertise, influence & educate others

Our compensation structure consists solely of an annual salary; we do not have bonuses. You choose each year how much of your compensation you want in salary versus stock options. To determine your personal top-of-market compensation, we rely on market indicators and consider your specific job family, background, skills, and experience to place you within the market range. The range for this role is $100,000 - $720,000.

Inclusion is a Netflix value and we strive to host a meaningful interview experience for all candidates. If you want an accommodation / adjustment for a disability or any other reason during the hiring process, please send a request to your recruiting partner.

We are an equal-opportunity employer and celebrate diversity, recognizing that diversity builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

Job is open for no less than 7 days and will be removed when the position is filled.
