Talent.com
Staff Site Reliability Engineer

VirtualVocations • San Jose, California, United States
30+ days ago
Job type
  • Full-time
Job description

A company is looking for a Staff Site Reliability Engineer.

Key Responsibilities

Define and drive the strategic direction for SRE practices and reliability engineering

Architect and implement complex systems and solutions for scalability and reliability

Lead major incident response efforts and postmortem analyses to improve system resilience

Required Qualifications

8+ years of experience as an SRE in AWS environments within medium to large-scale organizations

8+ years of hands-on experience with observability tools like Prometheus and Grafana

Exceptional proficiency in programming, particularly in Python, Go, and Bash

5+ years of experience in designing and building infrastructure deployment pipelines using tools like Terraform and Git

Advanced expertise in managing production environments in AWS and deep knowledge of Linux systems


Related jobs
Senior Engineer, Site Reliability

VirtualVocations • San Francisco, California, United States
Full-time
A company is looking for a Senior Engineer in Site Reliability Engineering for Digital Banking. Key Responsibilities: Ensure the reliability, availability, and performance of applications in produc...
Last updated: 3 days ago
Site Reliability Engineer - SRE at Descope Los Altos, CA

Itlearn360 • Los Altos, CA, United States
Full-time
Site Reliability Engineer - SRE job at Descope. Descope R&D group is a skilled team of developers with a unique DNA of creativity, flexibility, an open mindset. We are looking for a passionate SRE to jo...
Last updated: 30+ days ago
Site Reliability Engineer

DTEX Systems • Fremont, CA, US
Full-time
We are excited that you’ve taken the time to explore our business and potentially join us on this incredible journey. We are already the leader in the Insider Risk Management, but our story do...
Last updated: 30+ days ago
Staff Site Reliability Engineer - Managed AI

ZipRecruiter • San Francisco, CA, United States
Full-time
Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI...
Last updated: 11 days ago
Senior / Staff Site Reliability Engineer, Storage

Fluidstack • San Francisco, CA, United States
Full-time
Fluidstack is building GPU supercomputers for top AI labs, governments, and enterprises. Our customers include Mistral, Poolside, Black Forest Labs, Meta, and more. Our team is small, highly motivate...
Last updated: 30+ days ago
Site Reliability Engineer

Zapier • San Francisco, CA, United States
Full-time
We're humans who simply think computers should do more work. At Zapier, we’re not just making software; we’re building a platform to help millions of businesses globally scale with automation and AI....
Last updated: 7 days ago
Staff Site Reliability Engineer, Storage

Epoch Biodesign • San Francisco, CA, United States
Full-time
Crusoe is building the World’s Favorite AI-first Cloud infrastructure company. We’re pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to p...
Last updated: 30+ days ago
Senior Site Reliability Engineer

Rollbar, Inc. • San Francisco, CA, United States
Full-time
Wikimedia Foundation is hiring a Senior Site Reliability Engineer (SRE) to join our Service Operations SRE team, where we take care of the infrastructure that runs Wikipedia. The SRE team at Wikimed...
Last updated: 30+ days ago
Staff Reliability Engineer

SPAN • San Francisco, CA, United States
Full-time
SPAN is enabling electrification for all. We are a mission-driven company designing, building, and depl...
Last updated: 30+ days ago
Site Reliability Engineer

PsiQuantum • Palo Alto, CA, United States
Full-time
Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a. PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...
Last updated: 30+ days ago
Senior / Staff Site Reliability Engineer, Compute

Fluidstack • San Francisco, CA, United States
Full-time
Fluidstack is building GPU supercomputers for top AI labs, governments, and enterprises. Our customers include Mistral, Poolside, Black Forest Labs, Meta, and more. Our team is small, highly motivate...
Last updated: 30+ days ago
Site Reliability Engineer - Technical Lead

ZipRecruiter • San Francisco, CA, United States
Full-time
Veryon is a leading software and technology company that enables aviation teams around the world to improve efficiency and safety. Our products maximize uptime for aircraft maintenance teams through...
Last updated: 12 days ago
Site Reliability Engineer

Foxconn Industrial Internet - FII • San Jose, CA, US
Full-time +1
Foxconn Industrial Internet (Fii) is a world-leading professional design and manufacturing service provider of communication network equipment, cloud service equipment, p...
Last updated: 30+ days ago
Site Reliability Engineer

Fortinet • Sunnyvale, California, United States
Full-time
At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers. Our team members enjoy solving complex problems, and obsess...
Last updated: 30+ days ago
Site Reliability Engineer (SRE) - grok.com & API

Pantera Capital • Palo Alto, CA, United States
Full-time
AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...
Last updated: 10 days ago
Site Reliability Engineer

Canonical • San Francisco, CA, United States
Full-time
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiat...
Last updated: 30+ days ago
Site Reliability Engineer

Writemed • San Francisco, CA, United States
Full-time
Would you like to join one of the fastest-growing organizations with a goal of using the latest AI, GenAI, LLM, Cloud, and Digital Technologies to advance drug development and improve patient care...
Last updated: 30+ days ago
Site Reliability Engineer (SRE)

Air Apps • San Francisco, CA, United States
Full-time
At Air Apps, we believe in thinking bigger and moving faster. We’re a family-founded company on a mission to create the world’s first AI-powered Personal & Entrepreneurial Resource Planner (PRP), an...
Last updated: 30+ days ago
Site Reliability Engineer II

Pinterest • San Francisco, CA, United States
Full-time
Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...
Last updated: 10 days ago
Site Reliability Engineer (SRE)

Baseten • San Francisco, CA, United States
Full-time
Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and Zed. By uniting a...
Last updated: 30+ days ago