Talent.com
Staff Systems Reliability Engineer
VirtualVocations • Concord, California, United States
5 days ago
Job type
  • Full-time
Job description

A company is looking for a Staff Systems Reliability Engineer.

Key Responsibilities

Design and implement scalable, fault-tolerant AWS-based infrastructure

Develop and maintain CI/CD pipelines and automation tools for operations

Lead incident response efforts and mentor junior SREs

Required Qualifications

Minimum of 12 years of related experience with a Bachelor's degree, 8 years with a Master's degree, or 5 years with a PhD

Expert-level knowledge of AWS services and infrastructure automation tools

Strong proficiency in Python and/or Go for automation and tooling

Experience managing observability and alerting systems at scale

Familiarity with regulatory requirements related to infrastructure and DevOps

Related jobs
Staff Systems Engineer

Bio-Rad Laboratories • Pleasanton, CA, United States
Full-time
Working within Bio-Rad's Life Science R&D Group as a Systems Engineer, you will take engineering concepts, requirements and transform them into functional prototypes and finished products that impr...
Last updated: 10 days ago • Promoted
Site Reliability Engineer

Bits to Atoms • San Francisco, CA, United States
Full-time
Site Reliability Engineer (SRE). You’ll work at the intersection of infrastructure, AI/ML systems, and mission-critical physical operations. You’ll collaborate directly with engineering, AI, and oper...
Last updated: 30+ days ago • Promoted
Staff Software Engineer, Site Reliability

Asana • San Francisco, CA, United States
Full-time
Asana’s rapid growth brings new challenges in keeping our systems fast, reliable, and resilient. As our product evolves, we’re making a major investment in reliability – and building a brand new SRE...
Last updated: 15 hours ago • Promoted • New!
Staff Site Reliability Engineer

Crusoe • San Francisco, CA, United States
Full-time
Crusoe is building the World’s Favorite AI-first Cloud infrastructure company. We’re pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to p...
Last updated: 30+ days ago • Promoted
Site Reliability Engineer

Redwood Materials, Inc. • San Francisco, CA, United States
Full-time
Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling, keeping critical minerals in circulation and driving the energy transition. Founded in 2...
Last updated: 30+ days ago • Promoted
Site Reliability Engineer

Together AI • San Francisco, CA, United States
Full-time
As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a soft...
Last updated: 30+ days ago • Promoted
Staff Site Reliability Engineer - Managed AI

Crusoe • San Francisco, CA, United States
Full-time
At Crusoe, our Site Reliability Engineering team ensures the reliability and scalability of Crusoe’s AI-optimized cloud platform. We’re looking for an SRE with a strong background in distributed sys...
Last updated: 30+ days ago • Promoted
Site Reliability Engineer

Fractal • San Francisco, CA, United States
Full-time
This range is provided by Fractal. Your actual pay will be based on your skills and experience; talk with your recruiter to learn more. Fractal Analytics is a strategic AI partner to Fortune 500 com...
Last updated: 30+ days ago • Promoted
Site Reliability Engineer I

Prosper • San Francisco, CA, United States
Full-time
As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry-level position is desi...
Last updated: 5 days ago • Promoted
Staff Site Reliability Engineer (SRE) San Francisco, California

HeartFlow, Inc. • San Francisco, CA, United States
Full-time
Heartflow is a medical technology company advancing the diagnosis and management of coronary artery disease, the #1 cause of death worldwide, using cutting-edge technology. The flagship product, an A...
Last updated: 2 days ago • Promoted
Site Reliability Engineer

Primer • San Francisco, CA, United States
Full-time
Primer helps B2B products break out of the B2C-centric marketing box. Our platform turns consumer ad channels, data streams, and emerging AI workflows into measurable growth engines for go-to-market...
Last updated: 30+ days ago • Promoted
Founding Site Reliability Engineer

Assort Health • San Francisco, CA, United States
Full-time
Our mission is to make exceptional healthcare accessible anytime, anywhere, for everyone. At Assort Health, we believe healthcare should feel effortless and connected: quick answers, clear communic...
Last updated: 6 days ago • Promoted
Sr. Staff Engineer

Bio-Rad Laboratories • Pleasanton, CA, United States
Full-time
You'll drive the development of hardware products that directly impact healthcare innovation and improve lives worldwide. You'll collaborate cross-functionally to. Your expertise in electrical engine...
Last updated: 30+ days ago • Promoted
Staff Engineer

Bio-Rad Laboratories • Pleasanton, CA, United States
Full-time
As a Senior Electrical Engineer, you will play a critical role in designing, debugging, and supporting custom electronics solutions for cutting-edge life science research platforms. You'll drive the...
Last updated: 30+ days ago • Promoted
Site Reliability Engineer II

Hinge Health • San Francisco, CA, United States
Full-time
From scaling Kubernetes clusters to improving observability with Datadog, we build the tooling and automation that empower product teams to ship with confidence. Collaborate with engineering teams t...
Last updated: 30+ days ago • Promoted
Site Reliability Engineer II

Hinge Health • San Francisco, CA, United States
Full-time
Site Reliability Engineers at Hinge Health are infrastructure engineers with a strong sense of ownership over the systems that keep our platform running reliably, securely, and efficiently. From sca...
Last updated: 30+ days ago • Promoted
Staff Software Engineer

Bio-Rad Laboratories • Hercules, CA, United States
Full-time
This role is both technical and collaborative. You will work closely with cross-functional teams including systems engineers, mechanical designers, assay development scientists, and quality engineer...
Last updated: 30+ days ago • Promoted
Staff Site Reliability Engineer, Storage

Epoch Biodesign • San Francisco, CA, United States
Full-time
Crusoe is building the World’s Favorite AI-first Cloud infrastructure company. We’re pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to p...
Last updated: 30+ days ago • Promoted