Talent.com
Staff Site Reliability Engineer
Staff Site Reliability EngineerVeeam • San Francisco, CA, United States
Staff Site Reliability Engineer

Staff Site Reliability Engineer

Veeam • San Francisco, CA, United States
17 days ago
Job type
  • Full-time
Job description

Veeam, the #1 global market leader in data resilience, believes businesses should control all their data whenever and wherever they need it. Veeam provides data resilience through data backup, data recovery, data portability, data security, and data intelligence. Based in Seattle, Veeam protects over 550,000 customers worldwide who trust Veeam to keep their businesses running. Join us as we move forward together, growing, learning, and making a real impact for some of the world’s biggest brands. The future of data resilience is here – go fearlessly forward with us.

About the Role

Veeam is launching a global Site Reliability Engineering (SRE) function to support the rollout and operation of our new SaaS offering : the Veeam Data Cloud. As a Staff Site Reliability Engineer , you will serve as a hands‑on technical leader within the SRE team, guiding senior engineers, influencing product development teams, and ensuring the systems we operate are built to be reliable, scalable, and observable from the ground up. You will drive strategic initiatives, mentor others in the practice of SRE, and help define architectural best practices across our platform. This role is pivotal in aligning teams, enforcing high standards, and scaling SRE principles globally within Veeam.

Reliability Engineering & Resilience

  • Act as a technical authority in your area, mentoring senior engineers and guiding design choices that improve service reliability and resilience.
  • Lead the definition and enforcement of SLIs, SLOs, and error budgets; drive adherence across engineering teams.
  • Collaborate with Staff peers across teams to align strategy and champion shared reliability standards and goals.
  • Partner with development and product teams to proactively design for failure, build resilient architecture, and operationalize reliability from the start.

Observability & Operational Excellence

  • Drive company‑wide adoption of observability best practices and tooling.
  • Ensure metrics, logs, and traces provide deep, actionable insights across systems.
  • Lead complex incident responses, post‑mortems, and systemic reliability improvements.
  • Promote and enforce a blameless culture of learning and continuous improvement.
  • Engineering at Scale

  • Lead initiatives in infrastructure as code, deployment automation, and resilience testing.
  • Influence the development and adoption of chaos engineering practices and release validation frameworks.
  • Partner with platform and security teams to ensure production readiness.
  • Collaboration & Culture

  • Work closely with your peer Staff Engineers to plan, align, and deliver against reliability goals.
  • Provide architectural guidance and advocate for engineering rigor and consistency.
  • Represent the SRE team in technical leadership forums and product planning discussions.
  • What We Are Looking For

    Required

  • 8+ years of experience in a Software Engineering or SRE role, including technical leadership.
  • Demonstrated experience mentoring and guiding senior engineers.
  • Deep expertise in building distributed systems on public cloud (Azure preferred).
  • Strong skills in programming (e.g., JS, Go, Typescript, Java, or C#).
  • Hands‑on experience with observability tooling (e.g., Prometheus, Grafana, OpenTelemetry).
  • Mastery of infrastructure automation tools (Terraform, Pulumi) and container orchestration (Kubernetes).
  • Ability to communicate clearly across geographies and disciplines.
  • Preferred

  • Experience leading SRE initiatives across multiple product teams.
  • Background in chaos engineering, incident learning, or performance and load testing.
  • Familiarity with global compliance standards (ISO, SOC 2, GDPR, FedRAMP, CMMC).
  • Why Join Veeam?

  • Be a core architect in the rollout of Veeam’s first global SaaS offering – the Veeam Data Cloud.
  • Help shape a modern, engineering‑driven SRE practice from the ground up.
  • Influence long‑term reliability and architecture across a global product portfolio.
  • Work in a collaborative environment with engineering leaders who value strategic thinking, hands‑on problem solving, and customer empathy.
  • Enjoy competitive pay and benefits, flexible work arrangements, and a team culture built on learning, ownership, and impact.
  • Benefits

  • Unlimited PTO
  • Paid Holidays
  • Veeam Care Days : 24 hours paid time for volunteering
  • Medical, dental, and vision coverage starting on day one (multiple plan options)
  • Flexible Spending Accounts (FSA) and Health Savings Account (HSA) options
  • Employer HSA contributions (for HDHP participants)
  • Life and AD&D insurance (employee, spouse / partner, and child options)
  • Company‑paid short‑term and long‑term disability insurance
  • Supplemental individual disability insurance (IDI)
  • Family planning support : fertility, adoption, surrogacy, and parental resources
  • Paid parental leave
  • Employee Assistance Program
  • Additional voluntary benefits : accident, critical illness, hospital indemnity, legal, identity theft protection, commuter benefits, pet care
  • Mental health support
  • 401(k) plan
  • Professional training and education, on‑demand learning libraries (LinkedIn Learning, O’Reilly), mentoring, workshops, and Global Day of Learning
  • Compensation Transparency

    Veeam is committed to pay transparency and equitable compensation. For this role, the compensation range below reflects the expected total target compensation (TTC), inclusive of base pay and a competitive performance‑based bonus. For roles with a commission plan, the compensation range represents On Target Earnings (OTE), which includes base salary plus variable commission. Offers are typically made below the midpoint of the range.

  • Zone 1 : San Francisco Bay Area, New York City Boroughs – $293,100 — $544,200 USD
  • Zone 2 : Washington, California (excluding San Francisco Bay Area) – $268,600 — $498,900 USD
  • Zone 3 : Texas, Illinois, North Carolina, Colorado, Massachusetts, Pennsylvania, Virginia, Oregon, Nevada, Hawaii, New York (excluding NYC boroughs); Sales roles located in Georgia, Ohio, and Arizona – $244,200 — $453,500 USD
  • Zone 4 : All other US locations – $212,500 — $394,600 USD
  • Veeam Software is an equal opportunity employer and does not tolerate discrimination in any form on the basis of race, color, religion, gender, age, national origin, citizenship, disability, veteran status or any other classification protected by federal, state or local law. All your information will be kept confidential.

    Please note that any personal data collected from you during the recruitment process will be processed in accordance with our Recruiting Privacy Notice.

    The Privacy Notice sets out the basis on which the personal data collected from you, or that you provide to us, will be processed by us in connection with our recruitment processes.

    By submitting your application, you acknowledge that the information provided in your job application and any supporting documents is complete and accurate to the best of your knowledge. Any misrepresentation, omission, or falsification of information may result in disqualification from consideration for employment or, if discovered after employment begins, termination of employment.

    #J-18808-Ljbffr

    Create a job alert for this search

    Staff Site Reliability Engineer • San Francisco, CA, United States

    Similar jobs
    Senior Staff Site Reliability Engineer

    Senior Staff Site Reliability Engineer

    WEX • San Francisco, CA, United States
    Full-time
    We are looking for a highly motivated and high-potential Senior Staff Site Reliability Engineer (SRE) to join our team as a senior technical leader, driving transformational change and delivering s...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer - Platform

    Site Reliability Engineer - Platform

    CodeRabbit • San Francisco, CA, United States
    Full-time
    CodeRabbit is an innovative research and development company focused on building extraordinarily productive human‑machine collaboration systems. Our primary goal is to create the next generation of ...Show more
    Last updated: 16 days ago • Promoted
    Senior Staff Site Reliability Engineer

    Senior Staff Site Reliability Engineer

    WEX, Inc. • San Francisco, CA, United States
    Full-time
    We are looking for a highly motivated and high-potential Senior Staff Site Reliability Engineer (SRE) to join our team as a senior technical leader, driving transformational change and delivering s...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    gamma.app • San Francisco, CA, United States
    Full-time
    We're building the creative layer for modern communication.Every month, over a billion people make presentations — but the tools they use to make them haven't evolved in decades.We're changing that...Show more
    Last updated: 30+ days ago • Promoted
    Sr / Staff Site Reliability Engineer, Consumer Apps Chicago, IL; Redwood City, CA

    Sr / Staff Site Reliability Engineer, Consumer Apps Chicago, IL; Redwood City, CA

    Attaindata • Redwood City, CA, United States
    Full-time
    Sr / Staff Site Reliability Engineer, Consumer Apps.Klover’s engineering team powers one of the fastest-growing fintech platforms in the U. Our systems process and move more than $1.As part of this te...Show more
    Last updated: 2 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Attain • Redwood City, CA, United States
    Full-time
    Built for consumers and companies, alike.In a world driven by data, we believe consumers and businesses can coexist.Our founders had a vision to empower consumers to leverage their greatest asset—t...Show more
    Last updated: 11 days ago • Promoted
    Software Engineer, Site Reliability

    Software Engineer, Site Reliability

    Roblox • San Mateo, California, United States
    Full-time
    You’ll collaborate with cross-functional teams to build robust infrastructure that supports our growth.If you have a track record of solving complex technical challenges, we want to hear from you.J...Show more
    Last updated: 13 days ago • Promoted
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    Redwood Materials, Inc. • San Francisco, CA, United States
    Full-time
    Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling—keeping critical minerals in circulation and driving the energy transition.Founded in 2...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Mercor • San Francisco, CA, United States
    Full-time
    Mercor is at the intersection of labor markets and AI research.We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development.Our vast talent network ...Show more
    Last updated: 26 days ago • Promoted
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    Bigeye • San Francisco, CA, United States
    Full-time
    We build trusted tools that enable enterprises to move fast with confidence in their data and AI - combining early signal data observability, clear lineage, and an AI trust platform that governs ac...Show more
    Last updated: 4 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Alembic Technologies • San Francisco, CA, United States
    Full-time
    Senior Site Reliability Engineer.This range is provided by Alembic Technologies.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.We’re looking fo...Show more
    Last updated: 30+ days ago • Promoted
    Staff Site Reliability Engineer, Compute

    Staff Site Reliability Engineer, Compute

    Crusoe • San Francisco, CA, United States
    Full-time
    Crusoe is building the World’s Favorite AI-first Cloud infrastructure company.We’re pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to p...Show more
    Last updated: 30+ days ago • Promoted
    Staff Software Engineer, Site Reliability Engineer (SRE)

    Staff Software Engineer, Site Reliability Engineer (SRE)

    Harvey • San Francisco, California, United States
    Full-time
    Harvey is a secure AI platform for legal and professional services that augments productivity and automates complex workflows. Harvey uses algorithms with reasoning-adept LLMs that have been customi...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Fractal • San Francisco, CA, United States
    Full-time
    This range is provided by Fractal.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Fractal Analytics is a strategic AI partner to Fortune 500 com...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Mercor, Inc. • San Francisco, CA, United States
    Full-time
    Mercor is at the intersection of labor markets and AI research.We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development.Our vast talent network ...Show more
    Last updated: 27 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Happyrobot Inc. • San Francisco, CA, United States
    Full-time
    HappyRobot is the AI-native operating system for the real economy—a system that closes the circuit between intelligence and action. By combining real-time truth, specialized AI workers, and an orche...Show more
    Last updated: 25 days ago • Promoted
    Staff Site Reliability Engineer - Platform

    Staff Site Reliability Engineer - Platform

    Icon Ventures • San Francisco, CA, United States
    Full-time
    At Quizlet, our mission is to help every learner achieve their outcomes in the most effective and delightful way.Our $1B+ learning platform serves tens of millions of students every month, includin...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    The Voleon Group • Berkeley, CA, United States
    Full-time
    Voleon is a technology company that applies state‑of‑the‑art AI and machine learning techniques to real‑world problems in finance. For nearly two decades, we have led our industry and worked at the ...Show more
    Last updated: 17 days ago • Promoted