Talent.com
Software Engineer, Site Reliability
Software Engineer, Site ReliabilityDevOps projects • San Francisco, CA, United States
Software Engineer, Site Reliability

Software Engineer, Site Reliability

DevOps projects • San Francisco, CA, United States
1 day ago
Job type
  • Full-time
Job description

Get weekly curated DevOps opportunities, salary insights, and career tips no spam, only relevant roles that match your stack and experience level.

Software Engineer, Site Reliability

Why Harvey

Harvey is a secure AI platform for legal and professional services that augments productivity and automates complex workflows. Harvey uses algorithms with reasoning‑adept LLMs that have been customized and developed by our expert team of lawyers, engineers and research scientists. We’ve found product market fit and are scaling our team very quickly. Some reasons to join Harvey are :

  • Exceptional product market fit : We have partnered with the largest law firms and professional service providers in the world, including Paul Weiss, A&O Shearman, Ashurst, O’Melveny & Myers, PwC, KKR, and many others.
  • Strategic investors : Raised over $500 million from strategic investors including Sequoia, Google Ventures, Kleiner Perkins, and OpenAI.
  • World‑class team : Harvey is hiring the best talent from DeepMind, Google Brain, Stripe, FAIR, Tesla Autopilot, Glean, Superhuman, Figma, and more.
  • Partnerships : Our engineers and researchers work directly with OpenAI to build the future of generative AI and redefine professional services.
  • Performance : 4x ARR in 2024.
  • Competitive compensation.

Role Overview

As a Software Engineer on the Site Reliability team at Harvey, you will ensure the reliability, scalability, and performance of our legal AI platform. You’ll join a high‑leverage team that sits at the intersection of infrastructure and product, owning the systems that keep our platform fast, secure, and always on. From scaling across 50+ regions to automating mission‑critical operations, your work will ensure that Harvey remains resilient as we grow. If you’re passionate about building robust systems and reducing complexity through automation, we’d love to work with you.

This role is based in San Francisco, CA. We use an in‑person work model and offer relocation assistance to new employees.

What You’ll Do

  • Design, implement, and manage monitoring, alerting, and infrastructure resources (compute, storage, networking) across 50+ global regions.
  • Lead incident management processes, including post‑mortems, root‑cause analyses, and driving actionable improvements.
  • Automate operational tasks and workflows, building tools and processes for capacity planning, graceful rollouts, and safe data access to maintain high reliability and reduce manual intervention.
  • Collaborate across teams to drive reliability, security, and compliance throughout the software lifecycle.
  • Optimize infrastructure costs through strategic capacity planning and build‑versus‑buy decisions while maintaining system performance, reliability, and functionality.
  • What You Have

  • 3+ years of experience in Site Reliability Engineering or similar roles supporting production environments.
  • Expertise in infrastructure‑as‑code (IaC) tools (Pulumi, Terraform, CloudFormation, etc.).
  • Deep familiarity with observability tools (Datadog, Sentry, etc.) and incident response practices (PagerDuty, IncidentIO, etc.).
  • Proficiency with cloud infrastructure platforms (Azure, GCP, AWS, etc.).
  • Strong programming skills (Python, Bash, Go, or similar languages).
  • Proven track record of diagnosing complex system problems and implementing durable solutions.
  • Solid understanding of CI / CD, Kubernetes, containerization, networking, and cloud security principles.
  • Excellent problem‑solving skills, meticulous attention to detail, and a commitment to operational excellence.
  • Compensation Range

    $175,000 - $250,000 USD

    Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity / expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.

    We are in the early innings of a generational company. Joining early at a hypergrowth startup has proven to lead to exponential growth in responsibility, access, and ability.

    Apply today!

    #J-18808-Ljbffr

    Create a job alert for this search

    Site Reliability Engineer • San Francisco, CA, United States

    Related jobs
    Software Engineer, Site Reliability

    Software Engineer, Site Reliability

    Fireworks AI • Redwood City, CA, United States
    Full-time
    Get AI-powered advice on this job and more exclusive features.Here at Fireworks, we're building the future of generative AI infrastructure. Fireworks offers the generative AI platform with the highe...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer (Site Reliability Engineer)

    Software Engineer (Site Reliability Engineer)

    Cerebras • San Francisco, CA, United States
    Full-time
    San Francisco or Palo Alto, CA.At Anyscale, we take a market-based approach to compensation.We are data-driven, transparent, and consistent. As the market data changes over time, the target salary f...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Latent • San Francisco, CA, United States
    Full-time
    Location : San Francisco, CA (5 Days In-Office).You are the infrastructure expert who enables our rapid product development and guarantees. AI platform for major health systems.Your focus on operatio...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, Site Reliability (SRE)

    Software Engineer, Site Reliability (SRE)

    Sierra • San Francisco, CA, United States
    Full-time
    At Sierra, we’re creating a platform to help businesses build better, more human customer experiences with AI.We are primarily an in-person company based in San Francisco, with growing offices in A...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, Site Reliability Engineer (SRE)

    Software Engineer, Site Reliability Engineer (SRE)

    Harvey • San Francisco, California, United States
    Full-time
    Harvey is a secure AI platform for legal and professional services that augments productivity and automates complex workflows. Harvey uses algorithms with reasoning-adept LLMs that have been customi...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Conductorone • San Francisco, California, United States
    Full-time
    ConductorOne is the modern identity governance platform that makes it possible to move beyond the limitations of legacy IGA and reduce the identity attack surface with confidence.Designed for flexi...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Together AI • San Francisco, CA, United States
    Full-time
    As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a soft...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer I

    Site Reliability Engineer I

    Prosper Marketplace • San Francisco, CA, United States
    Full-time
    As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry-level position is desi...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Zoox • Foster City, California, United States
    Full-time
    Zoox is looking for a platform / site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous veh...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer, Site Reliability Engineer (SRE)

    Senior Software Engineer, Site Reliability Engineer (SRE)

    harvey.ai • San Francisco, CA, United States
    Full-time
    At Harvey, we’re transforming how legal and professional services operate — not incrementally, but end-to-end.By combining frontier agentic AI, an enterprise-grade platform, and deep domain experti...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer (Site Reliability Engineer)

    Software Engineer (Site Reliability Engineer)

    Anyscale • San Francisco, CA, United States
    Full-time
    Software Engineer (Site Reliability Engineer).Software Engineer (Site Reliability Engineer).At Anyscale, we're on a mission to democratize distributed computing and make it accessible to software d...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer I

    Site Reliability Engineer I

    Prosper • San Francisco, CA, United States
    Full-time
    As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry-level position is desi...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Flexton, Inc. • San Francisco, CA, United States
    Full-time
    Skill : You have excellent written and verbal communication skills.You have experience managing large websites or services within the context of a large scale web environment.You are able to execute...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Speak • San Francisco, CA, United States
    Full-time
    Our mission is to reinvent the way people learn, starting with language.Learning a language can change a life by opening doors to new cultures, careers, and communities. Two billion people around th...Show more
    Last updated: 9 days ago • Promoted
    Software Engineer, Reliability

    Software Engineer, Reliability

    OpenAI • San Francisco, CA, United States
    Full-time
    Join the engineering teams that bring OpenAI's ideas safely to the world!!.The Applied Engineering team works across research, engineering, product, and design to bring OpenAI's technology to consu...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Replit • Foster City, California, United States
    Full-time
    Replit is the fastest way to turn ideas into software.With our powerful AI-powered Agent and Assistant, anyone can create and launch apps from natural language in just one click.Build and deploy fu...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Checkr • San Francisco, California, United States
    Full-time
    Checkr is building the data platform to power safe and fair decisions.Established in 2014, Checkr’s innovative technology and robust data platform help customers assess risk and ensure safety and c...Show more
    Last updated: 30+ days ago • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Visa • Foster City, California, United States
    Full-time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...Show more
    Last updated: 30+ days ago • Promoted