Talent.com
Staff Software Engineer, Site Reliability Engineer (SRE)

Harvey • San Francisco, California, United States
30+ days ago
Job type
  • Full-time
Job description

Why Harvey

Harvey is a secure AI platform for legal and professional services that augments productivity and automates complex workflows. Harvey uses algorithms with reasoning-adept LLMs that have been customized and developed by our expert team of lawyers, engineers, and research scientists. We’ve found product-market fit and are scaling our team very quickly. Some reasons to join Harvey are:

Exceptional product-market fit: We have partnered with the largest law firms and professional service providers in the world, including Paul Weiss, A&O Shearman, Ashurst, O'Melveny & Myers, PwC, KKR, and many others.

Strategic investors: Raised over $500 million from strategic investors including Sequoia, Google Ventures, Kleiner Perkins, and OpenAI.

World-class team: Harvey is hiring the best talent from DeepMind, Google Brain, Stripe, FAIR, Tesla Autopilot, Glean, Superhuman, Figma, and more.

Partnerships: Our engineers and researchers work directly with OpenAI to build the future of generative AI and redefine professional services.

Performance: 4x ARR in 2024.

Competitive compensation.

Role Overview

As a Staff Software Engineer on the Site Reliability team at Harvey, you will ensure the reliability, scalability, and performance of our legal AI platform. You’ll join a high-leverage team that sits at the intersection of infrastructure and product, owning the systems that keep our platform fast, secure, and always on. From scaling across 50+ regions to automating mission-critical operations, your work will ensure that Harvey remains resilient as we grow. If you’re passionate about building robust systems and reducing complexity through automation, we’d love to work with you.

This role is based in San Francisco, CA. We use an in-person work model and offer relocation assistance to new employees.

What You’ll Do

Design, implement, and manage monitoring, alerting, and infrastructure resources (compute, storage, networking) across 50+ global regions

Lead incident management processes, including postmortems, root cause analyses, and driving actionable improvements

Automate operational tasks and workflows, building tools and processes for capacity planning, graceful rollouts, and safe data access to maintain high reliability and reduce manual intervention

Develop and enforce best practices for security, compliance, and infrastructure reliability while collaborating cross-functionally to integrate these principles throughout the software lifecycle

Optimize infrastructure costs through strategic capacity planning and build-versus-buy decisions while maintaining system performance, reliability, and functionality

Provide technical mentorship and leadership, promoting best practices and fostering team growth

What You Have

10+ years of experience in Site Reliability Engineering or similar roles supporting production environments, with proven ability to mentor and guide technical teams

Expertise in infrastructure-as-code (IaC) tools (Pulumi, Terraform, CloudFormation, etc.)

Deep familiarity with observability tools (Datadog, Sentry, etc.) and incident response tools and practices (PagerDuty, incident.io, etc.)

Proficiency with cloud infrastructure platforms (Azure, GCP, AWS, etc.)

Strong programming skills (Python, Bash, Go, or similar languages)

Proven track record of diagnosing complex system problems and implementing durable solutions

Solid understanding of CI/CD, Kubernetes, containerization, networking, and cloud security principles

Excellent problem-solving skills, meticulous attention to detail, and a commitment to operational excellence

Compensation Range

$250,000 - $290,000 USD

Please find our CA applicant privacy notice here.

Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.

We are in the early innings of a generational company. Joining early at a hypergrowth startup has proven to lead to exponential growth in responsibility, access, and ability. Apply here today!
