Talent.com
Sr Reliability Engineer
Sr Reliability EngineerHippocratic AI • San Francisco, CA, US
Sr Reliability Engineer

Sr Reliability Engineer

Hippocratic AI • San Francisco, CA, US
10 days ago
Job type
  • Full-time
Job description

Job Description

About Us

Hippocratic AI is the leading generative AI company in healthcare. We have the only system that can have safe, autonomous, clinical conversations with patients. We have trained our own LLMs as part of our Polaris constellation, resulting in a system with over 99.9% accuracy.

Why Join Our Team

Reinvent healthcare with AI that puts safety first. We’re building the world’s first healthcare‑only, safety‑focused LLM — a breakthrough platform designed to transform patient outcomes at a global scale. This is category creation.

Work with the people shaping the future. Hippocratic AI was co‑founded by CEO Munjal Shah and a team of physicians, hospital leaders, AI pioneers, and researchers from institutions like El Camino Health, Johns Hopkins, Washington University in St. Louis, Stanford, Google, Meta, Microsoft, and NVIDIA.

Backed by the world’s leading healthcare and AI investors. We recently raised a $126M Series C at a $3.5B valuation, led by Avenir Growth, bringing total funding to $404M with participation from CapitalG, General Catalyst, a16z, Kleiner Perkins, Premji Invest, UHS, Cincinnati Children’s, WellSpan Health, John Doerr, Rick Klausner, and others.

Build alongside the best in healthcare and AI. Join experts who’ve spent their careers improving care, advancing science, and building world‑changing technologies — ensuring our platform is powerful, trusted, and truly transformative.

Location Requirement

We believe the best ideas happen together. To support fast collaboration and a strong team culture, this role is expected to be in our Palo Alto office five days a week, unless otherwise specified.

About the Role

We are seeking a highly skilled Senior Site Reliability Engineer to join our team. In this role responsibilities will include designing and implementing infrastructure automation, continuous integration and delivery pipelines, and monitoring and scaling the infrastructure that powers our healthcare AI platform. You will work closely with software engineers, research scientists, and other cross-functional teams to develop and maintain reliable and scalable infrastructure that enables rapid iteration and deployment of our products.

What You'll Do

  • Design and implement infrastructure automation and deployment pipelines using tools such as Terraform
  • Implement and maintain monitoring and logging systems to ensure the reliability and performance of our healthcare AI platform
  • Work closely with software engineers to design and deploy scalable, fault-tolerant, and secure production systems on cloud platforms such as AWS, GCP, or Azure
  • Develop and maintain security and compliance policies and procedures for our healthcare AI platform
  • Collaborate with cross-functional teams to troubleshoot and resolve complex issues related to infrastructure, deployment, and operations
  • Implement and maintain disaster recovery and business continuity plans
  • Develop and maintain documentation related to infrastructure, deployment, and operations
  • Mentor and provide technical guidance to junior engineers

What You Bring

Must-Have :

  • Bachelor's or Master's degree in Computer Science, Computer Engineering, or a related field
  • At least 5 years of professional experience as SRE
  • Strong skills in building cloud infra orchestration systems (Operators) using python, Go
  • Expertise in infrastructure automation and deployment tools such as Terraform, or GitLab CI / CD
  • Experience with cloud platforms such as AWS, GCP, or Azure
  • Strong knowledge of containerization technologies such as Docker and Kubernetes
  • Experience with monitoring and logging tools such as ELK, Grafana, or Datadog
  • Familiarity with security and compliance best practices and tools such as HashiCorp Vault, AWS KMS, or Azure Key Vault
  • Strong problem-solving skills and ability to work independently and collaboratively in a team environment
  • Excellent communication and interpersonal skills
  • Experience implementing HIPAA and SOC2 compliance in a plus
  • Experience working in an HPC Environment is a plus
  • Join our team at Hippocratic AI and help shape the future of clinically safe, production-grade AI systems.

    Please be aware of recruitment scams impersonating Hippocratic AI. All recruiting communication will come from @hippocraticai.com email addresses. We will never request payment or sensitive personal information during the hiring process.

    Create a job alert for this search

    Sr Reliability Engineer • San Francisco, CA, US

    Similar jobs
    Senior Technology Site Reliability Engineer

    Senior Technology Site Reliability Engineer

    Cooley LLP • San Francisco, CA, United States
    Full-time
    Senior Technology Site Reliability Engineer.Cooley is seeking a Senior Site Reliability Engineer to join the.Infrastructure & Development Operations. The Senior Technology Site Reliability Engineer(...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer - Platform

    Site Reliability Engineer - Platform

    CodeRabbit • San Francisco, CA, United States
    Full-time
    CodeRabbit is an innovative research and development company focused on building extraordinarily productive human‑machine collaboration systems. Our primary goal is to create the next generation of ...Show more
    Last updated: 13 days ago • Promoted
    Site Reliability Engineer US - San Francisco

    Site Reliability Engineer US - San Francisco

    Near Inc. • San Francisco, CA, United States
    Full-time
    The NEAR AI engineering team is developing decentralized and confidential machine learning infrastructure to power user owned AI. We currently focus on building infrastructure to enable private and ...Show more
    Last updated: 7 days ago • Promoted
    Staff SRE : Reliability & Scale for Cloud Platform

    Staff SRE : Reliability & Scale for Cloud Platform

    PowerToFly • Redwood City, CA, United States
    Full-time
    A leading SaaS firm is looking for a Site Reliability Engineer in Redwood City, California.This role involves leading reliability for microservices, developing SLOs, and enhancing service availabil...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer Hybrid - San Francisco

    Site Reliability Engineer Hybrid - San Francisco

    Grammarly, Inc. • San Francisco, CA, United States
    Full-time
    Superhuman offers a dynamic hybrid working model for this role.This flexible approach gives team members the best of both worlds : plenty of focus time along with in-person collaboration that helps ...Show more
    Last updated: 2 days ago • Promoted
    Senior Site Reliability Engineer : Scale & Reliability

    Senior Site Reliability Engineer : Scale & Reliability

    Google Inc. • San Francisco, CA, United States
    Full-time
    A leading technology firm in San Francisco is seeking a Software Engineer III for site reliability engineering.This full-time role requires a Bachelor's degree in Computer Science and at least two ...Show more
    Last updated: 6 days ago • Promoted
    Software Engineer, Site Reliability Engineer (SRE)

    Software Engineer, Site Reliability Engineer (SRE)

    Harvey • San Francisco, California, United States
    Full-time
    Harvey is a secure AI platform for legal and professional services that augments productivity and automates complex workflows. Harvey uses algorithms with reasoning-adept LLMs that have been customi...Show more
    Last updated: 30+ days ago • Promoted
    Sr Reliability Engineer

    Sr Reliability Engineer

    Hippocratic AI • San Francisco, CA, United States
    Full-time
    Hippocratic AI is the leading generative AI company in healthcare.We have the only system that can have safe, autonomous, clinical conversations with patients. We have trained our own LLMs as part o...Show more
    Last updated: 11 days ago • Promoted
    Reliability Engineer

    Reliability Engineer

    Periodiclabs • Menlo Park, CA, United States
    Full-time
    We are an AI + physical sciences lab building state of the art models to make novel scientific discoveries.We are well funded and growing rapidly. Team members are owners who identify and solve prob...Show more
    Last updated: 30+ days ago • Promoted
    Senior SRE Engineer - Reliability & Scale

    Senior SRE Engineer - Reliability & Scale

    Roblox Corporation • San Mateo, CA, United States
    Full-time
    A leading gaming platform is seeking a Senior Software Engineer - Site Reliability to ensure system performance, reliability, and efficiency. Responsibilities include creating resilient software, de...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer (SRE) - AI Infrastructure

    Site Reliability Engineer (SRE) - AI Infrastructure

    Hamilton Barnes Associates Limited • San Francisco, CA, United States
    Full-time
    Are you looking for an exciting new opportunity?.Join a stealth-mode hyperscale data center startup building a next-generation AI and cloud platform designed for startups and advanced research, pow...Show more
    Last updated: 30+ days ago • Promoted
    Founding SRE Engineer – Reliability & Growth

    Founding SRE Engineer – Reliability & Growth

    Asana • San Francisco, CA, United States
    Full-time
    A leading software company is seeking experienced Software Engineers to join the new Site Reliability Engineering team.This role focuses on building reliable, scalable systems and leading projects ...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Mercor, Inc. • San Francisco, CA, United States
    Full-time
    Mercor is at the intersection of labor markets and AI research.We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development.Our vast talent network ...Show more
    Last updated: 24 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    gamma.app • San Francisco, CA, United States
    Full-time
    We're building the creative layer for modern communication.Every month, over a billion people make presentations — but the tools they use to make them haven't evolved in decades.We're changing that...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer, Site Reliability Engineer (SRE)

    Senior Software Engineer, Site Reliability Engineer (SRE)

    harvey.ai • San Francisco, CA, United States
    Full-time
    At Harvey, we’re transforming how legal and professional services operate — not incrementally, but end-to-end.By combining frontier agentic AI, an enterprise-grade platform, and deep domain experti...Show more
    Last updated: 30+ days ago • Promoted
    Founding Site Reliability Engineer

    Founding Site Reliability Engineer

    Assort Health Inc. • San Francisco, CA, United States
    Full-time
    Our mission is to make exceptional healthcare accessible anytime, anywhere, for everyone.At Assort Health, we believe healthcare should feel effortless and connected — quick answers, clear communic...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer III

    Site Reliability Engineer III

    Veeam • San Francisco, CA, United States
    Full-time
    Veeam, the #1 global market leader in data resilience, believes businesses should control all their data whenever and wherever they need it. Veeam provides data resilience through data backup, data ...Show more
    Last updated: 2 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Mvp VC • San Francisco, CA, United States
    Full-time
    Loft Orbital is revolutionizing access to space by building reliable, shareable satellites that drastically reduce the time and complexity traditionally required to get to orbit.We operate satellit...Show more
    Last updated: 14 days ago • Promoted