Talent.com
No longer accepting applications
Site Reliability Engineer

Site Reliability Engineer

DaVitaPalo Alto, CA, United States
1 day ago
Job type
  • Full-time
Job description

Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a real quantum computer. PsiQuantum is on a mission to build the first real, useful quantum computers, capable of delivering the world-changing applications that the technology has long promised. We know that means we will need to build a system with roughly 1 million qubits that supports fault tolerant error correction within a scalable architecture, and a data center footprint.

By harnessing the laws of quantum physics, quantum computers can provide exponential performance increases over today's most powerful supercomputers, offering the potential for extraordinary advances across a broad range of industries including climate, energy, healthcare, pharmaceuticals, finance, agriculture, transportation, materials design, and many more.

PsiQuantum has determined the fastest path to delivering a useful quantum computer, years earlier than the rest of the industry. Our architecture is based on silicon photonics which gives us the ability to produce our components at Tier-1 semiconductor fabs such as GlobalFoundries where we leverage high-volume semiconductor manufacturing processes, the same processes that are already producing billions of chips for telecom and consumer electronics applications. We also benefit from the quantum mechanics reality that photons don't feel heat or electromagnetic interference, allowing us to take advantage of existing cryogenic cooling systems and industry standard fiber connectivity.

In 2024, PsiQuantum announced two government-funded projects to support the build-out of our first Quantum Data Centers and utility-scale quantum computers in Brisbane, Australia and Chicago, Illinois. Both projects are backed by nations that understand quantum computing's potential impact and the need to scale this technology to unlock that potential. And we won't just be building the hardware, but also the fault tolerant quantum applications that will provide industry-transforming results.

Quantum computing is not just an evolution of the decades-old advancement in compute power. It provides the key to mastering our future, not merely discovering it. The potential is enormous, and we have the plan to make it real. Come join us.

There's much more work to be done and we are looking for exceptional talent to join us on this extraordinary journey!

Job Summary :

Join the OS / Platform team as a Site Reliability Engineer (SRE) and keep our services healthy, observable, and fast. Partnering with the Platform Engineering group, you'll own the daytoday operation of our monitoring stack-Grafana, Prometheus, Loki, and Tempo-crafting dashboards that surface golden signals and drive realtime insight. You'll codify reliability through SLIs / SLOs, automate runbooks in Python, and lead incident response to maintain worldclass uptime across both onprem and AWS environments.

Responsibilities :

  • Define, implement, and iterate on Service Level Indicators & Service Level Objectives (SLIs / SLOs) and error budgets for critical services.
  • Build and maintain Grafana dashboards that visualize golden signals (latency, traffic, errors, saturation) for engineers and stakeholders.
  • Operate and tune our observability pipeline (Prometheus, Loki, Tempo) to ensure scalable, lowlatency telemetry ingestion and alerting.
  • Drive incident response : triage, mitigate, perform postincident reviews, and implement preventive actions.
  • Develop automation and selfservice tooling in Python / Bash to streamline alerts, runbooks, and operational tasks.
  • Collaborate with Platform and Product teams on capacity planning, performance testing, and change management.
  • Improve CI / CD health checks and release safety nets within GitLab.
  • Contribute to infrastructure as code (Terraform, Ansible) for monitoring stack deployments and upgrades.

Experience / Qualifications :

  • Bachelor's Degree or higher in Computer Science, Engineering or other related technical field.
  • 5+ years in an SRE, DevOps, or Production Engineering role supporting distributed systems in production.
  • Handson expertise with observability tools : Grafana, Prometheus, Loki, Tempo (or equivalent).
  • Proven track record designing dashboards and alerts around golden signals and (Utilization, Saturation, Errors) USE and RED (Rate, Errors, Duration) methodologies.
  • Solid scripting / automation skills in Python and Bash; familiarity with GitLab CI pipelines.
  • Operational experience with Kubernetes and containerized workloads.
  • Working knowledge of AWS services, networking fundamentals, and load balancing.
  • Experience running incident response and writing actionable postmortems.
  • Familiarity with Infrastructure as Code (Terraform, Ansible) and configuration management.
  • Exposure to regulated environments and multiregion architectures is a plus.
  • Strong communication and collaboration skills; comfortable acting as a generalist across infrastructure, application, and data layers.
  • PsiQuantum provides equal employment opportunity for all applicants and employees. PsiQuantum does not unlawfully discriminate on the basis of race, color, religion, sex (including pregnancy, childbirth, or related medical conditions), gender identity, gender expression, national origin, ancestry, citizenship, age, physical or mental disability, military or veteran status, marital status, domestic partner status, sexual orientation, genetic information, or any other basis protected by applicable laws.

    Note : PsiQuantum will only reach out to you using an official PsiQuantum email address and will never ask you for bank account information as part of the interview process. Please report any suspicious activity to recruiting@psiquantum.com .

    We are not accepting unsolicited resumes from employment agencies.

    The ranges below reflect the target ranges for a new hire base salary. One is for the Bay Area (within 50 miles of HQ, Palo Alto), the second one (if applicable) is for elsewhere in the US (beyond 50 miles of HQ, Palo Alto). If there is only one range, it is for the specific location of where the position will be located.Actual compensation may vary outside of these ranges and is dependent on various factors including but not limited to a candidate's qualifications including relevant education and training, competencies, experience, geographic location, and business needs. Base pay is only one part of the total compensation package. Full time roles are eligible for equity and benefits. Base pay is subject to change and may be modified in the future.

    U.S. Base Pay Range $120,000—$140,000 USD Bay Area Pay Range $145,000—$165,000 USD #J-18808-Ljbffr

    Create a job alert for this search

    Site Reliability Engineer • Palo Alto, CA, United States

    Related jobs
    Site Reliability Engineer

    Site Reliability Engineer

    DTEX SystemsFremont, CA, US
    Full-time
    Quick Apply
    We are excited that you’ve taken the time to explore our business and potentially join us on this incredible journey.We are already the leader in the Insider Risk Management, but our story do...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - SRE at Descope Los Altos, CA

    Site Reliability Engineer - SRE at Descope Los Altos, CA

    Itlearn360Los Altos, CA, United States
    Full-time
    Site Reliability Engineer - SRE job at Descope.Descope R&D group is a skilled team of developers with a unique DNA of creativity,flexibility,anopen mindset. We are looking for a passionate SRE to jo...Show moreLast updated: 30+ days ago
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Tarana WirelessMilpitas, CA, US
    Full-time
    Quick Apply
    Join the Team That's Redefining Wireless Technology At Tarana , we're more than just a fast-growing tech company—we’re a team of bold innovators on a mission to revolutionize broa...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Software Engineer

    Senior Software Engineer

    University of California - Santa CruzSanta Cruz, CA, United States
    Full-time +1
    NO VISA SPONSORSHIP AVAILABLE FOR THIS POSITION.Applicants must have current work authorization when accepting a Genomics Institute staff position. We are unable to sponsor or take over sponsorship ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Building Engineer

    Senior Building Engineer

    CBRE GroupLivermore, CA, US
    Full-time
    CBRE GWS works with clients to make real estate a meaningful contributor to organizational productivity and performance.Our account management model is at the heart of our client-centric approach t...Show moreLast updated: 4 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PsiQuantumPalo Alto, CA, United States
    Full-time
    Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show moreLast updated: 30+ days ago
    • Promoted
    Sr Site Reliability Engineer Denver, CO;San Francisco, CA;New York, NY;Seattle, WA;Toronto, Ont[...]

    Sr Site Reliability Engineer Denver, CO;San Francisco, CA;New York, NY;Seattle, WA;Toronto, Ont[...]

    GustoSan Francisco, CA, United States
    Full-time
    Gusto is a modern, online people platform that helps small businesses take care of their teams.On top of full-service payroll, Gusto offers health insurance, 401(k)s, expert HR, and team management...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer, Storage

    Senior Site Reliability Engineer, Storage

    Epoch BiodesignSan Francisco, CA, United States
    Full-time
    Crusoe Energy is on a mission to unlock value in stranded energy resources through the power of computation.Take a look at what we do! - https : / / www. We aim to align the long term interests of the c...Show moreLast updated: 30+ days ago
    • Promoted
    Building Engineer

    Building Engineer

    CBRE GroupLivermore, CA, US
    Full-time
    CBRE Global Workplace Solutions (GWS) works with clients to make real estate a meaningful contributor to organizational productivity and performance. Our account management model is at the heart of ...Show moreLast updated: 4 days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Foxconn Industrial Internet - FIISan Jose, CA, US
    Full-time +1
    Quick Apply
    Site Reliability Engineer Foxconn Industrial Internet (Fii), is a world leading professional design and manufacturing service provider of communication network equipment, cloud service equipment, p...Show moreLast updated: 30+ days ago
    • Promoted
    Software Engineer (Site Reliability Engineer)

    Software Engineer (Site Reliability Engineer)

    CerebrasSan Francisco, CA, United States
    Full-time
    San Francisco or Palo Alto, CA.At Anyscale, we take a market-based approach to compensation.We are data-driven, transparent, and consistent. As the market data changes over time, the target salary f...Show moreLast updated: 30+ days ago
    • Promoted
    Lead Plumbing Engineer

    Lead Plumbing Engineer

    ACCO Engineered SystemsPleasanton, CA, United States
    Full-time
    This position is responsible for independently delivering engineering services, from conceptual design through construction completion. Essential Duties & Responsibilities.Complete project planning,...Show moreLast updated: 30+ days ago
    • Promoted
    Design Engineer

    Design Engineer

    ACCO Engineered SystemsPleasanton, CA, United States
    Full-time
    This position is responsible for the delivery of engineering services, from conceptual design through construction completion of HVAC, Plumbing, and other contract services offered by ACCO Engineer...Show moreLast updated: 30+ days ago
    • Promoted
    Sr. Manufacturing Engineer (Aircraft and Maintenance)

    Sr. Manufacturing Engineer (Aircraft and Maintenance)

    Reliable RoboticsSan Martin, CA, United States
    Permanent
    We're building safety-enhancing technology for aviation that will save lives.Automated aviation systems will enable a future where air transportation is safer, more convenient and fundamentally tra...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineering Manager-Ecommerce

    Site Reliability Engineering Manager-Ecommerce

    Synstack TechnologiesSan Ramon, CA, US
    Full-time
    This is Hemanth from Synstack, please share your resume for below opportunity.Location – San Ramon, CA (onsite).A Site Reliability Engineer is a professional who acts as a warrior to monitor,...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer (SRE) - grok.com & API

    Site Reliability Engineer (SRE) - grok.com & API

    Pantera CapitalPalo Alto, CA, United States
    Full-time
    AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...Show moreLast updated: 2 days ago
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    OPPO US Research CenterPalo Alto, CA, US
    Full-time
    Quick Apply
    OPPO US Research Center is seeking a skilled and proactive.Site Reliability Engineer (SRE).In this role, you will be responsible for ensuring the stability, scalability, and performance of our appl...Show moreLast updated: 30+ days ago
    • Promoted
    Construction Manager

    Construction Manager

    Clearance JobsMenlo Park, CA, US
    Full-time +1
    Akima Infrastructure Services, LLC (AIS), is actively seeking individuals who can contribute to national security within the Science & Technical fields as part of our staff augmentation team suppor...Show moreLast updated: 30+ days ago
    • Promoted
    Sr. Staff Engineer

    Sr. Staff Engineer

    Bio-Rad LaboratoriesPleasanton, CA, United States
    Full-time
    You'll drive the development of hardware products that directly impact healthcare innovation and improve lives worldwide. You'll collaborate cross-functionally to.Your expertise in electrical engine...Show moreLast updated: 30+ days ago
    • Promoted
    Consumables Sustaining Engineer

    Consumables Sustaining Engineer

    Bio-Rad LaboratoriesPleasanton, CA, United States
    Full-time
    Bio-Rad is looking for a Consumables Sustaining Engineer to join the Life Science Group of Bio-Rad based in Pleasanton, CA. You will be part of a multidisciplinary team focused on developing innovat...Show moreLast updated: 30+ days ago