Talent.com
Site Reliability Engineer (SRE) - grok.com & API
Site Reliability Engineer (SRE) - grok.com & APIXai • Palo Alto, CA, United States
Site Reliability Engineer (SRE) - grok.com & API

Site Reliability Engineer (SRE) - grok.com & API

Xai • Palo Alto, CA, United States
2 days ago
Job type
  • Full-time
Job description

About xAI

xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company's mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About the team

You will work on the team that is responsible for the backend services that power grok.com and our API. Our team is currently based primarily in London with a small but growing number of engineers located in Palo Alto. We focus on writing highly scalable and reliable services that can efficiently process tens of thousands of queries per second. The services are hosted on a number of Kubernetes clusters (on-prem & cloud).

About the role

An ideal candidate meets at least the following requirements :

  • Expert knowledge of Kubernetes,
  • Expert knowledge of continuous deployment systems such as Buildkite and ArgoCD,
  • Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty,
  • Expert knowledge of infrastructure as code technologies such as Pulumi or Terraform.

Location

We hire engineers in London and in Palo Alto. We usually work from the office 5 days a week but allow for work-from-home days when required. Candidates joining the London team must be willing to attend late meetings at least once a week to coordinate with the rest of our team.

Interview process

After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15 minute interview ("phone interview") during which a member of our team will ask some basic technical questions. If you clear the initial phone interview, you will enter the main process, which consists of two technical interviews.

All interviews will be conducted via Google Meet.

Annual Salary Range

$180,000 - $440,000 USD

Benefits

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

xAI is an equal opportunity employer.

California Consumer Privacy Act (CCPA) Notice

Create a job alert for this search

Site Reliability Engineer Sre • Palo Alto, CA, United States

Related jobs
Site Reliability Engineer

Site Reliability Engineer

Together AI • San Francisco, CA, United States
Full-time
As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a soft...Show more
Last updated: 2 days ago • Promoted
Senior Site Reliability Engineer (SRE)

Senior Site Reliability Engineer (SRE)

ACL Digital • Mountain View, CA, United States
Full-time
Title : Senior Site Reliability Engineer (SRE).Design, develop, and maintain automation frameworks for performance testing and monitoring of QuickBooks infrastructure. Ensure the scalability and reli...Show more
Last updated: 2 days ago • Promoted
Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

AI Fund • San Francisco, CA, United States
Full-time
Baseten powers inference for the world's most dynamic AI companies, like.As a Site Reliability Engineer, you'll envision and build robust systems and processes that ensure our infrastructure is sca...Show more
Last updated: 2 days ago • Promoted
Lead Site Reliability Engineer (SRE)

Lead Site Reliability Engineer (SRE)

EPAM Systems Inc • San Francisco, CA, United States
Full-time
At EPAM, we're not just building software - we're engineering excellence.Lead Site Reliability Engineer (SRE).This role is ideal for someone who thrives in fast-paced financial systems, has a passi...Show more
Last updated: 2 days ago • Promoted
Site Reliability Engineer I

Site Reliability Engineer I

Prosper • San Francisco, CA, United States
Full-time
As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry-level position is desi...Show more
Last updated: 7 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

PsiQuantum • Palo Alto, CA, United States
Full-time
Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show more
Last updated: 30+ days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Runloop AI, Inc • San Francisco, CA, United States
Full-time
Runloop is building the foundational infrastructure for the next generation of AI development.We provide AI engineers and data scientists with lightning-fast, secure, and reproducible code sandboxe...Show more
Last updated: 2 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Rethink recruit • San Francisco, CA, United States
Full-time
Runloop is building the foundational infrastructure for the next generation of AI development.We provide AI engineers and data scientists with lightning-fast, secure, and reproducible code sandboxe...Show more
Last updated: 2 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Insight Global • Santa Clara, CA, United States
Full-time
Insight Global is looking for a seasoned SRE to join one of our largest technology clients' multifaceted and fast-paced Infrastructure, Planning and Processes organization where you will be working...Show more
Last updated: 2 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Runloop AI • San Francisco, CA, United States
Full-time
Runloop is building the foundational infrastructure for the next generation of AI development.We provide AI engineers and data scientists with lightning-fast, secure, and reproducible code sandboxe...Show more
Last updated: 11 days ago • Promoted
Site Reliability Engineer - Openstack

Site Reliability Engineer - Openstack

Fortinet • Sunnyvale, CA, United States
Full-time
Fortinet is recruiting a Site Reliability Engineer- OPENSTACK to join our FortiStack team.This team is responsible for the management, operation and continued development of our Openstack-based pri...Show more
Last updated: 26 days ago • Promoted
Site Reliability Engineer (SRE) - grok.com & API

Site Reliability Engineer (SRE) - grok.com & API

Pantera Capital • Palo Alto, CA, United States
Full-time
AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...Show more
Last updated: 30+ days ago • Promoted
Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Baseten • San Francisco, CA, United States
Full-time
Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and Zed. By uniting applied AI research, flexible inf...Show more
Last updated: 7 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Signify Technology • Palo Alto, CA, US
Full-time
Competitive, based on experience.We are a technology startup advancing healthcare with a safety-focused AI platform that assists medical professionals by managing patient communications, including ...Show more
Last updated: 21 days ago • Promoted
Sr Site Reliability Engineer

Sr Site Reliability Engineer

F5 Networks • San Jose, CA, United States
Full-time
Sr Site Reliability Engineer page is loaded## Sr Site Reliability Engineerremote type : Hybridlocations : San Jose : Seattletime type : Full timeposted on : Posted Todayjob requisition id : RP1034618At F...Show more
Last updated: 2 days ago • Promoted
Site Reliability Engineer I

Site Reliability Engineer I

Prosper.com • San Francisco, CA, United States
Full-time
As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry-level position is desi...Show more
Last updated: 2 days ago • Promoted
Site Reliability Engineer - Supercomputing

Site Reliability Engineer - Supercomputing

Xai • Palo Alto, CA, United States
Full-time
Site Reliability Engineer - Supercomputing.We are seeking a talented Site Reliability Engineer (SRE) to join our SuperComputing team. In this role, you'll ensure the reliability, scalability, and pe...Show more
Last updated: 2 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Rockwoods Inc • Pleasanton, CA, US
Full-time
Note : Candidates must have relevant experience in Medical / Healthcare domains, this is mandatory.Senior SRE Engineer - Pleasanton, 5 days office. Primary work : 24x7 On-call support and setting up mo...Show more
Last updated: 21 days ago • Promoted