Talent.com
Site Reliability Engineer (SRE) - grok.com & API
Site Reliability Engineer (SRE) - grok.com & APIXai • Palo Alto, California, United States
Site Reliability Engineer (SRE) - grok.com & API

Site Reliability Engineer (SRE) - grok.com & API

Xai • Palo Alto, California, United States
30+ days ago
Job type
  • Full-time
Job description

About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.

Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity.

We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important.

All engineers and researchers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About the team

You will work on the team that is responsible for the backend services that power grok.com and our API. Our team is currently based primarily in London with a small but growing number of engineers located in Palo Alto. We focus on writing highly scalable and reliable services that can efficiently process tens of thousands of queries per second. The services are hosted on a number of Kubernetes clusters (on-prem & cloud).

About the role

An ideal candidate meets at least the following requirements :

  • Expert knowledge of Kubernetes,
  • Expert knowledge of continuous deployment systems such as Buildkite and ArgoCD,
  • Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty,
  • Expert knowledge of infrastructure as code technologies such as Pulumi or Terraform.

Location

We hire engineers in London and in Palo Alto. We usually work from the office 5 days a week but allow for work-from-home days when required. Candidates joining the London team must be willing to attend late meetings at least once a week to coordinate with the rest of our team.

Interview process

After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15 minute interview (“phone interview”) during which a member of our team will ask some basic technical questions. If you clear the initial phone interview, you will enter the main process, which consists of two technical interviews.

All interviews will be conducted via Google Meet.

Benefits

  • Competitive cash-based compensation
  • xAI equity
  • Private health and dental insurance
  • Annual Salary Range

    $180,000 - $440,000 USD

    xAI is an equal opportunity employer and does not unlawfully discriminate based on race, color, religion, ethnicity, ancestry, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, age, disability, medical conditions, genetic information, marital status, military or veteran status, or any other applicable legally protected characteristics.

    Qualified applicants with arrest or conviction records will be considered for employment in accordance with all applicable federal, state, and local laws, including the San Francisco Fair Chance Ordinance, Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act.

    For Los Angeles County (unincorporated) Candidates :

    xAI reasonably believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of a conditional offer of employment :

  • Access to information technology systems and confidential information, including proprietary and trade secret information, and / or user data;
  • Interacting with internal and / or external clients and colleagues; and
  • Exercising sound judgment.
  • California Consumer Privacy Act (CCPA) Notice

    Create a job alert for this search

    Site Reliability Engineer Sre • Palo Alto, California, United States

    Related jobs
    Site Reliability Engineer - SRE at Descope Los Altos, CA

    Site Reliability Engineer - SRE at Descope Los Altos, CA

    Itlearn360 • Los Altos, CA, United States
    Full-time
    Site Reliability Engineer - SRE job at Descope.Descope R&D group is a skilled team of developers with a unique DNA of creativity,flexibility,anopen mindset. We are looking for a passionate SRE to jo...Show more
    Last updated: 30+ days ago • Promoted
    Sr. Site Reliability Engineer

    Sr. Site Reliability Engineer

    Globality • Palo Alto, California, United States
    Full-time
    Joel Hyatt and Lior Delgo founded Globality with a vision to create prosperous and healthy economies, companies, communities, and individuals. In this new era of the Autonomous Enterprise, Globality...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer (Senior SRE)

    Senior Site Reliability Engineer (Senior SRE)

    Ciroos • Pleasanton, California, United States
    Full-time
    Ciroos (pronounced “Sai rose”) is a seed-stage startup founded in February 2025 by a team of experienced executives and distinguished engineers with deep expertise in observability, AI, distributed...Show more
    Last updated: 30+ days ago • Promoted
    CDN Site Reliability Engineer (SRE) L5

    CDN Site Reliability Engineer (SRE) L5

    Netflix • Los Gatos, California, United States
    Full-time
    Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and lan...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    DevOps projects • Mountain View, CA, United States
    Full-time
    Site Reliability Engineer (SRE).Manage and improve infrastructure including code reviews, deploying new infrastructure, upgrades, and migration of services. Provide tooling and infrastructure suppor...Show more
    Last updated: 1 day ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PsiQuantum • Palo Alto, CA, United States
    Full-time
    Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer (SRE) - grok.com & API

    Site Reliability Engineer (SRE) - grok.com & API

    Pantera Capital • Palo Alto, CA, United States
    Full-time
    AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Psiquantum • Palo Alto, California, United States
    Full-time
    Quantum computing holds the promise of humanity’s mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    FLUIX • Palo Alto, CA, United States
    Full-time
    FLUIX is building the AI operating system that plans, designs, and optimizes AI infrastructure.We are based in Silicon Valley. We specialize in providing AI-driven solutions for data centers and pow...Show more
    Last updated: 17 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Foxconn Industrial Internet - FII • San Jose, CA, US
    Full-time +1
    Quick Apply
    Site Reliability Engineer Foxconn Industrial Internet (Fii), is a world leading professional design and manufacturing service provider of communication network equipment, cloud service equipment, p...Show more
    Last updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Natcast • Sunnyvale, California, United States
    Full-time
    Natcast (short for The National Center for the Advancement of Semiconductor Technology) is a new, purpose-built, non-profit entity created to operate the National Semiconductor Technology Center (N...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Tarana Wireless • Milpitas, California, United States
    Full-time
    Join the Team That's Redefining Wireless Technology.Our groundbreaking Fixed Wireless Access technology is delivering .As a Site Reliability Engineer, you will help us manage software that runs on ...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer (SRE) - Media Production Infrastructure

    Site Reliability Engineer (SRE) - Media Production Infrastructure

    Monks • Cupertino, California, United States
    Full-time
    Please note that we will never request payment or bank account information at any stage of the recruitment process.As we continue to grow our teams, we urge you to be cautious of fraudulent job pos...Show more
    Last updated: 16 days ago • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    OPPO • Palo Alto, CA, United States
    Full-time
    OPPO US Research Center is seeking a skilled and proactive.Site Reliability Engineer (SRE).In this role, you will be responsible for ensuring the stability, scalability, and performance of our appl...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Oppo Us Research Center • Palo Alto, California, United States
    Full-time
    OPPO US Research Center is seeking a skilled and proactive.Site Reliability Engineer (SRE).In this role, you will be responsible for ensuring the stability, scalability, and performance of our appl...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Id.me • Mountain View, California, United States
    Full-time
    Consumers can verify their identity with ID.Over 152 million users experience streamlined login and identity verification with ID. More than 600+ consumer brands use ID.Commerce Department and is ap...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Paynearme • Cupertino, California, United States
    Remote
    Full-time
    At PayNearMe, we’re on a mission to make paying and getting paid as simple as possible.We build innovative technology that transforms the way businesses and their customers experience payments.Our ...Show more
    Last updated: 2 hours ago • Promoted • New!
    Sr. Site Reliability Engineer (SRE)

    Sr. Site Reliability Engineer (SRE)

    Avenue Code • Mountain View, CA, United States
    Full-time
    We’re seeking an experienced, highly collaborative SRE to partner with product teams and tackle our most critical infrastructure challenges. You’ll be hands-on in designing, building, and operating ...Show more
    Last updated: 11 days ago • Promoted