Talent.com
Software Engineer - Infrastructure/Supercomputing
Software Engineer - Infrastructure/SupercomputingXai • Palo Alto, CA, United States
Software Engineer - Infrastructure / Supercomputing

Software Engineer - Infrastructure / Supercomputing

Xai • Palo Alto, CA, United States
30+ days ago
Job type
  • Full-time
Job description

About xAI

xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company's mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

Tech Stack

  • Kubernetes
  • Pulumi
  • Rust and Go
  • Flux / ArgoCD

Location

The role is based in the Bay Area [San Francisco and Palo Alto]. Candidates are expected to be located near the Bay Area or open to relocation.

Focus

  • Operating some of the world's largest GPU supercomputing clusters for both AI training and serving production models.
  • Implement IaC best practices, enhancing deployment pipelines, and ensuring robust, secure service delivery across our production environments.
  • Working with both on-premise clusters and cloud providers.
  • Help with security best practices for internal researchers and live external traffic.
  • Ideal Experiences

  • Writing scalable and highly available containerized applications in Rust.
  • Managing compute fleets with Pulumi, Terraform, Ansible, or other stateful automation libraries.
  • Interview Process

    After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to an initial interview (45 minutes - 1 hour) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews :

  • Coding assessment in a language of your choice.
  • Systems design : Translate high-level requirements into a scalable, fault-tolerant service.
  • Systems hands-on : Demonstrate practical skills in a live problem-solving session.
  • Project deep-dive : Present your past exceptional work to a small audience.
  • Meet and greet with the wider team.
  • Our goal is to finish the main process within one week. All interviews will be conducted via Google Meet.

    Annual Salary Range

    $180,000 - $370,000 USD

    Benefits

    Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

    xAI is an equal opportunity employer.

    California Consumer Privacy Act (CCPA) Notice

    Create a job alert for this search

    Software Engineer • Palo Alto, CA, United States

    Related jobs
    Staff Infrastructure / DevOps Engineer

    Staff Infrastructure / DevOps Engineer

    Gatik Ai • Mountain View, California, United States
    Full-time
    Gatik, the leader in autonomous middle-mile logistics, is revolutionizing the B2B supply chain with its autonomous transportation-as-a-service (ATaaS) solution and prioritizing safe, consistent del...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer - Developer Infrastructure

    Software Engineer - Developer Infrastructure

    Applied Intuition • Mountain View, California, United States
    Full-time
    Applied Intuition is the vehicle intelligence company that accelerates the global adoption of safe, AI-driven machines.Founded in 2017, Applied Intuition delivers the toolchain, Vehicle OS, and aut...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer - AI Agent Infrastructure (Healthcare)

    Software Engineer - AI Agent Infrastructure (Healthcare)

    Honey Health • Hayward, CA, United States
    Full-time
    Honey Health is the all-in-one AI back office for primary and specialty care.Our AI agents autonomously handle core back-office jobs, such as aggregating patient data, processing orders and prescri...Show more
    Last updated: 12 days ago • Promoted
    Senior Software Engineer - AI Agent Infrastructure (Healthcare)

    Senior Software Engineer - AI Agent Infrastructure (Healthcare)

    Honey Health • Fremont, CA, United States
    Full-time
    Honey Health is the all-in-one AI back office for primary and specialty care.Our AI agents autonomously handle core back-office jobs, such as aggregating patients data, processing orders and prescr...Show more
    Last updated: 10 days ago • Promoted
    Senior Infrastructure Engineer (Core Infra, US)

    Senior Infrastructure Engineer (Core Infra, US)

    Workato • Palo Alto, California, United States
    Full-time
    Workato transforms technology complexity into business opportunity.As the leader in enterprise orchestration, Workato helps businesses globally streamline operations by connecting data, processes, ...Show more
    Last updated: 4 days ago • Promoted
    Software Engineer (Infrastructure)

    Software Engineer (Infrastructure)

    Column • San Francisco, California, United States
    Full-time
    For companies building financial technology and transforming the financial services space, the biggest bottleneck to their growth and innovation is often the underlying banks and infrastructure sta...Show more
    Last updated: 30+ days ago • Promoted
    Software Infrastructure & Platform Engineer

    Software Infrastructure & Platform Engineer

    PsiQuantum • Palo Alto, CA, United States
    Full-time
    Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show more
    Last updated: 30+ days ago • Promoted
    Cloud Infrastructure Engineer

    Cloud Infrastructure Engineer

    Florvets Structures • San Francisco, California, United States
    Remote
    Full-time +1
    Position : Cloud Infrastructure Engineer.Florvets Structures is a leading construction and engineering company based in San Francisco, California. We specialize in building innovative and sustainable...Show more
    Last updated: 30+ days ago • Promoted
    Senior Infrastructure Engineer - InfraOps

    Senior Infrastructure Engineer - InfraOps

    Bitgo • Palo Alto, California, United States
    Full-time
    BitGo is the leading infrastructure provider of digital asset solutions, delivering custody, wallets, staking, trading, financing, and settlement services from regulated cold storage.Since our foun...Show more
    Last updated: 4 days ago • Promoted
    MTS, Infrastructure Engineer

    MTS, Infrastructure Engineer

    Delphina • San Francisco, California, United States
    Full-time
    Today’s Data Scientists are in pain - spending their time manually wrangling data, building models through slow trial and error, taking on painstaking rewrites for deployment, and dealing with coun...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, Machine Learning Infrastructure

    Software Engineer, Machine Learning Infrastructure

    Datologyai • Redwood City, California, United States
    Full-time
    Companies want to train their own large models on their own data.The current industry standard is to train on a random sample of your data, which is inefficient at best and actively harmful to mode...Show more
    Last updated: 30+ days ago • Promoted
    Senior Infrastructure Linux & DevOps Engineer

    Senior Infrastructure Linux & DevOps Engineer

    Matrix Precise, Inc. • Pleasanton, California, United States
    Full-time
    Infra Linux Engineer’s primary function will be to advance the infrastructure team from a traditional infrastructure methodology to an infrastructure as code approach. You will be responsible for ma...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, Infrastructure

    Software Engineer, Infrastructure

    Matroid • Palo Alto, California, United States
    Full-time
    Matroid is a full-service computer vision company that has developed an end-to-end platform allowing enterprise customers to rapidly train and. EO, IR, X-Ray, CT, OCT, and others.Founded in 2016 by ...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, Infrastructure

    Software Engineer, Infrastructure

    Orb • San Francisco, California, United States
    Full-time
    Orb is on a mission to revolutionize billing infrastructure for the modern era of AI and software.We empower businesses to align their monetization with product usage—whether through seats, consump...Show more
    Last updated: 30+ days ago • Promoted
    Infrastructure Software Engineer, Public Sector

    Infrastructure Software Engineer, Public Sector

    Scale AI, Inc. • San Francisco, CA, United States
    Full-time
    Scale AI is seeking a highly skilled and motivated.Software Engineer, AI Infrastructure & Security.Public Sector Engineering team. As a part of this team, you will play a critical role in delivering...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, Infrastructure

    Software Engineer, Infrastructure

    Zip • San Francisco, California, United States
    Full-time
    Our co-founders started Zip in 2020 to address this seemingly intractable problem with a purpose-built platform that provides a simple, consumer-grade user experience. Within just a few short years,...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, Infrastructure

    Software Engineer, Infrastructure

    Harvey • San Francisco, California, United States
    Full-time
    Harvey is a secure AI platform for legal and professional services that augments productivity and automates complex workflows. Harvey uses algorithms with reasoning-adept LLMs that have been customi...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer - Cloud Infrastructure

    Software Engineer - Cloud Infrastructure

    Specter • San Francisco, California, United States
    Full-time
    Specter is creating a software-defined “control plane” for the physical world.We are starting with protecting American businesses by granting them ubiquitous perception over their physical assets.T...Show more
    Last updated: 2 days ago • Promoted