Site Reliability Engineer, GNC (Falcon)

SpaceX • Inglewood, CA, United States
2 days ago
Job type
  • Full-time
Job description

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.

SpaceX is looking for a Site Reliability Engineer to operate and scale custom-built, mission-critical products for Guidance, Navigation, and Control (GNC). The GNC team performs trajectory design and vehicle simulation and participates in recurring mission-critical launch operations. This position will work with the GNC team to maintain and improve a set of GNC-focused tools. Examples of these products include Monte Carlo simulations on a high-performance computing cluster, automated data analysis systems, continuous integration systems for rocket and simulation software, GNC analysis infrastructure, and vehicle configuration verification tools. The ideal candidate will be flexible, possess broad skills across product operations and software development, and flourish in a fast-paced and challenging environment.

Responsibilities:

  • Deploy, upgrade, operate, maintain, and scale a suite of mission-critical GNC products and services
  • Provision and maintain virtual and physical servers
  • Work with SpaceX HPC team to monitor and maintain a 4000+ thread HPC cluster
  • Closely collaborate with GNC software engineers to create highly operable and maintainable products
  • Add monitoring for web apps and respond to outages
  • Manage the underlying computational infrastructure of GNC in collaboration with IT
  • Engage in and improve the whole lifecycle of services, from inception and design through deployment, operation, and refinement
  • Make recommendations for future hardware purchases
  • Practice sustainable incident response and postmortems
  • Provide end-user support to GNC engineering by becoming an expert on analysis applications, helping users troubleshoot, and pointing them to relevant features
  • Configure automated deployment pipelines for web apps
  • Develop or improve GNC web apps and tools for better usability, maintainability, and robustness
  • Demo and document new software changes such as operating system upgrades, shared filesystem changes, or major tool rollouts
  • Focus on performance bottlenecks and performance improvement techniques

Basic Qualifications:

  • Bachelor's degree in computer science, information systems/IT, engineering, math, or a scientific discipline and 2+ years of software development experience, OR 4+ years of professional experience building software with site reliability or DevOps in lieu of a degree
  • Experience with Linux operating systems
  • Experience with Python and Python-based development frameworks

Preferred Skills and Experience:

  • 2+ years of systems administration, site reliability engineering, or DevOps experience
  • 2+ years of experience with Python and Python-based development frameworks
  • 2+ years of Linux experience
  • Expertise with Docker, Vagrant, and Kubernetes or similar technologies
  • Extensive experience with configuration management tools such as Ansible, Puppet, or Terraform
  • Experience with build systems (Make, Bazel / Pants / Buck, Gradle) and package management tools (pip, npm)
  • Strong understanding of virtualization and hypervisor technologies
  • Understanding of databases and data modeling
  • Experience with automatically managing dozens or hundreds of servers
  • Strong networking knowledge of TCP/IP
  • Experience scaling web applications and optimizing applications for performance
  • Professional experience with standard front-end technologies like modern HTML, CSS, JavaScript (we use AngularJS, Polymer, Backbone.js, React, and more), REST, JSON
  • Solid understanding of UI/UX design to provide intuitive applications
  • Experience with high-performance computing systems or large-scale data analysis systems
  • Must be comfortable working with mission-critical and sensitive systems, with a sense of urgency appropriate to the responsibilities

Additional Requirements:

  • Willing to work extended hours and weekends when needed to meet critical deadlines

Compensation and Benefits:

    Pay Range:
    Site Reliability Engineer / Level I: $120,000.00 - $145,000.00 per year
    Site Reliability Engineer / Level II: $140,000.00 - $170,000.00 per year

    Your actual level and base salary will be determined on a case-by-case basis and may vary based on the following considerations: job-related knowledge and skills, education, and experience.

    Base salary is just one part of your total rewards package at SpaceX. You may also be eligible for long-term incentives, in the form of company stock, stock options, or long-term cash awards, as well as potential discretionary bonuses and the ability to purchase additional stock at a discount through an Employee Stock Purchase Plan. You will also receive access to comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short- and long-term disability insurance, life insurance, paid parental leave, and various other discounts and perks. You may also accrue 3 weeks of paid vacation and will be eligible for 10 or more paid holidays per year. Employees accrue paid sick leave pursuant to Company policy, which satisfies or exceeds the accrual, carryover, and use requirements of the law.
