Talent.com
Sr. Site Reliability Engineer, Data (Application Software)

SpaceX, Hawthorne, California, United States
30+ days ago
Job type
  • Full-time
  • Permanent
Job description

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.

SR. SITE RELIABILITY ENGINEER, DATA (APPLICATION SOFTWARE)

The application software team is the central nervous system of SpaceX – we create mission-critical applications that are used throughout SpaceX to accelerate launch vehicle production and flight, as well as systems that allow Starlink to grow into a worldwide fast, reliable Internet service. Our missions support scientific research, classified national security space, and commercial opportunities. Software engineering and innovation are at the core of these programs.

Our team is currently creating and evolving systems to enable rapid build and reuse of Starship, as well as scaling the Starlink network. We have built systems to support concurrent streams of data from numerous always-on assets, enabling the management of the world’s largest satellite constellation and the world’s largest rocket. We work directly with engineers across all programs to enable and accelerate the success of Starship, Starlink, and Starshield.

Aerospace experience is not required for success here - rather, we look for smart, motivated, and collaborative site reliability engineers who love solving problems and want to make an impact on a truly inspiring mission. You will have full ownership of challenging problems, working with a team of enthusiastic engineers to design and produce solutions that enable SpaceX to move towards our goals at a rapid pace. The success of the missions at SpaceX depends on the software that you and your team produce.

RESPONSIBILITIES :

  • Upgrade existing distributed systems to become sharded and geo-redundant in multiple data centers
  • Advance existing deployment, monitoring, and alerting infrastructure to support a multi-region environment
  • Manage petabyte scale bare metal compute clusters
  • Closely collaborate with engineers across all programs to create highly operable, scalable, and maintainable products
  • Engage throughout the whole software development lifecycle of services from inception to design, deployment, operation, and iterative refinement
  • Focus on performance bottlenecks and performance improvement techniques

BASIC QUALIFICATIONS :

  • Bachelor's degree in computer science, engineering, math, or scientific discipline and 5 years of software development experience; OR 7+ years of professional experience building software with site reliability or DevOps in lieu of a degree
  • Experience with Linux operating systems

PREFERRED SKILLS AND EXPERIENCE :

  • 5+ years of rigorous experience with site reliability or DevOps
  • Experience with Kubernetes and Istio for on-premise deployment
  • Experience with in-stream data processing and analytics using open-source platforms such as Apache Kafka, Spark, HBase, HDFS, and Flink
  • Experience troubleshooting hardware and network-layer issues
  • Programming experience in Python, C#, Java, Scala, Go or similar languages
  • Good understanding of version control, testing, continuous integration, build, deployment and monitoring

ADDITIONAL REQUIREMENTS :

  • Willing to work extended hours and weekends when needed

COMPENSATION AND BENEFITS :

Pay Range :

Software Engineer / Senior : $160,000.00 - $220,000.00 per year

Your actual level and base salary will be determined on a case-by-case basis and may vary based on the following considerations: job-related knowledge and skills, education, and experience.

Base salary is just one part of your total rewards package at SpaceX. You may also be eligible for long-term incentives, in the form of company stock, stock options, or long-term cash awards, as well as potential discretionary bonuses and the ability to purchase additional stock at a discount through an Employee Stock Purchase Plan. You will also receive access to comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, paid parental leave, and various other discounts and perks. You may also accrue 3 weeks of paid vacation & will be eligible for 10 or more paid holidays per year. Employees accrue paid sick leave pursuant to Company policy which satisfies or exceeds the accrual, carryover, and use requirements of the law.

ITAR REQUIREMENTS :

  • To conform to U.S. Government export regulations, applicant must be a (i) U.S. citizen or national, (ii) U.S. lawful, permanent resident (aka green card holder), (iii) Refugee under 8 U.S.C. § 1157, or (iv) Asylee under 8 U.S.C. § 1158, or be eligible to obtain the required authorizations from the U.S. Department of State. Learn more about the ITAR here.

SpaceX is an Equal Opportunity Employer; employment with SpaceX is governed on the basis of merit, competence and qualifications and will not be influenced in any manner by race, color, religion, gender, national origin / ethnicity, veteran status, disability status, age, sexual orientation, gender identity, marital status, mental or physical disability or any other legally protected status.

Applicants wishing to view a copy of SpaceX’s Affirmative Action Plan for veterans and individuals with disabilities, or applicants requiring reasonable accommodation to the application / interview process should reach out to EEOCompliance@spacex.com.
