Talent.com
Site Reliability Engineer, Kubernetes Platform (Starshield)

Site Reliability Engineer, Kubernetes Platform (Starshield)

SpacexHawthorne, California, United States
30+ days ago
Job type
  • Full-time
  • Permanent
Job description

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.

SITE RELIABILITY ENGINEER, KUBERENTES PLATFORM (STARSHIELD)

At SpaceX we’re leveraging our experience in building rockets and spacecraft to deploy the Starshield constellation. Starshield is the world’s largest US government satellite constellation and is tasked with providing immediate access to critical intelligence and national security data for the US government anywhere on the globe. We design, build, test, and operate all parts of the system – receivers that allow users to connect within minutes, and the software that brings it all together. We’ve only begun to scratch the surface of Starshield's global impact and are looking for best-in-class engineers to help us further our ambitious goals.

As an engineer focused on Starshield's software and network infrastructure, you will design, operate and scale the infrastructure we use to run the world’s largest government satellite constellation. These positions cover a variety of areas ranging from Site Reliability Engineering, Developer Operations, and our internal Kubernetes platforms. You will develop automation to deploy and manage on-premise compute resources, create highly scalable and maintainable software products, and directly collaborate with engineering across the board.

RESPONSIBILITES :

  • Develop automation to deploy and manage on-premise Kubernetes clusters
  • Deploy and manage core infrastructure such as databases, monitoring and distributed storage
  • Closely collaborate with software engineers to create highly scalable, operable, and maintainable products
  • Engage in and improve the whole lifecycle of services from inception and design, through deployment, operation and refinement
  • Monitoring and alerting supporting systems to have high availability
  • Hands-on integration and troubleshooting across the entire Starshield stack
  • Identify areas for improvement and create innovative solutions that enable high system availability

BASIC QUALIFICATIONS :

  • Bachelor’s degree in computer science, information systems / IT, or an engineering discipline and 1+ years of professional experience in site reliability engineering or DevOps; OR 3+ years of professional experience in site reliability engineering or DevOps in lieu of a degree
  • 1+ years of professional experience with Linux operating systems
  • Experience with Terraform, Ansible, or other infrastructure tools
  • Experience with containerization technologies (i.e. OCI containers, Kubernetes)
  • Experience scripting in Bash, Python, or other similar languages
  • Development experience in Python, C++, or Go
  • PREFERRED SKILLS AND EXPERIENCE :

  • 1+ years of experience with Python and Python-based development frameworks
  • Experience managing Kubernetes clusters, not just using them
  • Knowledge of Linux boot process and systems configuration
  • Deep understanding of testing, continuous integration, build, deployment & continuous monitoring
  • Understanding of relevant build technologies, such as Bazel and Makefiles
  • Focus on performance bottlenecks and performance improvement techniques
  • Understanding of distributed databases and data modeling
  • Experience with automatically managing dozens, hundreds, or thousands of servers (eg : Terraform or Ansible)
  • Strong networking knowledge of TCP / IP
  • Excellent communications skills with the ability to communicate with customers, peers, management etc. in both formal and informal situations
  • ADDITIONAL REQUIREMENTS :

  • Note that an active clearance may provide the opportunity for you to work on sensitive SpaceX missions; if so, you will be subject to pre-employment drug and random drug and alcohol testing
  • Must be willing to work extended hours and weekends as needed
  • COMPENSATION AND BENEFITS :

    Pay range :

    Site Reliability Engineer / Level I : $120,000.00 - $145,000.00 / per year

    Site Reliability Engineer / Level II : $140,000.00 - $170,000.00 / per year

    Your actual level and base salary will be determined on a case-by-case basis and may vary based on the following considerations : job-related knowledge and skills, education, and experience.

    Base salary is just one part of your total rewards package at SpaceX. You may also be eligible for long-term incentives, in the form of company stock, stock options, or long-term cash awards, as well as potential discretionary bonuses and the ability to purchase additional stock at a discount through an Employee Stock Purchase Plan. You will also receive access to comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short and long-term disability insurance, life insurance, paid parental leave, and various other discounts and perks. You may also accrue 3 weeks of paid vacation and will be eligible for 10 or more paid holidays per year. Employees accrue paid sick leave pursuant to Company policy which satisfies or exceeds the accrual, carryover, and use requirements of the law.

    ITAR REQUIREMENTS :

  • To conform to U.S. Government export regulations, applicant must be a (i) U.S. citizen or national, (ii) U.S. lawful, permanent resident (aka green card holder), (iii) Refugee under 8 U.S.C. § 1157, or (iv) Asylee under 8 U.S.C. § 1158, or be eligible to obtain the required authorizations from the U.S. Department of State. Learn more about the ITAR here .
  • SpaceX is an Equal Opportunity Employer; employment with SpaceX is governed on the basis of merit, competence and qualifications and will not be influenced in any manner by race, color, religion, gender, national origin / ethnicity, veteran status, disability status, age, sexual orientation, gender identity, marital status, mental or physical disability or any other legally protected status.

    Applicants wishing to view a copy of SpaceX’s Affirmative Action Plan for veterans and individuals with disabilities, or applicants requiring reasonable accommodation to the application / interview process should reach out to  EEOCompliance@spacex.com .

    Create a job alert for this search

    Site Reliability Engineer • Hawthorne, California, United States

    Related jobs
    • Promoted
    Site Reliability Engineer, GNC (Falcon)

    Site Reliability Engineer, GNC (Falcon)

    SpacexHawthorne, California, United States
    Full-time +1
    SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technolo...Show moreLast updated: 30+ days ago
    • Promoted
    Data Center Site Reliability Engineer

    Data Center Site Reliability Engineer

    Expert Technology ServicesCulver, California, USA
    Full-time
    Please Note : As of July 22 2021 our team will require that all candidate submissions include a LinkedIn profile.Please do not submit any candidates that do not have a LinkedIn.Data Center Site Reli...Show moreLast updated: 2 days ago
    • Promoted
    Remote Entry-Level Currency Trader Job in Santa Clarita, CA | Full Time

    Remote Entry-Level Currency Trader Job in Santa Clarita, CA | Full Time

    Maverick CurrenciesSanta Clarita, CA
    Remote
    Full-time +1
    Top-ranked proprietary trading firm, Maverick Currencies, is searching for entrepreneurially-minded, profit-driven people to be trained in the art and science of proprietary trading in its online c...Show moreLast updated: 30+ days ago
    • Promoted
    Construction Project Manager

    Construction Project Manager

    Vaco by HighspringAltadena, California, United States
    Permanent
    Construction Project Manager – Construction (Retail & Commercial Focus).Long Beach (Hybrid – 3 days in office, 2 remote). Flexible, depending on project needs (occasional early mornings or weekends)...Show moreLast updated: 30+ days ago
    • Promoted
    Account Service Manager

    Account Service Manager

    Vaco by HighspringAltadena, California, United States
    Temporary
    Account Services Manager (Contract).El Segundo, CA (Hybrid : 2–3 days onsite per week).Month Contract (Coverage for Maternity Leave). A growing cosmetics and skincare brand is seeking an experienced....Show moreLast updated: 18 days ago
    • Promoted
    Purchasing Clerk

    Purchasing Clerk

    Vaco by HighspringAltadena, California, United States
    Temporary
    Monrovia, CA (Onsite, 5 days / week).We are seeking a highly detail-oriented.This role is heavily focused on.You will work closely with the Floor and Warehouse Capacity Manager to maintain data integ...Show moreLast updated: 1 day ago
    • Promoted
    Software Reliability Engineer Sr.-Hybrid

    Software Reliability Engineer Sr.-Hybrid

    Logix Federal Credit UnionValencia, CA, United States
    Full-time
    Software Reliability Engineer Sr.Software Reliability Engineer Sr.Using your expertise in coding, algorithms, infrastructure, and high-level problem solving manage the complex challenges of meeting...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer, Hardware and Infrastructure (Starshield)

    Site Reliability Engineer, Hardware and Infrastructure (Starshield)

    SpacexHawthorne, California, United States
    Full-time +1
    SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technolo...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer (SRE)

    Senior Site Reliability Engineer (SRE)

    StubhubLos Angeles, California, United States
    Full-time
    StubHub is on a mission to redefine the live event experience on a global scale.Whether someone is looking to attend their first event or their hundredth, we’re here to delight them all the way fro...Show moreLast updated: 30+ days ago
    • Promoted
    Purchasing Manager

    Purchasing Manager

    Vaco by HighspringAltadena, California, United States
    Full-time
    Purchasing Manager (Temp / Temp-to-Hire – Medical Leave Coverage).Culver City, CA (Fully Onsite, Monday–Friday 8am–5pm).Our client, a leading construction company, is seeking a Purchasing Manager to ...Show moreLast updated: 1 day ago
    • Promoted
    Remote Crypto Trader Job in Santa Clarita, CA | Part Time

    Remote Crypto Trader Job in Santa Clarita, CA | Part Time

    Maverick CurrenciesSanta Clarita, CA
    Remote
    Full-time +1
    Top-ranked proprietary trading firm, Maverick Currencies, is searching for entrepreneurially-minded, profit-driven people to be trained in the art and science of proprietary trading in its online c...Show moreLast updated: 30+ days ago
    • Promoted
    Lead Site Reliability Engineer - Federal Team

    Lead Site Reliability Engineer - Federal Team

    SaviyntLos Angeles, California, United States
    Full-time
    Saviynt is an identity authority platform built to power and protect the world at work.In a world of digital transformation, where organizations are faced with increasing cyber risk but cannot affo...Show moreLast updated: 30+ days ago
    • Promoted
    Deployment, Site Reliability Engineer - Connected Warfare

    Deployment, Site Reliability Engineer - Connected Warfare

    Anduril IndustriesCosta Mesa, California, United States
    Full-time
    Anduril Industries is a defense technology company with a mission to transform U.By bringing the expertise, technology, and business model of the 21st century’s most innovative companies to the def...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Anduril IndustriesCosta Mesa, California, United States
    Full-time
    Mission Software Engineering Site Reliability Engineer (SRE) installs, connects and maintains Anduril’s software to deliver mission-critical capabilities to our customers.Site Reliability Engineers...Show moreLast updated: 30+ days ago
    • Promoted
    Remote Crypto Trader Job in Santa Clarita, CA | Full Time

    Remote Crypto Trader Job in Santa Clarita, CA | Full Time

    Maverick CurrenciesSanta Clarita, CA
    Remote
    Full-time +1
    Top-ranked proprietary trading firm, Maverick Currencies, is searching for entrepreneurially-minded, profit-driven people to be trained in the art and science of proprietary trading in its online c...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer (Special Programs)

    Site Reliability Engineer (Special Programs)

    SpacexHawthorne, California, United States
    Full-time +1
    SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technolo...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Anduril IndustriesCosta Mesa, California, United States
    Full-time
    The Business Systems team is responsible for building and improving the many systems that enables Anduril to accomplish its mission. Anduril’s supply chain, accounting, sales & growth, engineering, ...Show moreLast updated: 30+ days ago
    • Promoted
    Remote Entry-Level Currency Trader Job in Santa Clarita, CA | Part Time

    Remote Entry-Level Currency Trader Job in Santa Clarita, CA | Part Time

    Maverick CurrenciesSanta Clarita, CA
    Remote
    Full-time +1
    Top-ranked proprietary trading firm, Maverick Currencies, is searching for entrepreneurially-minded, profit-driven people to be trained in the art and science of proprietary trading in its online c...Show moreLast updated: 30+ days ago