Talent.com
Staff Site Reliability Engineer (Federal)
Okta for Developers • Washington, DC, United States
4 days ago
Job type
  • Full-time
Job description

Get to know Okta

Okta is The World’s Identity Company. We free everyone to safely use any technology, anywhere, on any device or app. Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secure access, authentication, and automation, placing identity at the core of business security and growth.

We celebrate a variety of perspectives and experiences. We are looking for lifelong learners who can make us better with their unique experiences.

Join our team. We’re building a world where identity belongs to you.

We are looking for an experienced Site Reliability Engineer (SRE) to join Okta’s Technical Operations (TechOps) Team. At Okta, our motto is “Always On”, and nowhere do we embrace that more than in Technical Operations. We strive to build the most reliable and performant systems on the planet through the skillful use of automation. We’ve created an integrated system that securely connects anyone via any device to the technologies they need to do their most significant work.

This role is ideal for someone who enjoys working on large‑scale cloud production systems and takes pride in seeing them handle anything the internet can throw at them.

If you like to be challenged and have a passion for solving problems at scale with automation, testing, and tuning, we would love to hear from you. The ideal candidate exemplifies the ethic of “If you have to do something more than once, automate it” and can rapidly self‑educate on new concepts and tools. This engineer will be based in or near our Washington, D.C., office, with travel to customer sites.

This national security role necessitates a TOP SECRET / SCI security clearance, ideally with a full‑scope polygraph. It is a condition of employment that you obtain and continuously maintain this clearance to be eligible for access to classified information. Any inability to uphold these security standards may lead to termination.

Responsibilities

  • Design, build, and monitor Okta's production infrastructure
  • Respond to production incidents and determine preventive solutions
  • Troubleshoot complex reliability and performance issues
  • Automate manual processes, evolve our monitoring tools, and develop technical documentation
  • Support a high‑availability and large‑scale online environment as part of an on‑call rotation

Qualifications & Requirements

  • U.S. Citizen
  • Experience with Federal and DoD compliance requirements – FedRAMP, IL6
  • Background with Linux systems administration and strong scripting skills in Bash, Ruby, Python, Go, etc.
  • 3+ years of experience building and operating workloads orchestrated by Kubernetes
  • 3+ years of writing and debugging Helm values and charts
  • Experience supporting Docker containers and web applications running on Java / Apache / Tomcat in a live production environment
  • Strong expertise with production services in AWS (EC2, ECS, KMS, CloudWatch)
  • Previous experience automating systems and infrastructure via Terraform or CloudFormation
  • Solid understanding of networking concepts and IP protocols
  • Experience with multi‑cloud infrastructure is desired

Education and Training

  • B.S. Computer Science (plus) or relevant experience

By submitting an application, you agree to the retention of your personal data for consideration for a future position at Okta. More details about Okta’s privacy practices can be found at https://www.okta.com/privacy-policy.

Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and conviction records, consistent with applicable laws.

If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding, please use this form to request an accommodation.

Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Personnel and Job Candidate Privacy Notice at https://www.okta.com/legal/personnel-policy/.

Some roles may require travel to one of our office locations for in‑person onboarding.
