Talent.com
Senior Site Reliability Engineer, Arlington
Senior Site Reliability Engineer, ArlingtonOnebrief • Remote, Remote, United States
Senior Site Reliability Engineer, Arlington

Senior Site Reliability Engineer, Arlington

Onebrief • Remote, Remote, United States
Hace 4 días
Tipo de contrato
  • A tiempo completo
Descripción del trabajo

About Onebrief

Onebrief is collaboration and AI-powered workflow software designed specifically for military staffs. By transforming this work, Onebrief makes the staff as a whole superhuman - meaning faster, smarter, and more efficient.

We take ownership, seek excellence, and play to win with the seriousness and camaraderie of an Olympic team. Onebrief operates as an all-remote company, though many of our employees work alongside our customers at military commands around the world.

Founded in 2019 by a group of experienced planners, today, Onebrief’s team spans veterans from all forces and global organizations, and technologists from leading-edge software companies. We’ve raised $123m+ from top-tier investors, including Battery Ventures, General Catalyst, Insight Partners, and Human Capital, and today, Onebrief is valued at $1.1B. With this continued growth, Onebrief is able to make an impact where it matters most.

Security Clearance, Location, and Onsite Notice :

This role requires regularly working on-site at customer locations in Arlington, VA.

If you are not currently within commuting distance, you must be willing to relocate (note that Onebrief will provide relocation assistance).

Active Top Secret Clearance required with the ability to obtain SCI eligibility.

About The Role

We are hiring a Site Reliability Engineer to join our Infrastructure & Security team. You’ll work closely with fellow SREs, security, and customer success.

You will be the first line of support for our mission critical deployments, and responsible for ensuring best-in-class service quality and issue resolution. You will work in both on-premise DoD environments and AWS cloud environments. Your lessons from the field will shape how our team works, from policy to implementation.

In addition to working at the customer, you will contribute directly to solutions that increase stability, performance, and security of our deployments, and improve the overall experience of deploying and managing Onebrief on premise.

About You

You are a force multiplier who views reliability as the most critical feature of any application and / or platform and believe that "reliability beats novelty." You see infrastructure and operability as a product to be automated, documented, and continuously improved, always leaving systems easier to operate than you found them.

You are equally comfortable leading a post-incident review, designing SLOs in a system design session, or diving into a

kubectl

shell to triage a complex production issue. You don't just fix problems; you translate constraints and failure modes into clear, automated guardrails and scalable, resilient architecture. For you, robust monitoring, actionable alerting, and insightful runbooks are core parts of the engineering process, not afterthoughts.

You mentor others, fostering a culture of blameless postmortems and proactive reliability. You collaborate naturally with application and platform teams, helping them move quickly but safely by building the tools, processes, and observability that make "fast recovery" a reality.

What You'll Do

You'll own the reliability, scalability, and security of the production application and / or platform. You will do this by :

Building a World-Class Observability Platform : Design, implement, and manage our monitoring, logging, and alerting stack (e.g., Prometheus, Loki, Alloy, and Grafana). You won't just track metrics; you'll create the actionable insights and automated alerting that allow teams to identify and resolve issues before they impact users.

Defining and Upholding Reliability : Define, measure, and own alerting that feeds into our Service Level Objectives (SLOs) and increases trust internally and externally. You will be the organization's expert on what it means for our systems to be reliable and how to measure it.

Leading Incident Response : Act as the incident responder and potentially incident commander during critical incidents You will lead blameless post-mortems / After Action Reviews (AARs) that identify true root causes and drive automated, long-term solutions to prevent recurrence.

Automating for Scale and Security : Partner with platform engineers to design, build, and manage secure, resilient Kubernetes clusters and cloud / on-prem environments using Infrastructure-as-Code (Terraform, Ansible). You will embed security and compliance controls (RMF, STIGs) directly into this automation.

Eliminating Toil and Scaling the Team : Proactively identify and eliminate operational toil by building automation. You will act as a force multiplier by advising other teams on best practices in air-gapped environments and production readiness.

What We Look For

3 years of experience in Site Reliability Engineering or a related field, with firsthand experience managing mission-critical systems within DoD’s air-gapped environments

An active Top Secret security clearance. U.S. citizenship required.

Experience automating software delivery, deployment, and providing documentation and self-service tools for engineering teams and customers.

A strong understanding of Linux, containerization and orchestration, and virtual machines

Experience with centralized logging, metrics, and observability using tools such as Prometheus, Loki, Grafana, ELK stack, or Datadog.

Networking fundamentals : core protocols and secure configurations.

A deep understanding of incident response processes, with experience conducting thorough root cause analyses and driving continuous improvement

Clear, concise writing; strong documentation habits and async communication.

Core skills and technologies : VMWare, Kubernetes, Docker, Helm, Ansible, Terraform, Linux, AWS, DoD compliance, Monitoring and Observability tools, AWS.

Bonus points (nice to have)

Experience with compliance frameworks (RMF, STIGs / SRGs, ICD 503).

Security‑minded design for air-gapped environments.

Active Security+ or another DoD 8570.01-approved security credential, or the ability to obtain the valid credentials within 3 months of employment.

Notice to Third Party Recruitment Agencies

Please note that Onebrief does not accept unsolicited resumes from recruiters or employment agencies. In the absence of an executed Recruitment Services Agreement, there will be no obligation to any referral compensation or recruiter fee. In the event a recruiter or agency submits a resume or candidate without an agreement Onebrief explicitly reserves the right to pursue and hire those candidate(s) without any financial obligation to the recruiter or agency. Any unsolicited resumes, including those submitted to hiring managers, shall be deemed the property of Onebrief.

Crear una alerta de empleo para esta búsqueda

Senior Site Reliability Engineer • Remote, Remote, United States

Ofertas relacionadas
Test Products from Home – $25-$45 / hr + Freebies

Test Products from Home – $25-$45 / hr + Freebies

OCPA • Nowata, Oklahoma, us
A tiempo parcial +1
Product Testers are wanted to work from home nationwide in the US to fulfill upcoming contracts with national and international companies. We guarantee 15-25 hours per week with an hourly pay of bet...Mostrar más
Última actualización: hace más de 30 días • Oferta promocionada
Remote Product Tester – $45 / hr + Free Products – Start Now!

Remote Product Tester – $45 / hr + Free Products – Start Now!

OCPA • Nowata, Oklahoma, us
Teletrabajo
A tiempo parcial +1
Product Testers are wanted to work from home nationwide in the US to fulfill upcoming contracts with national and international companies. We guarantee 15-25 hours per week with an hourly pay of bet...Mostrar más
Última actualización: hace más de 30 días • Oferta promocionada
Remote Side Hustle Developer

Remote Side Hustle Developer

Finance Buzz • Nowata, Oklahoma, US
Teletrabajo
A tiempo completo +1
This position is for individuals who want to develop a side income stream while still working full time.You will test different small-scale remote opportunities, learn what works, and grow what pro...Mostrar más
Última actualización: hace más de 30 días • Oferta promocionada
Customer Service

Customer Service

Lowes • Nowata, OK, United States
A tiempo completo
Job Title : Customer Service Representative.As a Customer Service Representative at Lowe’s, you will be responsible for providing exceptional customer service by assisting customers with inquiries, ...Mostrar más
Última actualización: hace 23 días • Oferta promocionada
Structural Support Engineer

Structural Support Engineer

KMS Solutions, LLC • Independence, KS, US
A tiempo completo
KMS Solutions, LLC is a technical management / solutions company that specializes in engineering, analysis, and cyber security. Founded in 2005, KMS is a certified small business with nearly two dec...Mostrar más
Última actualización: hace 1 día • Oferta promocionada
Composite Lead Tech (PIC)

Composite Lead Tech (PIC)

Renewable Concepts • Neodesha, KS, US
A tiempo completo
Quick Apply
To perform the job successfully, the individual must be able to perform each essential duty satisfactorily.The composites season is short : RCL values its composite techs and will utilize our desiri...Mostrar más
Última actualización: hace más de 30 días
Customer Service Rep (06380) - 100 N 25th, Independence, KS

Customer Service Rep (06380) - 100 N 25th, Independence, KS

Domino's Pizza • Independence, KS, US
A tiempo completo
Independence, Kansas, D & E PIZZA, LLC.You got game? You got spring in your step? You want the best job in the world! And schedules that work with you, not against you? That's right, we live to bea...Mostrar más
Última actualización: hace 17 días • Oferta promocionada
Side Hustle Project Lead

Side Hustle Project Lead

Finance Buzz • Nowata, Oklahoma, US
A tiempo completo +1
We’re offering a role for someone who wants to lead their own side-income project in their spare time.You’ll explore various proven side hustles, select the ones that fit your lifestyle, and run th...Mostrar más
Última actualización: hace más de 30 días • Oferta promocionada
Submarine Electronics

Submarine Electronics

U.S. Navy • Nowata County, OK, United States
A tiempo completo
ABOUT The most secretive of Navy vessels, a submarine requires a select community of specially trained professionals to operate its classified, highly advanced hardware. The Sailors in the Submarine...Mostrar más
Última actualización: hace 17 días • Oferta promocionada
Rehab - Occupational Therapist

Rehab - Occupational Therapist

Wilson County Hospital • Neodesha, KS, USA
A tiempo completo
Quick Apply
The Occupational Therapist is responsible for evaluation, planning, directing, and administering occupational therapy modalities of treatment. Administers treatments and physical agents as prescribe...Mostrar más
Última actualización: hace más de 30 días
Electronics Engineering

Electronics Engineering

U.S. Navy • Nowata County, OK, United States
A tiempo completo
ABOUT The most secretive of Navy vessels, a submarine requires a select community of specially trained professionals to operate its classified, highly advanced hardware. The Sailors in the Submarine...Mostrar más
Última actualización: hace 22 días • Oferta promocionada
Travel Inpatient Rehabilitation Therapist - $2,427 per week

Travel Inpatient Rehabilitation Therapist - $2,427 per week

TRS Healthcare • Neodesha, KS, United States
A tiempo completo
TRS Healthcare is seeking a travel Rehabilitation Therapist for a travel job in Neodesha, Kansas.Job Description & Requirements. TRS Healthcare Job ID #1329416.Pay package is based on 8 hour shifts ...Mostrar más
Última actualización: hace 3 días • Oferta promocionada
Air Interdiction Agent

Air Interdiction Agent

U.S. Customs and Border Protection • Cherryvale, KS, US
A tiempo completo
Pilot CBP Air Interdiction Agent.Air and Marine Operations (AMO), a component of U.Customs and Border Protection (CBP), offers skilled Pilots interested in law enforcement an opportunity to work wi...Mostrar más
Última actualización: hace más de 30 días • Oferta promocionada
County Meter Reader

County Meter Reader

Meter Reader • Nowata, OK
A tiempo completo
Responsibilities The primary responsibility of this position is to read meters and record consumption of the water used, cleaning of meter boxes, and removal of vegetation impeding access to meters...Mostrar más
Última actualización: hace 24 días • Oferta promocionada
Wireless Sales Pro

Wireless Sales Pro

Kansas Staffing • Independence, KS, US
A tiempo completo
Premium operates wireless locations in over 1,300 wireless retail outlets via Walmart Supercenter, with a dedicated sales team of over 3,200 brand representatives. As one of Premium's Wireless Sales...Mostrar más
Última actualización: hace 6 días • Oferta promocionada
Residential Support Leader

Residential Support Leader

HOME OF HOPE, INC. • Delaware, OK, United States
A tiempo completo
Coordinates and manages program operations for assigned ICF / IID homes while ensuring compliance with regulatory requirements. Responsible for orientating, training, and scheduling house staff.Will c...Mostrar más
Última actualización: hace 9 días • Oferta promocionada
2026 Internship - Operations - Independence, KS

2026 Internship - Operations - Independence, KS

Textron • Independence, KS, US
Prácticas
The Operation Intern involves planning, organizing, and controlling manpower cost, continuity of production flow according to schedules, and the maintenance and improvement of quality, safety, and ...Mostrar más
Última actualización: hace 27 días • Oferta promocionada
Entry Level Remote Recruiter

Entry Level Remote Recruiter

Gpac • Independence, Kansas, United States
Teletrabajo
A tiempo completo
Quick Apply
Gpac has grown from a small, family-owned company into a top recruiting firm in the nation over our 30 years in business. With a reputation for excellence, we are dedicated to providing the best ser...Mostrar más
Última actualización: hace 29 días