Talent.com
Senior Technology Site Reliability Engineer
Senior Technology Site Reliability EngineerCooley LLP • Washington, DC, United States
Senior Technology Site Reliability Engineer

Senior Technology Site Reliability Engineer

Cooley LLP • Washington, DC, United States
1 day ago
Job type
  • Full-time
Job description

Senior Technology Site Reliability Engineer

Cooley is seeking a Senior Site Reliability Engineer to join the Infrastructure & Development Operations team.

Position summary : The Senior Technology Site Reliability Engineer("SRE") is responsible for ensuring the reliability, scalability, and performance of the firm's critical infrastructure and applications. The SREblends software engineering with systems engineering to build and maintain automated, resilient, and observable systems that support high availability and operational excellence. In addition to being technically advanced, the SRE will have a high degree of emotional intelligence and the ability to work as a team towards complex and layered objectives. Specific duties and responsibilities include, but are not limited to, the following :

Position responsibilities :

  • Monitor and maintain production systems to ensure high availability and performance
  • Implement and manage service-level indicators (SLIs), objectives (SLO's), agreements (SLA's), and error budgets
  • Participate in on-call rotations and incident response, including root cause analysis and postmortems
  • Develop and maintain infrastructure as code (IaC) using Terraform
  • Automate deployment, scaling, and recovery processes to reduce manual intervention
  • Partner with DevOps to build and maintain CI / CD pipelines to support safe and efficient software delivery
  • Implement observability solutions using metrics, logs, traces, and alerting systems (Prometheus, Grafana, DataDog, etc.)
  • Proactively identify and resolve system bottlenecks and reliability risks
  • Work closely with Infrastructure, DevOps, Development, and security teams to embed reliability into the development lifecycle
  • Contribute to a culture of blameless post-mortems and continuous improvement
  • Document operational procedures and share knowledge across teams
  • All other duties as assigned or required

Skills and experience :

Required :

  • After orientation at Cooley LLP, exhibit proficiency in the Microsoft Office suite, iManage and other firm applications
  • Ability to work extended and / or weekend hours, as required
  • Ability to travel, as required
  • 6+ years direct applicable experience (e.g. site reliability engineering or related field)
  • Proficiency in Terraform and programming languages such as Python, Go, or Java
  • Deep expertise in cloud platforms, particularly AWS, and container orchestration
  • Strong background in distributed systems, performance tuning, and automation
  • Hands-on experience with configuration management tools such as Puppet, Chef, or Salt
  • Preferred :

  • Bachelor's Degree in Computer Science, Information Technology, Engineering, or associated discipline
  • Experience working with advanced ETL data workflows including technologies such as AWS EMR, Azure Synapse, Azure Data Factory, or Apache Hive / Spark / Airflow
  • Experience with IaC deployment of AKS / EKS / GKE architecture
  • Experience with enterprise Data Lake environments using technologies such as DataBricks or Snowflake
  • Competencies :

  • Expert analytical / quantitative, problem-solving, and deductive reasoning skills, experience performing advanced troubleshooting and root cause analysis of complex technical issues
  • Excellent organizational, planning, and time management skills and ability to work independently and in a team environment to manage competing priorities and meet deadlines
  • Advanced verbal and written communication skills with the ability to present findings, conclusions, alternatives, and information clearly and concisely
  • Experience working with all levels of business professionals, management, stakeholders, and vendors with the ability to build effective relationships through trust and diplomacy
  • Cooley offers a competitive compensation and excellent benefits package and is committed to fair and equitable employment practices.

    EOE.

    The expected annual pay range for this position with a full-time schedule is $140,000 - $205,000. Please note that final offer amount will be dependent on geographic location, applicable experience and skillset of the candidate.

    We offer a full range of elective benefits including medical, health savings account (with applicable medical plan), dental, vision, health and / or dependent care flexible spending accounts, pre-tax commuter benefits, life insurance, AD&D, long-term care coverage, backup care for children and / or adults and other parental support benefits. In addition to elective benefit options, benefited employees receive firm-paid life insurance, AD&D, LTD, short term medical benefits as well as 21 days of Paid Time Off ("PTO") and 10 paid holidays each year. We provide generous parental leave and fertility benefits. New employees will attend a detailed benefit orientation to learn more about our many benefits and resources.

    Create a job alert for this search

    Senior Site Reliability Engineer • Washington, DC, United States

    Related jobs
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Federated IT • Washington, DC, United States
    Full-time
    Bridge Defense is redefining how modern defense technology is delivered.Department of Defense, the Intelligence Community, and federal law enforcement agencies. We provide full-spectrum national sec...Show more
    Last updated: 17 days ago • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    LIGHTFEATHER IO LLC • Alexandria, VA, US
    Full-time
    LightFeather is seeking a Site Reliability Engineer (SRE) with strong GitLab platform expertise to support and enhance enterprise DevSecOps and collaboration environments.The ideal candidate thrive...Show more
    Last updated: 30+ days ago • Promoted
    Senior Auxiliary Systems Engineer

    Senior Auxiliary Systems Engineer

    MANTECH • Washington, Washington, D.C., US
    Full-time
    Shape the future of defense with MANTECH! Join a team dedicated to safeguarding our nation through advanced tech and innovative solutions. Since 1968, we’ve been a trusted partner to the Depar...Show more
    Last updated: 21 hours ago • Promoted • New!
    Site Reliability Engineer, Platform Discovery

    Site Reliability Engineer, Platform Discovery

    Slope • Washington, DC, United States
    Full-time
    Anduril Industries is a defense technology company with a mission to transform U.By bringing the expertise, technology, and business model of the 21st century’s most innovative companies to the def...Show more
    Last updated: 8 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Canonical • Washington, DC, United States
    Full-time
    Canonical is a leading provider of open source software and operating systems.Our platform, Ubuntu, is used across enterprise initiatives in public cloud, data science, AI, engineering innovation, ...Show more
    Last updated: 17 days ago • Promoted
    Senior Principal ASIC Static Timing Engineer

    Senior Principal ASIC Static Timing Engineer

    Northrop Grumman • Columbia, MD, US
    Full-time
    RELOCATION ASSISTANCE : Relocation assistance may be available.At Northrop Grumman, our employees have incredible opportunities to work on revolutionary systems that impact people's lives around the...Show more
    Last updated: 2 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Powder River Industries • Washington, DC, United States
    Full-time
    Conduct analysis of alternatives for configuration tools, make recommendations, work with team to design, develop, test, implement, and maintain tool choice. Responsible for the administration, moni...Show more
    Last updated: 17 days ago • Promoted
    Systems Infrastructure Engineer

    Systems Infrastructure Engineer

    Jobot • Columbia, MD, US
    Full-time
    Exciting opportunity to join an industry leading building materials manufacturing company!.This Jobot Job is hosted by : Matt Tassoni. Are you a fit? Easy Apply now by clicking the "Apply Now&qu...Show more
    Last updated: 9 hours ago • Promoted • New!
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Bridge Defense • Washington, DC, United States
    Full-time
    Bridge Defense is redefining how modern defense technology is delivered.Department of Defense, the Intelligence Community, and federal law enforcement agencies. We provide full-spectrum national sec...Show more
    Last updated: 17 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    EngFlow • Washington, DC, United States
    Full-time
    Join to apply for the Site Reliability Engineer role at EngFlow.At EngFlow, we help developers save time by accelerating software builds and tests. Our cloud-based, distributed service optimizes dev...Show more
    Last updated: 17 days ago • Promoted
    Deployment Site Reliability Engineer - Connected Warfare

    Deployment Site Reliability Engineer - Connected Warfare

    Anduril Industries, Inc. • Washington, DC, United States
    Full-time
    Senior Deployed Site Reliability Engineer, Connected Warfare.Washington, District of Columbia, United States.Anduril Industries is a defense technology company with a mission to transform U.By brin...Show more
    Last updated: 17 days ago • Promoted
    Principal Site Reliability Engineer (SRE) at Jobgether Washington DC

    Principal Site Reliability Engineer (SRE) at Jobgether Washington DC

    Jobgether • Washington, DC, United States
    Full-time
    Principal Site Reliability Engineer (SRE) job at Jobgether.This position is posted by Jobgether on behalf of.We are currently looking for a. Principal Site Reliability Engineer (SRE).Join a high-imp...Show more
    Last updated: 30+ days ago • Promoted
    Cyber Operations Engineer

    Cyber Operations Engineer

    BOOZ, ALLEN & HAMILTON, INC. • Alexandria, VA, US
    Full-time +1
    As a cyber mission spe cia list, you understand the value of hunt-forward operations, and you know that battles are won in the grey. At Booz Allen, you can use your cyberspace operations experience ...Show more
    Last updated: 30+ days ago • Promoted
    Senior System Engineer

    Senior System Engineer

    The Planet Group • Washington, DC, US
    Full-time +1
    The Senior Systems Operations Engineer is responsible for designing, implementing, maintaining, and optimizing the client's enterprise systems infrastructure. This role ensures the stability, securi...Show more
    Last updated: 3 hours ago • Promoted • New!
    Site Reliability Engineer — Scale mission-critical platforms

    Site Reliability Engineer — Scale mission-critical platforms

    Anduril Industries • Washington, DC, United States
    Full-time
    A defense technology company is seeking a Site Reliability Engineer in Washington, DC.The role involves solving challenges in networking and systems integration while working with cross-functional ...Show more
    Last updated: 13 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ClientMind Recruiting Inc. • Bethesda, MD, United States
    Full-time
    Clientmind Recruiting is searching for a Site Reliability Engineer for a growing tech company based in the Bethesda, MD area. This will be onsite 1x per week (Tuesday).This role centers on maintaini...Show more
    Last updated: 10 days ago • Promoted
    Claim Specialist - Property Field Inspection

    Claim Specialist - Property Field Inspection

    State Farm • Columbia, MD, United States
    Full-time
    Being good neighbors - helping people, investing in our communities, and making the world a better place - is who we are at State Farm. It is at the core of how we operate and the reason for our suc...Show more
    Last updated: 30+ days ago • Promoted
    Senior Reliability Engineer

    Senior Reliability Engineer

    The Johns Hopkins University Applied Physics Laboratory • Laurel, MD, United States
    Full-time
    Are you passionate about applying reliability and system engineering principles to analyze and assess the resilience of future strategic weapon systems?. Do you have a strong technical background in...Show more
    Last updated: 20 days ago • Promoted