Talent.com
Senior Technology Site Reliability Engineer
Senior Technology Site Reliability EngineerCooley LLP • Washington, DC, United States
Senior Technology Site Reliability Engineer

Senior Technology Site Reliability Engineer

Cooley LLP • Washington, DC, United States
1 day ago
Job type
  • Full-time
Job description

Senior Technology Site Reliability Engineer

Cooley is seeking a Senior Site Reliability Engineer to join the Infrastructure & Development Operations team.

Position summary : The Senior Technology Site Reliability Engineer("SRE") is responsible for ensuring the reliability, scalability, and performance of the firm's critical infrastructure and applications. The SREblends software engineering with systems engineering to build and maintain automated, resilient, and observable systems that support high availability and operational excellence. In addition to being technically advanced, the SRE will have a high degree of emotional intelligence and the ability to work as a team towards complex and layered objectives. Specific duties and responsibilities include, but are not limited to, the following :

Position responsibilities :

  • Monitor and maintain production systems to ensure high availability and performance
  • Implement and manage service-level indicators (SLIs), objectives (SLO's), agreements (SLA's), and error budgets
  • Participate in on-call rotations and incident response, including root cause analysis and postmortems
  • Develop and maintain infrastructure as code (IaC) using Terraform
  • Automate deployment, scaling, and recovery processes to reduce manual intervention
  • Partner with DevOps to build and maintain CI / CD pipelines to support safe and efficient software delivery
  • Implement observability solutions using metrics, logs, traces, and alerting systems (Prometheus, Grafana, DataDog, etc.)
  • Proactively identify and resolve system bottlenecks and reliability risks
  • Work closely with Infrastructure, DevOps, Development, and security teams to embed reliability into the development lifecycle
  • Contribute to a culture of blameless post-mortems and continuous improvement
  • Document operational procedures and share knowledge across teams
  • All other duties as assigned or required

Skills and experience :

Required :

  • After orientation at Cooley LLP, exhibit proficiency in the Microsoft Office suite, iManage and other firm applications
  • Ability to work extended and / or weekend hours, as required
  • Ability to travel, as required
  • 6+ years direct applicable experience (e.g. site reliability engineering or related field)
  • Proficiency in Terraform and programming languages such as Python, Go, or Java
  • Deep expertise in cloud platforms, particularly AWS, and container orchestration
  • Strong background in distributed systems, performance tuning, and automation
  • Hands-on experience with configuration management tools such as Puppet, Chef, or Salt
  • Preferred :

  • Bachelor's Degree in Computer Science, Information Technology, Engineering, or associated discipline
  • Experience working with advanced ETL data workflows including technologies such as AWS EMR, Azure Synapse, Azure Data Factory, or Apache Hive / Spark / Airflow
  • Experience with IaC deployment of AKS / EKS / GKE architecture
  • Experience with enterprise Data Lake environments using technologies such as DataBricks or Snowflake
  • Competencies :

  • Expert analytical / quantitative, problem-solving, and deductive reasoning skills, experience performing advanced troubleshooting and root cause analysis of complex technical issues
  • Excellent organizational, planning, and time management skills and ability to work independently and in a team environment to manage competing priorities and meet deadlines
  • Advanced verbal and written communication skills with the ability to present findings, conclusions, alternatives, and information clearly and concisely
  • Experience working with all levels of business professionals, management, stakeholders, and vendors with the ability to build effective relationships through trust and diplomacy
  • Cooley offers a competitive compensation and excellent benefits package and is committed to fair and equitable employment practices.

    EOE.

    The expected annual pay range for this position with a full-time schedule is $140,000 - $205,000. Please note that final offer amount will be dependent on geographic location, applicable experience and skillset of the candidate.

    We offer a full range of elective benefits including medical, health savings account (with applicable medical plan), dental, vision, health and / or dependent care flexible spending accounts, pre-tax commuter benefits, life insurance, AD&D, long-term care coverage, backup care for children and / or adults and other parental support benefits. In addition to elective benefit options, benefited employees receive firm-paid life insurance, AD&D, LTD, short term medical benefits as well as 21 days of Paid Time Off ("PTO") and 10 paid holidays each year. We provide generous parental leave and fertility benefits. New employees will attend a detailed benefit orientation to learn more about our many benefits and resources.

    Create a job alert for this search

    Senior Site Reliability Engineer • Washington, DC, United States

    Related jobs
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Federated IT • Washington, DC, United States
    Full-time
    Bridge Defense is redefining how modern defense technology is delivered.Department of Defense, the Intelligence Community, and federal law enforcement agencies. We provide full-spectrum national sec...Show more
    Last updated: 17 days ago • Promoted
    Travel CT Tech - $3,604 per week in Timonium, MD

    Travel CT Tech - $3,604 per week in Timonium, MD

    Triage Staffing LLC • Columbia, Maryland, US
    Full-time
    Travel Radiology : CT Tech Timonium.Shift Details : 0H Days (3 : 16 PM-3 : 16 PM).Length : 26 WEEKS 26 weeks.Apply for specific facility details.Show more
    Last updated: 20 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Govx • Washington, District of Columbia, United States
    Remote
    Full-time
    GOVX is seeking an experienced Senior Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of our production systems through automation, observability, and operat...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    LIGHTFEATHER IO LLC • Alexandria, VA, US
    Full-time
    LightFeather is seeking a Site Reliability Engineer (SRE) with strong GitLab platform expertise to support and enhance enterprise DevSecOps and collaboration environments.The ideal candidate thrive...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer, Platform Discovery

    Site Reliability Engineer, Platform Discovery

    Slope • Washington, DC, United States
    Full-time
    Anduril Industries is a defense technology company with a mission to transform U.By bringing the expertise, technology, and business model of the 21st century’s most innovative companies to the def...Show more
    Last updated: 8 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Canonical • Washington, DC, United States
    Full-time
    Canonical is a leading provider of open source software and operating systems.Our platform, Ubuntu, is used across enterprise initiatives in public cloud, data science, AI, engineering innovation, ...Show more
    Last updated: 17 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Powder River Industries • Washington, DC, United States
    Full-time
    Conduct analysis of alternatives for configuration tools, make recommendations, work with team to design, develop, test, implement, and maintain tool choice. Responsible for the administration, moni...Show more
    Last updated: 17 days ago • Promoted
    Travel CT Tech - $2,555 to $2,833 per week in La Plata, MD

    Travel CT Tech - $2,555 to $2,833 per week in La Plata, MD

    AlliedTravelNetwork • Columbia, Maryland, US
    Full-time
    AlliedTravelNetwork is working with LRS Healthcare to find a qualified CT Tech in La Plata, Maryland, 20646!.Ready to start your next travel adventure? LRS Healthcare offers a full benefits package...Show more
    Last updated: 20 days ago • Promoted
    Site Reliability Engineer - TS / SCI

    Site Reliability Engineer - TS / SCI

    Mission Box Technologies • Washington, District of Columbia, United States
    Remote
    Full-time
    We are seeking a Site Reliability Engineer (SRE).The ideal candidate will drive continuous improvements, ensuring robust and reliable technology services. For over two decades, our client has been c...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Anduril Industries • Washington, District of Columbia, United States
    Full-time
    The Platform Discovery team at Anduril is at the forefront of incubating and maturing high-potential, software-defined, AI-native offerings that meet the toughest, newest challenges across hardware...Show more
    Last updated: 15 days ago • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Bridge Defense • Washington, DC, United States
    Full-time
    Bridge Defense is redefining how modern defense technology is delivered.Department of Defense, the Intelligence Community, and federal law enforcement agencies. We provide full-spectrum national sec...Show more
    Last updated: 17 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    EngFlow • Washington, DC, United States
    Full-time
    Join to apply for the Site Reliability Engineer role at EngFlow.At EngFlow, we help developers save time by accelerating software builds and tests. Our cloud-based, distributed service optimizes dev...Show more
    Last updated: 17 days ago • Promoted
    Deployment Site Reliability Engineer - Connected Warfare

    Deployment Site Reliability Engineer - Connected Warfare

    Anduril Industries, Inc. • Washington, DC, United States
    Full-time
    Senior Deployed Site Reliability Engineer, Connected Warfare.Washington, District of Columbia, United States.Anduril Industries is a defense technology company with a mission to transform U.By brin...Show more
    Last updated: 17 days ago • Promoted
    Principal Site Reliability Engineer (SRE) at Jobgether Washington DC

    Principal Site Reliability Engineer (SRE) at Jobgether Washington DC

    Jobgether • Washington, DC, United States
    Full-time
    Principal Site Reliability Engineer (SRE) job at Jobgether.This position is posted by Jobgether on behalf of.We are currently looking for a. Principal Site Reliability Engineer (SRE).Join a high-imp...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ClientMind Recruiting Inc. • Bethesda, MD, United States
    Full-time
    Clientmind Recruiting is searching for a Site Reliability Engineer for a growing tech company based in the Bethesda, MD area. This will be onsite 1x per week (Tuesday).This role centers on maintaini...Show more
    Last updated: 10 days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Ten Mile Square Technologies • Arlington, Virginia, United States
    Remote
    Full-time
    Ten Mile Square Technologies is a high-end technology consulting firm based in the Northern Virginia area.Our customers routinely call upon us to solve some of the largest scale and hardest problem...Show more
    Last updated: 30+ days ago • Promoted