Search jobs > Plano, TX > Senior site reliability

Senior Lead Site Reliability Engineer

JPMorgan Chase & Co.
Plano, TX, United States
Full-time

Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability.

As a Senior Lead Site Reliability Engineer at JPMorgan Chase within the CORPORATE SECTOR in the INFRASTRUCTURE PLATFORMS, Runtime Compute Team , you are deemed as a force multiplier at both a line-of-business and firm wide level.

Inspire your peers and the wider product line to deliver durable and resilient products and services to our customers, define firm wide strategies for reliability, and guide and entrust our teams to lead and execute those strategies.

Job responsibilities

  • Provide technical SRE leadership for multiple SRE teams, engineers, and managers throughout Runtime Compute who look to you for advice on the technical issues facing them.
  • Play a key role in the Runtime Compute strategic resiliency, observability, and toil reduction planning.
  • Drive continual improvement in resilience, quality of experience, security, monitoring, instrumentation, and automation.
  • Have successfully implemented SRE best practices in high-performance, stable, mission-critical applications with demonstrable positive outcomes.
  • Work with your fellow stakeholders to define common NFRs and availability targets for your product line, Runtime Compute, and ensure that SRE is practiced consistently across applications, products, and product lines.
  • Act in a blameless, data-driven manner, show high empathy, emotional intelligence, and can navigate difficult situations with composure and tact.
  • Direct the SRE teams in the product line throughout the lifecycle to help develop software for reliability and scale, ensuring consistency across the product line, and minimal refactoring or changes.
  • Direct the SRE teams in the product line to develop and measure the SLO / SLI for provisioning / deprovisioning, deployments, uptime, and other measures critical to products.

Work with business partners to help educate on the product line SLO / SLI.

Identify gaps between applicable requirements and current procedures / controls; Drive resolution of mitigating controls.

Develop and implement solutions that strengthen business operating models, enhance the client experience, and improve efficiency and controls.

Work with business partners to design and implement enhancements to existing processes and / or business applications, introduce new processes and / or toolsets, and engage in process re-engineering.

Required qualifications, capabilities, and skills

  • Formal training or certification on software engineering concepts and 5+ years applied experience with Industry standard Runtime solutions eg Kubernetes and Cloud Foundry.
  • Expertise in at least one technology stack designing, coding, testing and delivering software.
  • Proficiency in one or more technology domains, may be cross-domain expert to able to solve complex and mission critical problems within a business or across the firm.

Software development experience in at least one general purpose programming language : Python, Java, C, C++, Go, Shell scripting.

  • Working knowledge infrastructure component ( . Load balancer, cloud platforms and products, container systems, and runtime compute).
  • Excellent debugging and troubleshooting skills.
  • Strong organizational and prioritization skills, detail-oriented and strong interpersonal skills.
  • Be a team player and a leader who shows commitment and dedication, and can maintain a positive attitude and high-level of performance on high-profile / time-sensitive initiatives

Preferred qualifications, capabilities, and skills

  • Experience hiring, developing, and recognizing talent
  • Ability to work in a high paced environment, be flexible, follow tight deadlines, organize and prioritize work
  • Hands-on experience with cloud-based observability technologies and tools especially in deployment, monitoring and operations, such as Data Dog, Prometheus, Splunk, ElasticSearch, Grafana, appdynamics etc
  • Strong working knowledge of modern development technologies and tools such Agile, CI / CD, Git, Terraform and Jenkins
  • Deep knowledge of Internet protocols and web services technologies such as HTTP, DNS, TCP / UDP, SOAP, JSON and REST
  • Good understanding of networking protocols and cybersecurity best practices in cloud environment. Public Cloud certification is preferred
  • AI / ML knowledge is preferred to evaluate and choose models that help with SRE goals including automated root cause analysis, anomaly detection, and real-time insights and analytics into various products.

LI-RB3

30+ days ago
Related jobs
Promoted
Capital One
Plano, Texas
Remote

Senior Lead Engineer - Generative AI Infrastructure (Remote-Eligible). We are committed to building world-class applied science and engineering teams and continue our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. Lead Engineer, ...

Promoted
Geico Insurance
Richardson, Texas

Our Senior Manager is an engineering leader who works with the engineering staff to innovate and build new engineering solutions, improve and enhance existing solutions as well as leverage engineering solutions to solve critical operational problems. Senior Manager, Site Reliability Engineering - Da...

Promoted
Capital One
Plano, Texas

Senior Lead Engineer - Generative AI Product Engineering. We are committed to building world-class applied science and engineering teams and continue our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. Lead Engineer to help build ...

Olsson
Plano, Texas

Often serves as lead engineer on complex projects and provides high-level technical engineering design. As a Senior Civil Engineer in either our Plano or Fort Worth office, you will provide overall technical support to the team and act as an advisor on complex projects. The senior engineer may provi...

Cisco
Richardson, Texas

Webex Networking Site Reliability Team. Working in tandem with Data Center Engineering and Incident Command teams, we identify problems and bottlenecks in processes and contribute to the automation of infrastructure provisioning, configuration management, and deployment processes. Bachelor’s degree ...

JPMorgan Chase & Co.
Plano, Texas

As a Senior Lead Software Engineer at JPMorgan Chase within the Corporate Technology - Corporate Oversight & Governance Tech team, you will spearhead the development and enhancement of our cutting-edge trade surveillance products. This pivotal role combines deep technical expertise with leadership s...

Capital One
Plano, Texas

Main Street (21020), United States of America, Cambridge, MassachusettsSenior Lead Engineer - Generative AI Product Engineering. Lead Machine Learning EngineerSan Francisco, California (Hybrid On-Site): $248,700 - $283,800 for Sr. We are committed to building world-class applied science and engineer...

Intuit
Plano, Texas

Supporting and coaching other engineers, pair programming or peer reviewing code, helping to ensure that all engineers are growing and part of a community. Owning the quality, reliability, and performance of our applications and services, including production on-call support shifts, construction of ...

Forcepoint
Richardson, Texas

This individual will be focused on maximum availability, reliability, security, and performance for Forcepoint services. Fully understands Agile Systems Engineering practices . ...

Raytheon Technologies
Richardson, Texas

We are seeking a Senior Principal Engineer to join our team in Richardson, Texas as an Integration Verification & Test Lead. Experience leading an engineering team through the integration/test of a software based system. We bring the strength of more than 100 years of experience and renowned enginee...