Talent.com
Principal Site Reliability Engineer

Principal Site Reliability Engineer

JPMorganChasePalo Alto, CA, United States
14 hours ago
Job type
  • Full-time
Job description

Join a globally recognized financial organization and advance your profession to new heights by contributing to revolutionary projects. You've discovered the perfect environment to have a major impact.

As a Principal Site Reliability Engineer at JPMorgan Chase within the Enterprise Technology, AI / ML & Data Platforms division , you will utilize your expertise to create innovative solutions that improve critical incident management and streamline the software development lifecycle throughout the organization. Your role will involve overseeing, designing, and deploying infrastructure components to enhance reliability and ensure operational efficiency.

Job responsibilities

  • Architect and implement observability platforms and tools for proactive detection and continuous improvement.
  • Lead the design and development of core observability services, including metrics pipelines and log aggregation.
  • Leverage modern technologies such as Open Telemetry and AI / ML for anomaly detection and automated insights.
  • Collaborate with engineering and SRE teams to define service-level objectives (SLOs) and error budgets.
  • Provide technical leadership and mentorship to engineering teams, ensuring best practices in system design.
  • Champion observability as a first-class concern in the software development lifecycle.
  • Influence platform strategy and roadmap through deep technical insight and alignment with business priorities.
  • Write advanced documentation and create executive presentations that translate technical issues into business impact.
  • Participate in industry professional forums and monitor relevant industry technologies and standards.
  • Lead medium to large projects by bringing together the proper perspective and integrating feedback from team members.
  • Participate in support responsibilities for coverage of critical applications.

Required qualifications, capabilities, and skills

  • Formal training or certification on site reliability engineering concepts and 10+ years applied experience.
  • Ability to determine how each system relates to each other and build automation to improve reliability.
  • Experience with translating research, analysis, and tests into business recommendations.
  • Ability to balance and be accountable for the work of multiple architects and designers.
  • Understands and leads partnerships across job functions to develop efficient systems.
  • Engages team members and expresses complex ideas with appropriate level of detail, while providing constructive feedback.
  • Self-motivated and able to work well under pressure with minimal supervision.
  • Ability to tackle a problem by using a logical, systematic, sequential approach.
  • Preferred qualifications, capabilities, and skills

  • Experience with cloud-native instrumentation and streaming data platforms.
  • Influence technology and policy decisions while fostering commitment and confidence in team members.
  • Develop effective solutions and analyze competitive positions by considering market trends.
  • Support the introduction of innovative methods and communicate clearly to persuade audiences.
  • Demonstrate concern and meet the needs of both internal and external customers.
  • #LI-RB3

    About Us

    JPMorganChase, one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world's most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management.

    We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission-based pay and / or discretionary incentive compensation, paid in the form of cash and / or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.

    We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs. Visit our FAQs for more information about requesting an accommodation.

    JPMorgan Chase & Co. is an Equal Opportunity Employer, including Disability / Veterans

    About the Team

    Our professionals in our Corporate Functions cover a diverse range of areas from finance and risk to human resources and marketing. Our corporate teams are an essential part of our company, ensuring that we're setting our businesses, clients, customers and employees up for success.

    #J-18808-Ljbffr

    Create a job alert for this search

    Site Reliability Engineer • Palo Alto, CA, United States

    Related jobs
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Redwood Materials, Inc.San Francisco, CA, United States
    Full-time
    Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling—keeping critical minerals in circulation and driving the energy transition.Founded in 2...Show moreLast updated: 14 hours ago
    • Promoted
    • New!
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    Harrison ClarkeSan Francisco, CA, United States
    Full-time
    Principal Site Reliability Engineer (SRE).The ideal candidate should have extensive experience in designing highly scalable infrastructure, building systems, and performing testing, monitoring, and...Show moreLast updated: 14 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    criteoPalo Alto, CA, United States
    Full-time
    At Criteo we face challenging problems in the IT industry at scale.Our data is large and our systems require speed and complexity handling. We have about 40 petabytes in Hadoop storage and respond t...Show moreLast updated: 14 hours ago
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    LiveRampSan Francisco, CA, United States
    Full-time
    Join to apply for the Senior Site Reliability Engineer role at LiveRamp.LiveRamp is the data collaboration platform of choice for the world’s most innovative companies. A groundbreaking leader in co...Show moreLast updated: 14 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    WritemedSan Francisco, CA, United States
    Full-time
    Would you like to join one of the fastest-growing organizations with a goal of using the latest AI, GenAI, LLM, Cloud, and Digital Technologies to advance drug development and improve patient care ...Show moreLast updated: 14 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Together AISan Francisco, CA, United States
    Full-time
    As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a soft...Show moreLast updated: 14 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PerplexitySan Francisco, CA, United States
    Full-time
    Perplexity is an AI-powered answer engine founded in December 2022 and growing rapidly as one of the world’s leading AI platforms. Perplexity has raised over $1B in venture investment from some of t...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PacerProSan Francisco, CA, United States
    Full-time
    You’ll be joining the engineering team responsible for delivering PacerPro’s SaaS and on-premise solutions that orchestrate case data workflows and provide data driven legal insights for our client...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    AlchemySan Francisco, CA, United States
    Full-time
    Our mission is to bring web3 to a billion people, by providing builders with the tools they need to build exceptional onchain products. Alchemy is the only complete developer platform that offers th...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Redwood MaterialsSan Francisco, CA, United States
    Full-time
    Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling — keeping critical minerals in circulation and driving the energy transition.Founded in...Show moreLast updated: 14 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    xAIPalo Alto, CA, United States
    Full-time
    AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...Show moreLast updated: 14 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PrimerSan Francisco, CA, United States
    Full-time
    Primer helps B2B products break out of the B2C-centric marketing box.Our platform turns consumer ad channels, data streams, and emerging AI workflows into measurable growth engines for go-to-market...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Jobs via DiceRedwood City, CA, United States
    Full-time
    Dice is the leading career destination for tech experts at every stage of their careers.Our client, Kforce Technology Staffing, is seeking a Reliability Engineer in Redwood City, CA.Deliver high-le...Show moreLast updated: 14 hours ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    HiveSan Francisco, CA, United States
    Full-time
    Hive is the leading provider of cloud-based AI solutions to understand, search, and generate content, and is trusted by hundreds of the world's largest and most innovative organizations.The company...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    ZapierSan Francisco, CA, United States
    Full-time
    We're humans who simply think computers should do more work.At Zapier, we’re not just making software—we’re building a platform to help millions of businesses globally scale with automation and AI....Show moreLast updated: 14 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Bits to AtomsSan Francisco, CA, United States
    Full-time
    Site Reliability Engineer (SRE).You’ll work at the intersection of infrastructure, AI / ML systems, and mission-critical physical operations. You’ll collaborate directly with engineering, AI, and oper...Show moreLast updated: 14 hours ago
    • Promoted
    Associate Site Reliability Engineer / Site Reliability Engineer

    Associate Site Reliability Engineer / Site Reliability Engineer

    MedStar HealthRedwood City, CA, United States
    Full-time
    C3 AI (NYSE : AI), is the Enterprise AI application software company.C3 AI delivers a family of fully integrated products including the C3 Agentic AI Platform, an end-to-end platform for developing,...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer, Founding

    Site Reliability Engineer, Founding

    LimohealthSan Francisco, CA, United States
    Full-time
    At Charta, we're pioneering a transformative approach to healthcare billing through the power of generative AI.Our mission is to revolutionize this critical yet often cumbersome aspect of healthcar...Show moreLast updated: 30+ days ago