Talent.com
Principal Site Reliability Engineer
Principal Site Reliability EngineerHewlett Packard Enterprise Development LP • San Jose, CA, United States
Principal Site Reliability Engineer

Principal Site Reliability Engineer

Hewlett Packard Enterprise Development LP • San Jose, CA, United States
1 day ago
Job type
  • Full-time
Job description

Principal Site Reliability Engineer

This role has been designed as 'Hybrid' with an expectation that you will work on average 2 days per week from an HPE office.

Who We Are :

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today's complex world. Our culture thrives on finding new and better ways to accelerate what's next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE.

Job Description :

U.S. Citizenship required due to federal client's regulations

As a Staff Software Engineer, you will play a key role in designing, building, and optimizing cloud infrastructure and deployment systems. Your work will directly impact scalability, security, and operational efficiency across our platforms. Key responsibilities include :

  • Enhance Infrastructure as Code (IAC) and enforce best practices.
  • Optimize cloud infrastructure for scalability, security, and cost-effectiveness.
  • Develop internal tools to support and streamline cloud platform operations.
  • Improve CI / CD pipelines and deployment workflows using FluxCD and Jenkins.
  • Address container image vulnerabilities and standardize remediation processes.
  • Build Amazon Machine Images (AMIs) aligned with CIS and STIG benchmarks.
  • Strengthen monitoring, alerting, and observability using Prometheus, Grafana, and logging tools.
  • Troubleshoot complex production issues to ensure system reliability and customer satisfaction.
  • Fine-tune distributed systems such as Apache Kafka and Cassandra.
  • Collaborate with development, security, and operations teams to align infrastructure with application needs.

U.S. Citizenship required due to federal client's regulations

Basic Qualifications

  • Minimum of 12 years of hands-on experience in Infra Ops, Dev Ops, or Site Reliability Engineering (SRE).
  • Proficiency with Linux systems, especially Debian-based distributions.
  • Strong experience with cloud platforms such as AWS and GCP.
  • Expertise in Infrastructure as Code tools like Terraform, Packer, and Ansible.
  • Solid programming skills in Python and / or Golang.
  • Deep understanding of containerization (Docker, Container) and orchestration tools (AWS EKS, GCP GKE).
  • Experience with GitOps workflows.
  • Proven track record in implementing and maintaining CI / CD pipelines.
  • Strong background in security and familiarity with security programs.
  • Experience with monitoring and logging tools (Prometheus, Grafana, ELK).
  • Knowledge of both relational (SQL) and non-relational databases.
  • Excellent problem-solving and debugging skills with a strong sense of ownership.
  • Experience managing distributed systems like Apache Kafka and Cassandra.
  • Effective communicator and collaborative team player.
  • Preferred Qualifications

  • Experience contributing to open-source projects.
  • Background in security engineering or related disciplines.
  • Additional Skills :

    Cloud Architectures, Cross Domain Knowledge, Design Thinking, Development Fundamentals, DevOps, Distributed Computing, Microservices Fluency, Full Stack Development, Security-First Mindset, Solutions Design, Testing & Automation, User Experience (UX)

    What We Can Offer You :

    Health & Wellbeing

    We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.

    Personal & Professional Development

    We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have - whether you want to become a knowledge expert in your field or apply your skills to another division.

    Unconditional Inclusion

    We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

    Let's Stay Connected :

    Follow @HPECareers on Instagram to see the latest on people, culture and tech at HPE.

    #unitedstates

    #networking

    Job : Engineering

    Job Level : TCP_05

    States with Pay Range Requirement

    The expected salary / wage range for a U.S.-based hire filling this position is provided below. Actual offer may vary from this range based upon geographic location, work experience, education / training, and / or skill level. If this is a sales role, then the listed salary range reflects combined base salary and target-level sales compensation pay. If this is a non-sales role, then the listed salary range reflects base salary only. Variable incentives may also be offered. Information about employee benefits offered can be found at

    USD Annual Salary : $152,000.00 - $349,000.00

    HPE is an Equal Employment Opportunity / Veterans / Disabled / LGBT employer. We do not discriminate on the basis of race, gender, or any other protected category, and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together. Please click here : Equal Employment Opportunity.

    Hewlett Packard Enterprise is EEO Protected Veteran / Individual with Disabilities.

    HPE will comply with all applicable laws related to employer use of arrest and conviction records, including laws requiring employers to consider for employment qualified applicants with criminal histories.

    Create a job alert for this search

    Site Reliability Engineer • San Jose, CA, United States

    Related jobs
    3GPP Standards Engineer

    3GPP Standards Engineer

    MSR Technology Group • Mountain View, CA, US
    Temporary
    MSR Technology Group DBA Infomatics has been an Inc 500 / 5000 corporation for the last 7 years in a row.Please find the job details below. This job will include 1) contributing to 3GPP standardizatio...Show more
    Last updated: 1 day ago • Promoted
    Project / Validation Engineer

    Project / Validation Engineer

    Project Farma • San Francisco, CA, United States
    Full-time
    Welcome to the forefront of innovation in cutting edge patient centered treatments! We are seeking the best and the brightest to join our high performing organization as a Project Engineer.As a lea...Show more
    Last updated: 1 day ago • Promoted
    Engineering Team Lead

    Engineering Team Lead

    gpac • San Jose, CA, United States
    Full-time
    A well-established local Mechanical Contractor is looking to fill a.Excellent benefits, competitive compensation, and opportunities for professional development. A dynamic, fast-paced culture that t...Show more
    Last updated: 30+ days ago • Promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    Fortinet • Santa Clara, CA, United States
    Full-time
    At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers. Our team members enjoy solving complex problems, and obsess...Show more
    Last updated: 25 days ago • Promoted
    Site Reliability Engineering (SRE)

    Site Reliability Engineering (SRE)

    Syntricate Technologies • Santa Clara, CA, United States
    Full-time
    Position : Site Reliability Engineering (SRE).Location : Santa Clara, CA (Onsite).WS application and CI / CD pipelines, Microsoft Server admin and workload support (Data Center and AWS).Initial respons...Show more
    Last updated: 1 day ago • Promoted
    Founding Site Reliability Engineer

    Founding Site Reliability Engineer

    Assort Health • San Francisco, CA, United States
    Full-time
    Our mission is to make exceptional healthcare accessible anytime, anywhere, for everyone.At Assort Health, we believe healthcare should feel effortless and connected — quick answers, clear communic...Show more
    Last updated: 8 days ago • Promoted
    Team Lead, Site Reliability Engineering in San Francisco

    Team Lead, Site Reliability Engineering in San Francisco

    Energy Jobline ZR • San Francisco, CA, United States
    Full-time
    Team Lead, Site Reliability Engineering.Location : California | Remote | Work from Home.At Pythian, we are experts in strategic database and analytics services, driving digital transformation and op...Show more
    Last updated: 6 days ago • Promoted
    Founding Site Reliability Engineer

    Founding Site Reliability Engineer

    Reducto • San Francisco, CA, United States
    Full-time
    Reducto helps AI teams ingest real world enterprise data with state of the art accuracy.The vast majority of enterprise data — from financial statements to health records — is locked in unstructure...Show more
    Last updated: 11 days ago • Promoted
    Engineering Team Lead - HVAC & Plumbing

    Engineering Team Lead - HVAC & Plumbing

    CyberCoders • San Jose, CA, United States
    Full-time
    The Engineering Team Lead will oversee and guide a team of engineers in the design and execution of HVAC and plumbing projects. The ideal candidate will possess strong leadership skills and a deep u...Show more
    Last updated: 30+ days ago • Promoted
    Reliability Engineer, Energy Storage

    Reliability Engineer, Energy Storage

    Redwood Materials, Inc. • San Francisco, CA, United States
    Full-time
    Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recyclingkeeping critical minerals in circulation and driving the energy transition.Founded in 20...Show more
    Last updated: 1 day ago • Promoted
    Bradley Fighting Vehicle System Maintainer

    Bradley Fighting Vehicle System Maintainer

    United States Army • San Juan Bautista, CA, US
    Permanent
    As a Bradley Fighting Vehicle System Maintainer, you'll have the challenging task of performing repairs and maintenance exclusively on the range of Bradley fighting vehicles, including anti-aircraf...Show more
    Last updated: 30+ days ago • Promoted
    Reliability Engineer (Regulated Industry)

    Reliability Engineer (Regulated Industry)

    Mentor Technical Group • San Francisco, CA, United States
    Full-time
    Mentor Technical Group Job Opportunity.Mentor Technical Group (MTG) provides a comprehensive portfolio of technical support and solutions for the FDA-regulated industry. As a world leader in life sc...Show more
    Last updated: 1 day ago • Promoted
    Senior Reliability Engineer

    Senior Reliability Engineer

    Eight Sleep • San Francisco, CA, United States
    Full-time
    Join the Sleep Fitness Movement.At Eight Sleep, we're on a mission to fuel human potential through optimal sleep.As the world's first sleep fitness company, we're redefining what it means to be wel...Show more
    Last updated: 1 day ago • Promoted
    Consumables Sustaining Engineer

    Consumables Sustaining Engineer

    Bio-Rad Laboratories • Pleasanton, CA, United States
    Full-time
    Bio-Rad is looking for a Consumables Sustaining Engineer to join the Life Science Group of Bio-Rad based in Pleasanton, CA. You will be part of a multidisciplinary team focused on developing innovat...Show more
    Last updated: 30+ days ago • Promoted
    Founding Site Reliability Engineer

    Founding Site Reliability Engineer

    Assort Health Inc. • San Francisco, CA, United States
    Full-time
    Our mission is to make exceptional healthcare accessible anytime, anywhere, for everyone.At Assort Health, we believe healthcare should feel effortless and connected — quick answers, clear communic...Show more
    Last updated: 8 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Rockwoods Inc • Pleasanton, CA, US
    Full-time
    Note : Candidates must have relevant experience in Medical / Healthcare domains, this is mandatory.Senior SRE Engineer - Pleasanton, 5 days office. Primary work : 24x7 On-call support and setting up mo...Show more
    Last updated: 20 days ago • Promoted
    Founding Site Reliability Engineer

    Founding Site Reliability Engineer

    Relevance AI • San Francisco, CA, United States
    Full-time
    San Francisco, USA (Hybrid 3 days / week).At Relevance AI, our mission is to empower anyone to delegate work to the AI workforce. We’re building a new category of AI automation, enabling teams to crea...Show more
    Last updated: 23 days ago • Promoted
    Apple Services Engineering - Senior SRE

    Apple Services Engineering - Senior SRE

    Apple • Cupertino, CA, United States
    Full-time
    The Apple Service Engineering - SRE team is looking for Site Reliability Engineers with experience in developing processes, tools, and automation for managing distributed systems in production envi...Show more
    Last updated: 1 day ago • Promoted