Talent.com
Lead Site Reliability Engineer - Cloud Operations. - US Citizen Required
Lead Site Reliability Engineer - Cloud Operations. - US Citizen RequiredOracle • Salt Lake City, UT, US
No longer accepting applications
Lead Site Reliability Engineer - Cloud Operations. - US Citizen Required

Lead Site Reliability Engineer - Cloud Operations. - US Citizen Required

Oracle • Salt Lake City, UT, US
12 days ago
Job type
  • Full-time
Job description

Job Description

Oracle Health (OHAI) is a leader in generative AI for healthcare, focusing on cutting-edge cloud services that streamline healthcare operations. Our EHR and Clinical AI Agent platforms help healthcare providers reduce manual tasks and improve patient care. We are expanding our OCI Cloud Operations team and are seeking a Principal Site Reliability Engineer (SRE) to ensure the reliability, performance, and scalability of these services in a production environment.

Role Overview

As a key member of Oracle Health's Cloud Operations team, you will be responsible for the operational health, reliability, and performance of our EHR and Clinical AI Agent services. You will ensure these customer-facing platforms meet the highest standards of scalability, availability, and security, while developing and executing strategies for continuous optimization. Your focus will include cloud service performance monitoring, incident management, and operational efficiency across Kubernetes-based environments. In this role, you will serve as a technical team lead, providing mentorship to Site Reliability Engineers (SREs)

Key Responsibilities

Service Ownership & Leadership :

Own operational aspects of the EHR and Clinical AI Agent cloud services, ensuring their reliability and performance.

Serve as a technical lead within the Site Reliability Engineering (SRE) team, promoting adherence to best practices in incident management, service design, operational excellence, and automation.

Mentor and support team members, fostering professional growth, technical proficiency, and a culture of accountability and continuous improvement in service operations

Operations Engineering

Oversee deployment and operations of cloud services across commercial and government data centers, ensuring compliance with corporate and regulatory standards.

Monitor and optimize resource utilization, performance, and scalability to maintain high availability and reliability.

Ensure security and compliance of all services in alignment with organizational and governmental requirements.

Manage incidents proactively, identifying and resolving issues in real time to minimize downtime and maintain system stability.

Lead critical incident response efforts, collaborating with development teams to implement corrective and preventive measures.

Service Design & Optimization :

Design and implement zero-downtime deployment strategies for software and security updates.

Work with cross-functional teams to enhance service stability and ensure predictable, efficient operation.

Implement proactive measures for system failure analysis and rapid issue resolution.

Automation & Continuous Improvement :

Lead automation initiatives to streamline operational tasks and reduce manual intervention.

Drive improvements to monitoring and alerting frameworks using tools like Prometheus and Grafana.

Implement Infrastructure as Code (IaaC) practices using Terraform and Shepherd.

Assist in the optimization of cloud-based services, ensuring smooth performance, scalability, and operational efficiency.

Qualifications

Experience : 8+ years in Site Reliability Engineering, DevOps, or Cloud Operations.

Experienced in managing customer-facing, Kubernetes-based Cloud services.

Cloud & Container Technologies : Experience with OCI, Kubernetes, Docker, Prometheus, Grafana, and cloud-native solutions.

Scripting & Automation : Proficiency in Python, Perl, Shell Scripting, and tools like Terraform.

Incident Management : Strong troubleshooting skills for resolving complex issues in production systems.

Cloud Platforms : Experience with OCI, AWS, GCP, or Azure.

Version Control : Familiarity with Git.

Operating Systems : Extensive experience with Linux / Unix environments in a Cloud Production environment.

Security & Compliance : Knowledge of cloud security best practices, particularly in regulated industries like healthcare.

US Citizenship on US soil is required. This position requires you to be eligible to receive a federal security clearance which requires you to be a US Citizen.

Responsibilities

Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and / or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.

Disclaimer

Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.

Range and benefit information provided in this posting are specific to the stated locations only

US : Hiring Range in USD from : $86,400 to $199,500 per annum. May be eligible for bonus and equity.

Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.

Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.

Oracle US offers a comprehensive benefits package which includes the following

Medical, dental, and vision insurance, including expert medical opinion

Short term disability and long term disability

Life insurance and AD&D

Supplemental life insurance (Employee / Spouse / Child)

Health care and dependent care Flexible Spending Accounts

Pre-tax commuter and parking benefits

401(k) Savings and Investment Plan with company match

Paid time off : Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.

11 paid holidays

Paid sick leave : 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.

Paid parental leave

Adoption assistance

Employee Stock Purchase Plan

Financial planning and group legal

Voluntary benefits including auto, homeowner and pet insurance

The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.

Career Level - IC4

About Us

As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.

We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.

Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling +1 888 404 2494 in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

Create a job alert for this search

Site Reliability Engineer • Salt Lake City, UT, US

Related jobs
Senior Site Reliability Engineer

Senior Site Reliability Engineer

VirtualVocations • Salt Lake City, Utah, United States
Full-time
A company is looking for a Senior Site Reliability Engineer to help scale its platform and ensure system reliability.Key Responsibilities Act as a first responder for system incidents and outages...Show more
Last updated: 30+ days ago • Promoted
Lead DevOps Engineer

Lead DevOps Engineer

VirtualVocations • Salt Lake City, Utah, United States
Full-time
A company is looking for a Lead DevOps Engineer to define and drive the infrastructure and deployment strategy across its enterprise platform. Key Responsibilities : Define and execute the DevOps s...Show more
Last updated: 30+ days ago • Promoted
Cloud Engineer

Cloud Engineer

Unisys Corporation • Salt Lake City, UT, United States
Full-time
What success looks like in this role : .Design and implement cloud computing solutions using AWS, Azure, or Google Cloud Platform. Manage and optimize cloud infrastructure, ensuring high availability ...Show more
Last updated: 30+ days ago • Promoted
Platform Operations Engineer

Platform Operations Engineer

VirtualVocations • Provo, Utah, United States
Full-time
A company is looking for a Platform Operations Engineer.Key Responsibilities Drive solution architecture and delivery across diverse domains, focusing on AWS cloud infrastructure and containers ...Show more
Last updated: 30+ days ago • Promoted
Staff Site Reliability Engineer

Staff Site Reliability Engineer

Pura • Pleasant Grove, UT, US
Full-time
Quick Apply
Staff Site Reliability Engineer Join Us at Pura—Reimagining Fragrance for the Future At Pura, we believe life is better when it smells good. Fragrance has the unique power to transform spaces,...Show more
Last updated: 30+ days ago
DevOps Site Reliability Engineer

DevOps Site Reliability Engineer

VirtualVocations • Provo, Utah, United States
Full-time
A company is looking for a DevOps / Site Reliability Engineer (Remote).Key Responsibilities Configure, manage, and improve CI / CD pipelines for application deployments Monitor application perform...Show more
Last updated: 3 days ago • Promoted
Principal Site Reliability Engineer

Principal Site Reliability Engineer

VirtualVocations • Salt Lake City, Utah, United States
Full-time
A company is looking for a Principal Site Reliability Engineer.Key Responsibilities Lead the technical direction of the team while contributing to the design and implementation of self-service to...Show more
Last updated: 30+ days ago • Promoted
Engineer, Reliability

Engineer, Reliability

AES Corporation • Salt Lake City, UT, United States
Full-time
Are you ready to be part of a company that's not just talking about the future, but actively shaping it? Join The AES Corporation (NYSE : AES), a. AES is committed to shaping a future through innovat...Show more
Last updated: 10 days ago • Promoted
Cloud Operations Engineer

Cloud Operations Engineer

VirtualVocations • Provo, Utah, United States
Full-time
A company is looking for a Cloud Operations Engineer to join their IT Services team primarily on a remote basis.Key Responsibilities : Utilize infrastructure-as-code tools to configure systems and...Show more
Last updated: 30+ days ago • Promoted
Cloud Site Reliability Engineer

Cloud Site Reliability Engineer

VirtualVocations • Provo, Utah, United States
Full-time
A company is looking for a Cloud Site Reliability Engineer (AWS).Key Responsibilities Design, deploy, and maintain AWS cloud infrastructure for high availability and fault tolerance Administer M...Show more
Last updated: 30+ days ago • Promoted
Data Center Facility Operations Reliability Engineer

Data Center Facility Operations Reliability Engineer

Meta • Salt Lake City, UT, US
Full-time
Meta was built to help people connect and share, and over the last decade, our tools have played a critical part in changing how people around the world communicate with one another.With over two b...Show more
Last updated: 6 days ago • Promoted
Senior Cloud DevOps Engineer

Senior Cloud DevOps Engineer

VirtualVocations • Salt Lake City, Utah, United States
Permanent
A company is looking for a Senior Cloud DevOps Engineer to work remotely on a government contract.Key Responsibilities Operate modern web-based applications / systems in a secure, reliable, and sca...Show more
Last updated: 30+ days ago • Promoted
Technical DevSecOps Lead

Technical DevSecOps Lead

VirtualVocations • Salt Lake City, Utah, United States
Full-time
A company is looking for a Technical DevSecOps Lead to join their global team remotely.Key Responsibilities Design and architect DevSecOps platforms to meet scalability, performance, and security...Show more
Last updated: 3 days ago • Promoted
DevSecOps Engineer

DevSecOps Engineer

VirtualVocations • Salt Lake City, Utah, United States
Full-time
A company is looking for a DevSecOps Engineer to support their Project Ares team in maintaining and enhancing cloud-native infrastructure and security operations. Key Responsibilities Manage Kuber...Show more
Last updated: 30+ days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

BankTalent HQ • Midvale, UT, United States
Full-time
Zions Bancorporation's Enterprise Technology and Operations (ETO) team is transforming what it means to work for a financial institution. With a commitment to technology and innovation, we have been...Show more
Last updated: 30+ days ago • Promoted
Sr AWS Cloud DevOps Engineer

Sr AWS Cloud DevOps Engineer

Unisys Corporation • Salt Lake City, UT, United States
Full-time
What success looks like in this role : .DevSecOps Pipeline Design & Automation : .Design and implement secure, automated CI / CD pipelines in AWS using tools like AWS CodePipeline, Jenkins, GitLab CI, an...Show more
Last updated: 30+ days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

VirtualVocations • Salt Lake City, Utah, United States
Full-time
A company is looking for a Site Reliability Engineer (P3) - Cloud Infrastructure.Key Responsibilities Automate day-to-day operations and improve system reliability Design and implement highly av...Show more
Last updated: 30+ days ago • Promoted
Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Unisys Corporation • Salt Lake City, UT, United States
Full-time
What success looks like in this role : .Design, implement, and manage scalable and reliable systems.Monitor system performance and troubleshoot issues. Collaborate with development teams to improve th...Show more
Last updated: 30+ days ago • Promoted
Cloud Engineer

Cloud Engineer

VirtualVocations • Salt Lake City, Utah, United States
Full-time
A company is looking for a Google Workspace and Cloud Engineer.Key Responsibilities Manage and secure Google Workspace and other cloud security platforms Implement and oversee cloud security mea...Show more
Last updated: 30+ days ago • Promoted
Cloud DevOps Engineer

Cloud DevOps Engineer

VirtualVocations • Provo, Utah, United States
Full-time
A company is looking for a Cloud DevOps Engineer to join their team in the US (Remote).Key Responsibilities Design, build, and manage cloud infrastructure solutions for scalable application deplo...Show more
Last updated: 30+ days ago • Promoted