Digital Site Reliability Engineer

Ampcus Incorporated
Charleston, SC, United States
Full-time
Temporary

Title: Digital Site Reliability Engineer

Location: Remote

Duration: 12-month contract

Role and Responsibilities

Reporting to the Head of DevOps Engineering, the Site Reliability Engineer will play a critical role in driving innovation and growth for the Banking Solutions business.

In this role, the candidate will have the opportunity to make a lasting impact on the company's digital transformation journey, drive customer-centric innovation and automation, and position the organization as a leader in the competitive digital banking landscape.

Specifically, the Site Reliability Engineer will be responsible for the following:

Ensure the reliability, availability, and performance of applications and services, focusing on minimizing downtime, optimizing response times, and maintaining high availability for users.

Lead incident response, including identification, triage, resolution, and post-incident analysis to prevent recurrence and improve system resilience.

Develop and maintain monitoring solutions and alerting mechanisms for infrastructure, application performance, and user experience metrics, enabling proactive issue detection and mitigation.

Implement automation tools and processes to automate routine tasks, scale infrastructure, and ensure seamless deployments, updates, and rollbacks with minimal user impact.

Conduct capacity planning, performance tuning, and resource optimization for environments, collaborating with development and operations teams to meet scalability and performance goals.

Collaborate with security teams to implement security best practices, perform vulnerability assessments, and ensure compliance with security standards and regulatory requirements for applications.

Manage deployment pipelines, release processes, and configuration management for app deployments, ensuring consistency, reliability, and version control across environments.

Identify areas for improvement in reliability, performance, and efficiency through data analysis, root cause analysis, and trend analysis, and drive initiatives to enhance system reliability and operational efficiency.

Create and maintain documentation, runbooks, and knowledge base articles for operational procedures, troubleshooting guides, and best practices, and promote knowledge sharing within the team.

Develop and test disaster recovery plans, backup strategies, and failover mechanisms for app services, ensuring business continuity and data integrity in case of failures or disasters.

Collaborate with development, QA, DevOps, and product teams to ensure alignment on reliability goals, performance metrics, release schedules, and incident response processes.

Participate in on-call rotations and provide 24/7 support for critical incidents: troubleshoot issues and coordinate with teams on resolution, escalation, and follow-up actions per defined SLAs.

Professional Qualifications

Proficiency in development technologies, architectures, and platforms (web, API) to understand system complexities and performance considerations.

Experience in cloud platforms (e.g., AWS, Azure, Google Cloud) and infrastructure as code (IaC) tools for managing app infrastructure and deployments.

Knowledge of monitoring tools (e.g., Prometheus, Grafana, New Relic) and logging frameworks (e.g., ELK Stack) for real-time visibility into system health, performance metrics, and user experience.

Experience in incident management, including incident response, triage, root cause analysis (RCA), and post-mortem reviews to prevent recurring issues.

Strong troubleshooting skills to diagnose complex technical issues in app environments, infrastructure, networking, and performance bottlenecks.

Proficiency in scripting languages (e.g., Python, Bash) and automation tools (e.g., Ansible, Terraform) for automating routine tasks, deployments, and infrastructure management.

Experience in implementing continuous integration/continuous deployment (CI/CD) pipelines for apps using tools like Jenkins, GitLab CI/CD, or Azure DevOps.

Expertise in setting up monitoring solutions, configuring alerts, and creating dashboards to monitor system performance, application metrics, and user experience.

Familiarity with APM (Application Performance Monitoring) tools to analyze app performance, identify bottlenecks, and optimize resource utilization.

Commitment to continuous learning and to staying current with industry trends, new technologies, and best practices in app reliability, performance, and operations.

Adaptability to evolving requirements, technologies, and business needs, with a focus on driving continuous improvement and operational excellence.

6 days ago