Job Summary
The DevOps Engineer is responsible for ensuring high availability, performance, monitoring, and incident response for OVHcloud Baremetal products and services.
This position drives the reliability, development, configuration, and deployment of our current and future products and services.
This includes investigating and debugging errors and submitting fixes that contribute to software development to improve our services.
Essential Duties & Responsibilities
- Maintain essential OVHcloud infrastructures, products, and services.
- Diagnose errors with a data-driven approach, analyzing categorization and resolving issues effectively and efficiently.
- Author knowledge base articles and instructional guides as needed.
- Automate tasks by developing scripts and tooling.
- Participate in building, deploying, and / or troubleshooting of microservices software applications and other underlying APIs.
- Responsible for monitoring the alerting systems and submitting configuration changes on a regular basis to ensure availability of systems and services.
- Install, deploy, and configure OVHcloud infrastructure as new capabilities are developed.
- Analyze data and develop meaningful automated reports to be used by technical and business leaders.
- Write well-documented root cause analyses when critical issues occur.
- User Acceptance Testing (UAT) for new product launches.
- Take part in on-call rotations, including weekend coverage.
Minimum Requirements
- 2+ years of relevant experience is required, including DevOps / programming, and administration of Linux / Unix / Windows operating systems.
- Experience performing day-to-day operational (DevOps) tasks and working with microservices and multiple APIs.
- Experience with languages such as Python, Bash, Perl, Go, etc.
- Experience with maintenance / configuration of monitoring, metrics, and logging infrastructures like Nagios, Grafana, Graylog.
- Experience with virtualization and container technology.
- Experience with open-source configuration management tools such as Puppet, Ansible, etc. preferred.
- Experience managing a distributed, highly available, high-traffic infrastructure based on Linux is preferred.
- Well-versed in cloud technologies and terminology.
- Ability to efficiently prioritize, organize, and complete tasks throughout the workday and adjust when new priorities arise.
- Bachelor’s degree in computer science or a related field, or equivalent and relevant experience preferred.
Working Conditions
Standard office environment
30+ days ago