SRE DevOps Engineer
We are seeking experienced DevOps Engineers with strong Site Reliability Engineering (SRE) capabilities who can work independently, think critically, and contribute immediately to our technical operations. This role requires professionals who can troubleshoot complex issues, write code, and collaborate effectively with development teams to solve problems proactively.
Required Technical Skills Minimum Competency Level : E2 (Medium) across ALL listed technologies
Core DevOps & Infrastructure
- Azure DevOps (E2 minimum) - Pipeline creation, management, and optimization
- CI / CD (E2 minimum) - End-to-end pipeline design and implementation
- AWS (E2 minimum) - EC2, S3, Lambda, RDS, CloudWatch, IAM
- Docker (E2 minimum) - Container creation, optimization, and troubleshooting
- Kubernetes (E2 minimum) - Cluster management, troubleshooting, and optimization
Development & Automation
Core Java (E2 minimum) - Code review, debugging, performance optimizationPython (E2 minimum) - Automation scripting, tool development, API integrationPowerShell (E2 minimum) - Windows automation and system managementAnsible (E2 minimum) - Configuration management and automationQuality & Security Tools
JFrog Artifactory (E2 minimum) - Artifact management and repository operationsSonarQube (E2 minimum) - Code quality analysis and remediationObservability & Monitoring
Application Performance Monitoring : Experience with AppDynamics, Grafana, ZabbixModern Observability Platforms : Datadog or Dynatrace experience STRONGLY PREFERREDLog Analysis : ELK Stack, Splunk, or equivalentInfrastructure Monitoring : Prometheus, CloudWatch, or similarEssential Professional Competencies : Problem-Solving & Critical Thinking
Root Cause Analysis : Ability to systematically identify and resolve complex technical issuesIncident Response : Experience with incident management, escalation procedures, and post-mortem analysisPerformance Optimization : Proactive identification and resolution of performance bottlenecksCapacity Planning : Understanding of resource utilization and scaling strategies Development CollaborationCode Review & Debugging : Ability to review application code and identify issuesApplication Troubleshooting : Work directly with developers to resolve application-level problemsPerformance Profiling : Use A PM tools to identify and resolve application performance issuesSecurity Implementation : Implement and maintain security best practices across the pipeline Self-Direction & InitiativeIndependent Problem Solving : Ability to research, analyze, and resolve issues with minimal guidanceProactive Communication : Regularly communicate status, blockers, and recommendationsContinuous Learning : Stay current with technology trends and best practicesDocumentation : Create and maintain clear technical documentationBehavioral Expectations Work Ethic & Resourcefulness
Self-Motivated : Takes ownership of tasks and sees them through to completionResourceful : Uses available resources (documentation, community, colleagues) to find solutionsProactive : Identifies potential issues before they become problemsQuality-Focused : Delivers work that meets high standards without extensive rework Communication & CollaborationClear Communication : Articulates technical concepts clearly to both technical and non-technical stakeholdersQuestion When Needed : Asks clarifying questions to ensure understandingKnowledge Sharing : Contributes to team knowledge through documentation and mentoringCultural Fit : Works well in a collaborative, fast-paced environment Experience RequirementsMinimum 3-5 years in DevOps / SRE rolesHands-on experience with modern cloud-native architecturesProven track record of incident resolution and system optimizationExperience working in Agile environments with rapid deployment cycles Success Criteria (90-day evaluation) 1. Technical Proficiency : Demonstrates competency across all required tools 2. Problem Resolution : Successfully resolves incidents and technical issues independently 3. Code Contributions : Makes meaningful contributions to automation and tooling 4. Team Integration : Effectively collaborates with existing team members 5. Process Improvement : Identifies and implements improvements to existing processes Preferred CertificationsAWS Certified Solutions Architect or DevOps EngineerMicrosoft Azure DevOps Engineer ExpertKubernetes certifications (CKA, CKAD)Datadog or Dynatrace certificationsSalary Range- $90,000-$100,000 a year
#LI-SP3 #LI-VX1