US CITIZENSHIP REQUIREDRead all the information about this opportunity carefully, then use the application button below to send your CV and application.
The
- Cloud Operations Engineer (L2)
- is responsible for advanced troubleshooting, system administration, and application environment support across Knox’s
- cloud infrastructure
- . This role bridges operations, automation, and development support — maintaining system stability, executing changes, and ensuring compliance within
- FedRAMP Moderate, High, and DoD IL5
- environments.
The ideal candidate brings strong
- Linux, cloud, and automation experience
- , with an understanding of application architecture and
- low-code / no-code platform operations
- Key ResponsibilitiesIncident Management & System Troubleshooting
- Perform advanced troubleshooting for infrastructure, OS, and application issues.
- Analyze system logs, metrics, and telemetry from monitoring platforms (Grafana, Datadog, Wiz, CloudWatch).
- Coordinate with Platform / DevOps Engineers on root cause analysis and long-term remediation.
- Ensure timely resolution of escalated incidents in accordance with SLAs.
- Cloud Administration & Maintenance
- Manage and maintain
- AWS, Azure, and hybrid environments
- following configuration baseline controls (CM-2, CM-6).
- Execute system patching, upgrades, and configuration changes via automation or scripts.
- Perform health checks, deployment validations, and post-change verifications.
- Maintain infrastructure documentation and system configuration inventories.
- Application Support & Deployment Assistance
- Support
- low-code / no-code and custom applications
- during deployment and maintenance windows.
- Troubleshoot app-layer issues such as API failures, integration errors, or misconfigurations.
- Work with DevOps / Platform teams to optimize
- CI / CD deployment workflows
- and rollback plans.
- Ensure adherence to
- change management and deployment authorization
- processes.
- Automation & Scripting
- Create or modify automation scripts (Bash, Python, PowerShell) for maintenance and reporting tasks.
- Leverage
- Terraform, Ansible, or cloud-native tools
- for provisioning and environment consistency.
- Proactively identify opportunities to automate recurring operational processes.
- Document system changes and incident response details for FedRAMP audits.
- Qualifications
- 3–5 years of experience in
- cloud operations, system administration, or infrastructure support
- Proficiency in
- Linux administration
- and
- command-line troubleshooting
- Strong working knowledge of
- AWS and / or Azure infrastructure services
- Familiarity with
- CI / CD pipelines
- and deployment automation tools.
- Understanding of
- low-code / no-code platforms
- (Power Platform, ServiceNow, Salesforce) and related integration troubleshooting.
- Experience writing and maintaining scripts (Bash, Python, PowerShell)
- Familiarity with
- FedRAMP, NIST 800-53
- , or similar compliance environments.
- US Citizenship required
- Preferred Certifications :
- AWS SysOps Administrator, Microsoft Azure Administrator, Terraform Associate, CompTIA Security+, ITIL v4.
- Success Indicators
- Improved uptime and service reliability across assigned systems.
- Faster incident resolution and RCA completion times.
- Demonstrated automation improvements in recurring operations.
- Positive collaboration feedback from DevOps and Security teams.
- Support
- Continuous Monitoring (ConMon)
- activities through vulnerability reporting and patch compliance tracking.
- Assist in maintaining logs, baselines, and access control evidence.
Job Types : Full-time, Contract
Pay : $105,000.00 - $135,000.00 per year
Work Location : Remote