Job Description
Job Title : AIOps Enterprise Architect
Skills : Monitoring, AIOps & Observability, scripting and automation (Python, Bash, Terraform, Ansible)
Experience : 12-18 Years
Location : Windsor, CT
Job Type : Full-time
We at Coforge are hiring an AIOps Enterprise Architect with the following responsibilities and skills :
- Responsible for designing and governing the enterprise-wide observability and intelligent operations architecture.
- This role ensures real-time visibility, predictive analytics, and autonomous remediation capabilities across hybrid and cloud-native environments, enabling proactive IT operations and business resilience.
- Define and maintain the enterprise observability and AIOps reference architecture.
- Develop and enforce architecture principles, standards, and best practices for monitoring, logging, tracing, and event management.
- Align observability strategies with enterprise IT and business goals, ensuring scalability, security, and compliance.
- Architect and integrate unified observability stacks (e.g., OpenTelemetry, Prometheus, Grafana, ELK, Splunk, Dynatrace, Datadog).
- Design telemetry pipelines for metrics, logs, traces, and events across distributed systems.
- Implement AIOps capabilities such as anomaly detection, event correlation, root cause analysis, and predictive alerting.
- Work closely with DevOps, SRE, Cloud, Security, and Application teams to embed observability into the SDLC and CI/CD pipelines.
- Lead cross-functional teams in the selection, integration, and optimization of observability and AIOps tools.
- Provide architectural guidance to solution architects and engineering teams.
- Drive automation of incident response and operational workflows using AI/ML and rule-based systems.
- Define and monitor SLOs, SLIs, and KPIs to ensure system reliability and performance.
- Support real-time analytics and business-impact insights through enriched telemetry data.
- 15+ years of experience in IT architecture, infrastructure, or software engineering.
- 5+ years of experience in observability, monitoring, or AIOps domains.
- Deep expertise in observability tools (e.g., Splunk, Prometheus, Grafana, Dynatrace, ELK, OpenTelemetry).
- Strong understanding of cloud platforms (AWS, Azure, GCP) and hybrid cloud architectures.
- Proficiency in scripting and automation (Python, Bash, Terraform, Ansible).
- Experience with CI/CD pipelines and DevOps/DevSecOps practices.
- Familiarity with ITSM tools (e.g., ServiceNow) and CMDB integration.
- Experience in the finance industry is a plus.