Observability Tools: Proficiency in monitoring, logging, and tracing tools, including Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), Splunk, Datadog, New Relic, and cloud-native solutions like AWS CloudWatch.
Programming Languages: Expertise in languages such as Python and Go for scripting and automation.
Infrastructure & Cloud Platforms: Experience with cloud platforms (AWS, GCP, Azure) and container orchestration systems like Kubernetes.
Infrastructure as Code (IaC): Familiarity with Terraform and Ansible for managing infrastructure and configurations.
CI/CD & Automation: Experience with CI/CD pipelines and automation tools like Jenkins.
System & Software Engineering: A strong background in both system operations and software development.
Ability to optimize cloud agent instrumentation; cloud certifications are a plus.
Strong understanding of observability concepts (logs, metrics, tracing).
Expertise in security and vulnerability management for observability tooling.
2 years of experience with cloud-based observability solutions, specializing in monitoring, logging, and tracing across AWS, Azure, and GCP environments.
Job Description:
Design & Implement Solutions: Build and maintain comprehensive observability platforms that provide deep insights into complex systems, incorporating logs, metrics, and traces.
System Instrumentation: Instrument applications, infrastructure, and services to collect telemetry data using frameworks like OpenTelemetry.
Data Analysis & Visualization: Develop dashboards, reports, and alerts using tools like Prometheus, Grafana, and Splunk to visualize system performance and detect issues.
Collaboration: Work with development, SRE, and DevOps teams to integrate observability best practices and align monitoring with business and operational goals.
Automation: Develop scripts and use Infrastructure as Code (IaC) tools like Ansible and Terraform to automate monitoring configurations and telemetry collection.
Implement and manage full-stack observability using Datadog, ensuring seamless monitoring across infrastructure, applications, and services.
Deploy and instrument agents across on-premises, cloud, and hybrid environments to enable comprehensive monitoring.
Design and deploy key service monitoring, including dashboards, monitor creation, SLA/SLO definitions, and anomaly detection with alert notifications.
Integrate Datadog with third-party services such as ServiceNow and other ITSM tools, and configure SSO.
Lead Engineer • Richmond, Virginia, United States