Monitoring Lead (Observability Tools)

Purple Drive • Austin, TX, United States

1 day ago

Job type

Full-time

Job description

Role Summary

We are seeking an experienced Monitoring Lead with strong expertise in observability tools such as Datadog, Grafana, and Kibana. The candidate will lead the design, implementation, and management of monitoring and observability solutions to ensure system reliability, performance, and availability across enterprise applications and infrastructure.

Key Responsibilities

Lead the observability and monitoring strategy across infrastructure, applications, and services.
Configure, maintain, and optimize monitoring tools (Datadog, Grafana, Kibana) for real-time visibility.
Develop dashboards, alerts, and metrics to support proactive incident detection and resolution.
Collaborate with DevOps, Cloud, and Application teams to define SLA/SLO/SLI metrics and ensure service reliability.
Implement log aggregation, tracing, and metrics collection to improve end-to-end observability.
Troubleshoot performance bottlenecks and identify root causes using monitoring insights.
Provide leadership, guidance, and training to teams on best practices for observability and monitoring.
Stay updated on emerging monitoring technologies and recommend adoption where relevant.

Required Skills & Experience

8-12 years of experience in IT operations/DevOps, with at least 3 years in monitoring and observability leadership roles.

Strong hands-on experience with Datadog, Grafana, and Kibana.

Knowledge of observability practices (metrics, logs, traces).

Experience with cloud platforms (AWS, Azure, GCP) and containerized environments (Kubernetes, Docker).

Proficiency in scripting (Python, Shell, PowerShell, etc.) for automation of monitoring tasks.

Excellent troubleshooting, analytical, and communication skills.

Good to Have

Experience with Prometheus, Elastic Stack, Splunk, or New Relic.

Familiarity with Site Reliability Engineering (SRE) practices.

Exposure to infrastructure-as-code tools (Terraform, Ansible).
