AI Engineer - Learn how to integrate AI into software applications
AI Adoption - Drive Business Value and Organizational Impact
Overview
Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Learn how to combat alert fatigue in Site Reliability Engineering through strategic implementation of Service Level Objectives (SLOs) in this 33-minute conference talk from Conf42 SRE 2025. Discover the root causes and warning signs of alert fatigue that plague engineering teams, including decreased response times, alert dismissal patterns, and the cascading effects on system reliability. Explore practical strategies for improving alert volume and health by establishing meaningful thresholds, reducing noise, and creating actionable notifications that truly matter. Master the implementation of SLOs as a powerful framework to prioritize alerts based on actual user impact rather than arbitrary system metrics. Gain insights into building sustainable alerting practices that enhance rather than hinder your team's ability to maintain system reliability, with concrete examples and actionable recommendations for transforming your monitoring strategy from reactive noise to proactive intelligence.
Syllabus
00:00 Introduction and Speaker Information
00:49 Understanding Alert Fatigue
04:23 Signs and Consequences of Alert Fatigue
07:38 Improving Alert Volume and Health
21:06 Implementing SLOs to Combat Alert Fatigue
28:29 Conclusion and Resources
Taught by
Conf42