Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore how artificial intelligence is transforming Site Reliability Engineering (SRE) practices and customer experience in this 16-minute conference talk from Conf42 DevOps 2026. Discover the evolving challenges facing modern production systems and learn how AI technologies are revolutionizing reliability engineering approaches. Examine the current reality of production environments and understand the critical need for improved visibility and system understanding. Delve into AI's expanding role in enhancing system awareness, enabling continuous learning capabilities, and maintaining contextual knowledge across complex infrastructures. Gain insights into how machine learning and AI-driven tools are reshaping traditional SRE methodologies, from incident response and root cause analysis to predictive maintenance and automated remediation. Understand the implications of AI integration for customer experience optimization and service reliability improvements. Consider the future trajectory of SRE practices as they become increasingly AI-augmented, including the potential for autonomous system management and intelligent decision-making processes. Learn about practical applications of AI in monitoring, alerting, and system optimization that are already transforming how reliability engineers approach their work and deliver value to end users.
Syllabus
Introduction and Overview
The Changing Landscape of SRE
Challenges in Modern Production Systems
Impact of AI on Reliability
The Reality of Production Environments
Efforts to Improve Visibility and Understanding
AI's Role in System Awareness
Continuous Learning and Context Retention
The Future of SRE and Reliability
Conclusion and Final Thoughts
Taught by
Conf42