Give the Gift That Unlocks Potential
Google AI Professional Certificate - Learn AI Skills That Get You Hired
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn to proactively prepare for system failures and unexpected incidents through chaos engineering principles and game day exercises in this 41-minute conference talk. Discover how to implement structured approaches to testing system resilience by deliberately introducing controlled failures and disruptions. Explore the methodology behind chaos engineering, including how to design effective experiments that reveal weaknesses in distributed systems before they cause real outages. Understand the concept of game days as collaborative exercises where teams practice incident response procedures in a safe, controlled environment. Gain insights into building organizational culture around failure preparation, including how to create realistic scenarios that test both technical systems and human processes. Master techniques for measuring system reliability, identifying single points of failure, and improving overall system robustness through intentional stress testing. Examine real-world case studies and practical examples of how chaos engineering has helped organizations reduce mean time to recovery and increase confidence in their infrastructure's ability to handle unexpected events.
Syllabus
Plan For Unplanned Work: Game Days With Chaos Engineering by Daniel Afonso
Taught by
All Things Open