Effective Disaster Recovery - The Day We Deleted Production
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
Explore a real-world disaster recovery scenario in this 37-minute conference talk from KubeCon + CloudNativeCon. Learn how InfluxData accidentally deleted all compute from a busy production cluster, causing a multi-hour outage. The talk covers the events leading up to the incident, the recovery process, customer reactions, and the changes made afterward. It walks through the CI/CD pipeline configuration and the specific change that triggered the outage, then reviews the disaster recovery plan: which parts worked and which needed improvement. The speakers combine technical and management perspectives on handling critical infrastructure failures and planning for disaster recovery.
Syllabus
Effective Disaster Recovery: The Day We Deleted Production - Rick Spencer & Wojciech Kocjan
Taught by
CNCF [Cloud Native Computing Foundation]