Engineering Failure-Resilient Systems

Conf42 via YouTube

Overview

This 16-minute conference talk covers how to engineer systems that withstand and recover from failures. It explains why failure is an inevitable part of engineering and what system downtime costs businesses and users. The talk then walks through strategies for building resilient systems, starting with the pillars of resilience engineering that underpin robust system design. It introduces chaos engineering and antifragility, the idea that systems can become stronger through stress and failure, and examines circuit breaker design and dynamic resource allocation as ways to prevent cascading failures and keep systems stable. It also covers the role of monitoring and observability in detecting, diagnosing, and responding to issues before they become critical failures, and reviews common failure patterns in distributed systems along with ways to identify and mitigate them. The talk closes with approaches to achieving high reliability and a resilience engineering toolkit of tools, frameworks, and techniques that you can apply to your own systems and infrastructure.

Syllabus

00:00 Introduction and Personal Confession
00:09 The Importance of Failure in Engineering
01:06 Overview of the Talk
01:55 The Reality of System Failures
04:23 Cost and Impact of Downtime
05:13 Strategies for Building Resilient Systems
05:51 Pillars of Resilience Engineering
06:51 Chaos Engineering and Antifragility
08:14 Circuit Breaker Design and Dynamic Resource Allocation
09:37 Monitoring and Observability
10:21 Common Failure Patterns
13:00 Achieving High Reliability
13:59 Resilience Engineering Toolkit
15:08 Conclusion and Q&A

Taught by

Conf42
