Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Building Resilient Cloud-Native Platforms for Intermittent Networks

Conf42 via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn to build robust cloud-native platforms that can withstand network interruptions and maintain service continuity through this comprehensive conference talk from Conf42 Go 2025. Explore fundamental quality attributes in distributed systems, focusing on availability, resilience, and reliability as core design principles for modern cloud architectures. Discover how to define and measure availability metrics, establish meaningful Service Level Agreements (SLAs) and Service Level Objectives (SLOs), and implement risk mitigation strategies that protect against common failure scenarios. Master essential reliability patterns including store-and-forward mechanisms, intelligent backoff and retry strategies, circuit breakers for graceful degradation, bulkhead patterns for fault isolation, and fallback mechanisms for service continuity. Examine real-world applications through Chef's DevOps journey and analyze complex scenarios including microservice architectures, retail systems, and DevOps platforms to understand practical implementation challenges. Dive deep into Go-specific error handling techniques and language components that support resilient system design, while learning to conduct Failure Modes and Effects Analysis (FMEA) to proactively identify potential system weaknesses. Gain insights into monitoring and telemetry strategies that provide visibility into system health and performance, and understand testing methodologies that validate reliability assumptions under various network conditions.

Syllabus

00:00 Introduction to Quality Attributes in Distributed Systems
01:31 Understanding Availability and Resilience
04:23 Designing for Reliability and Risk Mitigation
05:52 Service Level Agreements and Objectives
06:16 Chef's DevOps Journey and Real-World Applications
11:01 Defining and Measuring Availability
16:54 Exploring Resilience and Reliability
23:40 Failure Modes and Effects Analysis FMEA
27:25 Infrastructure and Software Patterns for Reliability
31:41 Introduction to Error Handling in Go
32:13 Store and Forward Pattern
32:57 Backoff and Retry Strategies
33:06 Circuit Breakers and Asynchronous Services
33:49 Bulkhead and Fallback Patterns
34:16 Netflix's Histrix Module
35:40 Monitoring and Telemetry
37:00 Microservice Example
39:40 Complex Retail Scenario
41:52 DevOps System Example
50:17 Go Language Components
53:03 Testing and Reliability
55:38 Conclusion and Key Takeaways

Taught by

Conf42

Reviews

Start your review of Building Resilient Cloud-Native Platforms for Intermittent Networks

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.