Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore the critical role of retry mechanisms in building resilient distributed systems through this 26-minute conference talk from LeadDev Berlin 2025. Challenge the common misconception that retrying failed operations is futile by learning how strategic retries actually enhance system reliability and fault tolerance. Discover the essential components of effective retry strategies, including proper timeout configuration, retry limits, and knowing when to abandon attempts. Master techniques for implementing safe retry mechanisms that prevent cascading failures and avoid costly resource drain. Examine real-world scenarios where retry logic proves invaluable for handling transient failures in distributed architectures. Gain insights into balancing persistence with pragmatism when designing fault-tolerant systems that can gracefully handle temporary network issues, service unavailability, and other common distributed system challenges.
Syllabus
Try, try again | Sam Newman | LeadDev Berlin 2025
Taught by
LeadDev