Start speaking a new language. It’s just 3 weeks away.
Gain a Splash of New Skills - Coursera+ Annual Just ₹7,999
Overview
Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Explore the critical but often misunderstood aspects of error handling in high-scale distributed systems through this 22-minute conference talk from DevOpsDays Tel Aviv. Discover why short timeouts are crucial for system resilience yet dangerously prone to misuse, and learn how poorly implemented retry mechanisms can trigger catastrophic system-wide failures. Examine real-world scenarios through a detailed postmortem-style analysis of production incidents based on actual system problems. Master actionable strategies for designing intelligent retry logic, preventing service overload, and avoiding cascading failures that can bring down entire infrastructures. Gain practical wisdom, understand common pitfalls, and acquire essential system design insights that will fundamentally change how you approach implementing timeout and retry mechanisms in your applications.
Syllabus
Dancing with Failure - The Art of Timeouts & Retries, Alon Nativ
Taught by
DevOpsDays Tel Aviv