Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn how to achieve zero data loss in large-scale distributed systems through this conference talk from Conf42 SRE 2025. Explore the fundamentals of microservice architecture and asynchronous systems before diving into the specific challenges that distributed systems present for data integrity. Discover the critical impact these challenges have on Site Reliability Engineering practices and understand comprehensive strategies for preventing data loss at scale. Master advanced recovery techniques and learn to design robust distributed acknowledgement protocols that ensure message delivery reliability. Examine standardization approaches for message formats that enhance system interoperability and reduce failure points. Analyze performance optimization strategies while understanding the crucial trade-offs between consistency and performance in distributed environments. Apply these concepts through a detailed case study that demonstrates real-world implementation of zero data loss architectures, concluding with key takeaways for implementing these practices in your own systems.
Syllabus
00:00 Introduction to Microservice Architecture
00:25 Understanding Asynchronous Systems
01:54 Challenges in Distributed Systems
02:43 Impact on Site Reliability Engineering SRE
03:45 Preventing Data Loss
04:14 Advanced Recovery Techniques
04:56 Distributed Acknowledgement Protocol Design
06:06 Standardizing Message Formats
06:56 Performance Optimization Strategies
08:58 Consistency vs. Performance Trade-offs
10:18 Case Study and Key Takeaways
11:13 Conclusion
Taught by
Conf42