Building Reliable Operations - Incident Management and MTTR Reduction with Fannie Mae Case Study
AWS Events via YouTube
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore proven practices for building reliable operations and reducing Mean Time to Resolution (MTTR) through effective incident management in this 54-minute conference talk from AWS re:Invent 2025. Discover how to manage incidents effectively across complex infrastructure by learning from Fannie Mae's transformation of their operations through building a cross-region observability platform on AWS. Understand how they automated incident response and improved reliability across their hybrid environment. Gain practical strategies for implementing automated incident management, establishing effective on-call processes, and leveraging AWS services to enhance operational reliability within your organization. Learn specific techniques for managing incidents across complex infrastructure, building observability platforms that span multiple regions, and creating automated response systems that reduce downtime and improve overall system reliability.
Syllabus
AWS re:Invent 2025 - Building reliable operations, feat. Fannie Mae (COP340)
Taught by
AWS Events