Building Reliable Operations - Incident Management and MTTR Reduction with Fannie Mae Case Study
AWS Events via YouTube
Overview
Explore proven practices for building reliable operations and reducing Mean Time to Resolution (MTTR) through effective incident management in this 54-minute conference talk from AWS re:Invent 2025. Learn how Fannie Mae transformed its operations by building a cross-region observability platform on AWS, automating incident response, and improving reliability across its hybrid environment. Gain practical strategies for implementing automated incident management, establishing effective on-call processes, and using AWS services to reduce downtime and improve operational reliability in your organization.
Syllabus
AWS re:Invent 2025 - Building reliable operations, feat. Fannie Mae (COP340)
Taught by
AWS Events