Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn how Meta's backbone change management system handles over 1 million network operations annually with minimal human intervention in this conference talk. Discover the architecture of a centralized scheduler that plans operations, automated orchestration systems that execute changes, and intent-aware scheduling that tailors controls to specific operation types. Explore the implementation of early warning systems for proactive risk mitigation and workflow-specific health checks designed to reduce noise and improve reliability. Understand the challenges posed by network growth, operational complexity, and risk management at scale, and examine practical solutions that ensure safe and efficient network operations in large-scale infrastructure environments. Gain insights from Meta's Tech Lead for WAN Management on scaling change management systems and addressing operational challenges in software-defined networking environments.
Syllabus
Backbone Change Management - Scaling @ Million+ Operations each Year
Taught by
NANOG