Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn about the operational challenges and troubleshooting experiences of managing CERN's OpenStack private cloud through a conference talk that explores over 12 years of continuous 24/7 operations. Discover how operators at CERN handle complex upgrades to operating systems, virtualization layers, and OpenStack control planes while maintaining stability for users. Examine real-world case studies of challenging bugs encountered during upgrades that go beyond routine operational issues, including identification, troubleshooting, and resolution strategies. Gain insights into the occupational hazards of cloud operations and understand how experienced operators manage the most problematic issues that can disrupt large-scale private cloud environments. Explore the balance between implementing necessary security fixes, performance improvements, and new features while mitigating the risks of upgrade-related complications in mission-critical infrastructure.
Syllabus
Operator nightmares in the trenches of CERN’s OpenStack cloud
Taught by
OpenInfra Foundation