AI Engineer - Learn how to integrate AI into software applications
Google AI Professional Certificate - Learn AI Skills That Get You Hired
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore a conference talk from SREcon23 Europe/Middle East/Africa that delves into implementing Site Reliability Engineering (SRE) practices in a telecommunications company using Reliability Enhancing Procedures (REPs). Learn how Swisscom, following severe outages in 2021, transitioned from a traditional ITIL Operation model to SRE practices. Discover the innovative approach of creating cookbook-style work instructions to simplify and scale SRE adoption across the organization. Gain insights into the development and implementation of nine REPs designed to improve service reliability across hundreds of services. Understand how this initiative goes beyond a reliability improvement program, representing a significant transformation in service reliability management using SRE methodologies.
Syllabus
SREcon23 Europe/Middle East/Africa - Implementing SRE in a Telco with Reliability Enhancing...
Taught by
USENIX