Overview
Explore the basics of site reliability engineering for DevOps. Learn SRE techniques for release, change and incident management, self-service automation, and more.
Syllabus
Introduction
- Reliability engineering basics
- What you should know
- Your job as a DevOp
- You aren't Google or Netflix
- Release engineering
- Change management
- Self-service automation
- SLAs and SLOs
- Incident management
- Introducing postmortems
- The postmortem process
- Troubleshooting
- Performance engineering
- Capacity and scalability
- Distributed design
- Deliberate adversity
- Organizing SREs
- The softer side of SRE
- Next steps
Taught by
James Wickett and Ernest Mueller