Google Data Analytics, IBM AI & Meta Marketing — All in One Subscription
Pass the PMP® Exam on Your First Try — Expert-Led Training
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore how Jupyter can be utilized as an effective incident response tool in this 20-minute conference talk from SREcon20 Americas. Learn to leverage Jupyter's dynamic exploration capabilities and result-sharing features for Site Reliability Engineering. Follow along as the speaker demonstrates triaging and remediating a simulated cache slowdown incident affecting site performance. Discover post-incident best practices for proper documentation and preparation for incident retrospectives. Gain insights into Jupyter notebooks, kernels, data analysis techniques, and practical applications of tools like Boto3 for efficient problem-solving in SRE contexts.
Syllabus
Intro
Acknowledgement of Country
Jupyter Notebooks
Jupyter Kernels
Data Science Origins
Imaginary Stack
Symptom
Spoiler
Query Data
Analyze Data
Boto3
We Know The Culprit
Connect to Host
Confirm the Problem
Fix the Problem
Post-Incident
Final Thoughts
Taught by
USENIX