Courses from 1000+ universities
$7.2 billion in combined revenue since 2020. $8 billion in lost market value. This merger marks the end of an era in online education.
600 Free Google Certifications
Computer Science
Information Technology
Data Analysis
The Science of Gastronomy
Transforming Digital Learning: Learning Design Meets Service Design
Intelligenza Artificiale
Organize and share your learning with Class Central Lists.
View our Lists Showcase
Explore the human and technical aspects of SRE on-call duty, from survey findings to alternative models, with insights on improving the experience and questioning current practices.
Explore how a financial services provider developed an automated chaos engineering program for entire datacenter testing, overcoming challenges of fear and uncertainty while building robust infrastructure.
Discover best practices for collaborating with visually impaired incident analysts, including screen reader demos and strategies to overcome dashboard visibility challenges while leveraging diverse perspectives.
Discover how to tackle OpenSearch performance issues using scientific methods, monitoring, modeling, and experimentation to reduce latency and costs while creating a framework for continuous improvement.
Discover the challenges and strategies behind Squarespace's unprecedented migration of over 10 million domains from Google Domains, exploring the technical complexities of this massive infrastructure project.
Discover practical strategies for implementing Service Level Objectives (SLOs) in asynchronous microservices, from identifying key metrics to designing dashboards and alerts that effectively monitor customer experience.
Discover how Bluesky's small engineering team rapidly evolved their data architecture to handle explosive growth of 1M+ users per day and 1,600+ events/second during an 11-day crisis period.
Discover how to develop, test, and maintain effective Disaster Recovery Plans through collaborative tabletop exercises. Learn best practices for designing reliable services and ensuring your organization is prepared for recovery after disasters.
Discover a scalable, cost-effective multi-cluster Elasticsearch architecture that handles petabytes of data, featuring intelligent query routing, real-time auditing, and automated rate-limiting to reduce costs while maintaining performance.
Explore the significant overlap between Cybersecurity and Site Reliability Engineering (SRE), learning how these disciplines can collaborate to improve organizational performance through shared capabilities and mutual goals.
Explore the challenges of monitoring ML systems in production from an SRE perspective, learning how to bridge domain expertise gaps and implement effective observability practices for reliable ML operations at scale.
Discover how Netflix's streaming control plane architecture handles massive demand shifts through load balancing, failover strategies, intelligent scaling, and resilience techniques for maintaining service quality across global systems.
Explore how to overcome the challenges of building a reliability culture in organizations where it's not extrinsically valued, by leveraging intrinsic motivation, pride, and joy instead of relying on heroics during incidents.
Delve into the inner workings of Linux kernel memory management with Chris Down, exploring CPU memory internals, virtual memory, and common misconceptions that impact system reliability.
Explore how SRE confronts increasing complexity challenges, examining the tension between verb-centric resilience skills and noun-centric technological advancement in modern systems engineering.
Get personalized course recommendations, track subjects and courses with reminders, and more.