Courses from 1000+ universities
$7.2 billion in combined revenue since 2020. $8 billion in lost market value. This merger marks the end of an era in online education.
600 Free Google Certifications
Computer Science
Information Technology
Data Analysis
The Science of Gastronomy
Transforming Digital Learning: Learning Design Meets Service Design
Intelligenza Artificiale
Organize and share your learning with Class Central Lists.
View our Lists Showcase
Explore strategies for implementing and improving on-call practices in SRE and tech roles, emphasizing its importance and offering actionable ideas for more humane and effective 24/7 operations.
Innovative solution for sustainable battery lifecycle management in data centers using IoT, AI, and real-time monitoring to reduce costs, prevent downtime, and achieve zero-carbon waste diversion goals.
Practical approach to predicting storage device failures in data centers using multi-phase proactive sampling, addressing challenges of accuracy, performance, and cost-effectiveness for Site Reliability Engineers.
Explore the evolution of SRE over 25 years, gaining insights on essential career skills and future trends in tech infrastructure management from an industry veteran's perspective.
Discover how automated load testing in production enhances system reliability, developer confidence, and business performance while minimizing risks and human intervention.
Explore eBPF for debugging production issues in Golang, overcoming traditional tool limitations. Learn practical techniques and gain insights for immediate problem-solving without special debug modes.
Evolving PayPal's E2E customer flow testing into a proactive synthetic monitoring system for improved incident prevention, faster detection, and enhanced site reliability.
Explore cost-saving strategies and challenges of running GKE clusters on Spot Instances, including capacity management, edge cases, and the importance of SRE principles in this dynamic environment.
Explore a real-time ML platform designed to assist contact center agents, addressing challenges in throughput, latency, and fault tolerance while improving customer support efficiency.
Explore Adaptive Paging, an innovative alert handler leveraging tracing and semantic conventions to identify and page teams closest to issues in complex distributed systems, reducing alert fatigue.
Deep-dive into domain and DNS safety, exploring threat vectors, detection methods, and mitigation strategies for improved security and availability of infrastructure domains and subdomains.
Explore DBS Bank's framework for building resilient infrastructure through cultural transformation, focusing on collaborative approaches and system-wide improvements for a proactive tech environment.
Explore innovative analytics-driven services for improved site reliability, including automated outage management, predictive Slackbot solutions, and user engagement optimization through event data analysis.
Explore Keptn's integration with K6 for scalable performance testing in delivery pipelines, including SLO evaluation and quality gates for efficient service monitoring and optimization.
Explore Cloudflare's autonomous system for server diagnostics and recovery at scale. Learn to transform automation into autonomy, applying SRE principles for increased efficiency and productivity.
Get personalized course recommendations, track subjects and courses with reminders, and more.