Overview
Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Explore the complexities of managing Google's massive Build & CI infrastructure in this conference talk from SREcon25 EMEA. Dive deep into the Site Reliability Engineering perspective of supporting over 100,000 monthly users and maintaining a 98% cache hit rate across diverse computing environments. Learn about resource management strategies for handling doubling year-over-year growth, the practical application of the Pareto principle, and the implementation of critical caching layers including local, remote, and peer-to-peer systems. Discover how Google manages output storage processing hundreds of petabytes daily through advanced deduplication and compression techniques. Follow a complete user's build journey from desktop development to production deployment while gaining insights into optimizing large-scale build systems. Understand how a small SRE team maintains 24/7 service availability through strategic planning, comprehensive monitoring, and effective collaboration practices. Examine availability risks, standardization challenges, and evaluate the advantages and disadvantages of operating a monolithic Build & CI stack at planetary scale.
Syllabus
SREcon25 Europe/Middle East/Africa - The Bitter and the Sweet of Running a Planet-Scale Build &...
Taught by
USENIX