Scaling Thanos and Prometheus for Massive Metrics Deployment at Reddit
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
Explore how Reddit scales its monitoring infrastructure using Thanos and Prometheus in this conference talk. Discover the custom monitoring operator Reddit developed to manage thousands of Prometheus instances, which together handle over 45 million samples per second and 600 million active series. Learn about the Kubernetes controller used to orchestrate this deployment and how Thanos enables long-term storage and global querying. Gain insights into the tools Reddit's team developed, the challenges they faced, and the solutions they implemented to build a robust and scalable metrics system for one of the world's largest social media platforms.
Syllabus
Scaling Thanos at Reddit - Ben Kochie & Trevor Riles, Reddit
Taught by
CNCF [Cloud Native Computing Foundation]