An Extremely Technical Overview of How Apache Iceberg Planning Actually Works
CMU Database Group via YouTube
AI, Data Science & Cloud Certificates from Google, IBM & Meta
Master Production-Ready Machine Learning, Step by Step
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Explore the intricate technical mechanisms behind Apache Iceberg's query planning system in this comprehensive database seminar. Delve deep into the internal workings of how Apache Iceberg processes and optimizes queries, examining the sophisticated algorithms and data structures that enable efficient query execution across large-scale data lakes. Learn about the planning phase architecture, metadata handling, partition pruning strategies, and the decision-making processes that occur when Iceberg determines optimal query execution paths. Understand how the system manages schema evolution, handles concurrent operations, and maintains ACID properties during the planning stage. Gain insights into performance optimization techniques, cost-based optimization strategies, and the integration points between Iceberg's planning layer and various compute engines. This technical deep-dive provides database professionals and data engineers with detailed knowledge of Iceberg's query planning internals, enabling better understanding of performance characteristics and optimization opportunities in modern data lake architectures.
Syllabus
An Extremely Technical Overview of How Apache Iceberg Planning Actually Works (Russell Spitzer)
Taught by
CMU Database Group