Bridging Python and Apache Iceberg: The Power of PyIceberg

PyCon US via YouTube

Overview

This PyCon US talk explores how Apache Iceberg™ integrates with Python through PyIceberg. Iceberg is an open table format that addresses the challenges of handling terabyte-scale data, evolving schemas, and keeping data consistent across different tools. The presentation introduces Iceberg and PyIceberg fundamentals, highlighting schema evolution and transactional guarantees as they apply to the Python ecosystem. Practical demonstrations show how to create, query, and write to Iceberg tables while staying interoperable with Python-native libraries such as PyArrow and pandas.

The talk also examines Iceberg's file structure (metadata files, manifest lists, and manifests) to show how PyIceberg uses this architecture for transactional table updates and query optimization. It covers advanced features such as hidden partitioning and time travel, which make table management more efficient and flexible at scale. The 28-minute talk is aimed at Python developers working with large datasets.

Syllabus

Bridging Python and Apache Iceberg™: The Power of PyIceberg

Taught by

PyCon US
