Build the Finance Skills That Lead to Promotions — Not Just Certificates
AI Engineer - Learn how to integrate AI into software applications
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Discover how to manage large-scale datasets efficiently with this PyCon US talk that explores the integration of Apache Iceberg™ with Python through PyIceberg. Learn how this open table format addresses the challenges of handling terabyte-scale data, evolving schemas, and maintaining consistency across different tools. Follow along as the presentation introduces Iceberg and PyIceberg fundamentals, highlighting features like schema evolution and transactional guarantees specifically designed for the Python ecosystem. See practical demonstrations of creating, querying, and writing to Iceberg tables while maintaining interoperability with Python-native dataframes such as PyArrow and Pandas. Dive deeper into Iceberg's file structure, including metadata files, manifest lists, and manifests, to understand how PyIceberg leverages this architecture for transactional table updates and query optimization. Explore advanced features like hidden partitioning and time travel that make table management more efficient and flexible at scale. This 28-minute talk provides essential knowledge for Python developers working with large datasets.
Syllabus
Bridging Python and Apache Iceberg™: The Power of PyIceberg
Taught by
PyCon US