Tracking Row Identities and Changes in Delta Lake and Apache Iceberg Tables
DSDSD - Dutch Seminar on Data Systems Design via YouTube
Google, IBM & Meta Certificates — 40% Off for a Limited Time
Get 20% off all career paths from fullstack to AI
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore a 15-minute conference talk from the Dutch-Belgian DataBase Day (DBDBD) that delves into the intricacies of row identity tracking and change management in Delta Lake and Apache Iceberg tables. Learn about the challenges and solutions for assigning dense, unique, and monotonically increasing identity sequence numbers while maintaining optimistic concurrency. Discover efficient methods for deriving differences between arbitrary snapshot versions of tables. Delivered by Bart Samwel, Principal Engineer at Databricks Amsterdam, known for his contributions to Delta Lake innovations including Deletion Vectors, Liquid Clustering, and Row-Level Concurrency, this presentation from DBDBD 2024 at Science Park in Amsterdam offers valuable insights into modern database management techniques.
Syllabus
Follow your rows (wherever they may go) by Bart Samwel (DBDBD 2024)
Taught by
DSDSD - Dutch Seminar on Data Systems Design