Tracking Row Identities and Changes in Delta Lake and Apache Iceberg Tables
DSDSD - Dutch Seminar on Data Systems Design via YouTube
Google AI Professional Certificate - Learn AI Skills That Get You Hired
JavaScript Programming for Beginners
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore a 15-minute conference talk from the Dutch-Belgian DataBase Day (DBDBD) that delves into the intricacies of row identity tracking and change management in Delta Lake and Apache Iceberg tables. Learn about the challenges and solutions for assigning dense, unique, and monotonically increasing identity sequence numbers while maintaining optimistic concurrency. Discover efficient methods for deriving differences between arbitrary snapshot versions of tables. Delivered by Bart Samwel, Principal Engineer at Databricks Amsterdam, known for his contributions to Delta Lake innovations including Deletion Vectors, Liquid Clustering, and Row-Level Concurrency, this presentation from DBDBD 2024 at Science Park in Amsterdam offers valuable insights into modern database management techniques.
Syllabus
Follow your rows (wherever they may go) by Bart Samwel (DBDBD 2024)
Taught by
DSDSD - Dutch Seminar on Data Systems Design