Overview
Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Learn how to handle messy, unstable schema data in the bronze layer of medallion data architecture using native Python data structures instead of traditional pandas approaches in this conference talk from PyBay 2025. Discover practical strategies for reliably ingesting third-party data sources that contain inconsistent or changing schemas, which often challenge conventional tabular data processing methods. Explore how flexible data schemas and Python's native capabilities can provide more robust solutions for staging raw data before transformation and cleaning. Understand the principles behind effective bronze layer design that prioritizes data ingestion reliability over immediate queryability, enabling better downstream data processing and monitoring workflows for unpredictable data sources.
Syllabus
The Zen of the Bronze Layer Ingestion of Data with Unstable Schema — Aaron Wiegel (PyBay 2025)
Taught by
SF Python