Overview
Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Explore how to build scalable, reliable ETL pipelines for managing large, diverse data sources through a comprehensive conference talk featuring HealthVerity's custom ETL framework called Theseus. Learn how this framework streamlines data ingestion and transformation by fully leveraging Databricks-native capabilities, including Delta Live Tables (DLT), auto loader, and event-driven orchestration. Discover the architecture that decouples supplier logic and implements structured bronze, silver, and gold data layers to ensure high-performance, fault-tolerant data processing with minimal operational overhead. Understand how this approach delivers faster time-to-value, simplified governance, and improved data quality within a declarative framework that reduces engineering effort. Examine real-world implementation strategies for automating complex data workflows, optimizing cost efficiency, and enhancing scalability while showcasing how Databricks-native tools drive tangible business outcomes in the healthcare data marketplace industry.
Syllabus
Health Data, Delivered: How DLT Powers the HealthVerity Marketplace
Taught by
Databricks