Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Python Over Data Lakes: Declarative Environments, Data Management and Other Things with Feathers

Data Council via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
In this 32-minute conference talk from Data Council, learn how to design reproducible data workloads over Data Lakes as Ciro Greco shares valuable insights on decoupling code, compute, and data management for deterministic pipeline reproduction. Discover practical approaches for Python data pipeline developers and engineers debugging complex workflows, with a focus on leveraging open-source components like Iceberg, Arrow, and Docker to create declarative functional DAGs that execute efficiently in cloud environments. Particularly valuable for technical teams working with data lakes who need to ensure reproducibility and maintainability in their data processing systems.

Syllabus

Python Over Data Lakes: Declarative Environments, Data Management & Other Things w/ Feathers

Taught by

Data Council

Reviews

Start your review of Python Over Data Lakes: Declarative Environments, Data Management and Other Things with Feathers

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.