Overview
Learn the fundamentals of Apache Airflow, an open-source platform for developing, scheduling, and monitoring workflows, in this comprehensive tech talk from the CMU Database Group. Explore Airflow's core concepts including DAGs (Directed Acyclic Graphs), operators, and schedulers while understanding how to design and implement data pipelines for complex workflow orchestration. Discover best practices for managing dependencies, handling failures, and scaling workflows in production environments. Gain insights into Airflow's architecture, including its web interface, metadata database, and executor components. Understand how to integrate Airflow with various data sources, cloud platforms, and other tools in the modern data stack. Master techniques for monitoring workflow performance, debugging failed tasks, and implementing proper logging and alerting strategies. Examine real-world use cases and practical examples that demonstrate Airflow's capabilities in data engineering, ETL processes, and automated data processing pipelines.
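To make the DAG idea concrete, here is a minimal sketch of how a dependency graph determines a valid run order. It uses only Python's standard-library `graphlib` module rather than Airflow's own API, and the task names (`extract`, `transform`, `validate`, `load`) and helper functions are hypothetical, chosen purely for illustration; they are not taken from the talk.

```python
from graphlib import TopologicalSorter, CycleError

# Hypothetical pipeline: each key maps to the set of tasks it depends on.
dependencies = {
    "extract": set(),
    "transform": {"extract"},
    "validate": {"extract"},
    "load": {"transform", "validate"},
}


def execution_order(deps):
    """Return one valid order in which every task runs after its upstream tasks."""
    return list(TopologicalSorter(deps).static_order())


def is_acyclic(deps):
    """Return True if the dependency graph has no cycle, i.e. it is a valid DAG."""
    try:
        execution_order(deps)
        return True
    except CycleError:
        return False


order = execution_order(dependencies)
print(order)
```

`transform` and `validate` do not depend on each other, so either may come first in the printed order; a scheduler is free to run such tasks in parallel. A graph with a cycle, such as two tasks that each depend on the other, has no valid order, which is why workflows are required to be acyclic.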
Syllabus
Introduction to Apache Airflow (Julian LaNeve)
Taught by
CMU Database Group