Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Coursera

Open source Data Engineering with Spark, dbt & Airflow

Coursera via Coursera Professional Certificate

Overview

Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
This program equips you with the open-source tools and architectural thinking used by professional data engineers to build scalable, reliable data systems from the ground up. You will work hands-on with Apache Spark for distributed data processing, dbt for modular SQL-based transformation, and Apache Airflow for workflow orchestration — the same stack powering data infrastructure at leading technology and data-driven organizations worldwide. Across the courses, you will gain practical expertise in designing dimensional data models, implementing incremental load strategies, optimizing Spark job performance, enforcing data quality with automated testing frameworks, and deploying pipelines through CI/CD workflows. You will also develop foundational skills in cloud storage provisioning, containerization with Docker, and version control best practices that mirror real production environments. By the end of this Program, you will be able to design and deploy end-to-end data pipelines that ingest from diverse sources, transform data through well-tested models, and deliver analytics-ready datasets to downstream consumers — demonstrating job-ready engineering skills valued across analytics engineering, data platform, and data infrastructure roles.

Syllabus

  • Course 1: Building Automated Data Pipelines with Spark,dbt,and Airflow
  • Course 2: Optimizing Spark and Cloud Data Storage for Analytics
  • Course 3: Data Modeling & Warehousing Fundamentals in Data Engineering
  • Course 4: DevOps and CI/CD for Data Engineering Performance
  • Course 5: Data Quality and Debugging for Reliable Pipelines
  • Course 6: Career Development For Open Source Data Engineering

Courses

Taught by

Professionals from the Industry

Reviews

Start your review of Open source Data Engineering with Spark, dbt & Airflow

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.