Build & Analyze Your Data Lakehouse

Coursera via Coursera

Overview

The modern data landscape demands professionals who can seamlessly bridge the gap between data lakes and data warehouses. This course transforms your ability to architect, implement, and optimize lakehouse platforms that deliver both flexibility and performance.

This Short Course was created to help data engineering professionals accomplish scalable data platform implementation using advanced SQL and lakehouse patterns. By completing this course, you'll be able to register massive file-based datasets as queryable external tables, make informed decisions between Delta Lake, Iceberg, and Hudi formats, and automate robust data ingestion pipelines that keep your warehouse synchronized with your lake.

By the end of this course, you will be able to:

  • Apply configurations to register file-based datasets as external tables
  • Analyze the technical capabilities of different open-source table formats
  • Create a data ingestion pipeline within a lakehouse architecture

This course is unique because it combines hands-on SQL implementation with strategic architectural decision-making, giving you both the technical skills and analytical framework needed for enterprise-scale data platforms.

To be successful in this course, you should have a background in SQL, data warehousing concepts, and distributed systems fundamentals.

Syllabus

  • Module 1: External Table Configuration Mastery
    • Learners will master the technical implementation of external table configurations to enable direct querying of file-based datasets in cloud storage.
  • Module 2: Open-Source Table Format Analysis
    • Learners will develop analytical frameworks to evaluate and compare the technical capabilities of Delta Lake, Apache Iceberg, and Apache Hudi for specific business requirements.
  • Module 3: Data Ingestion Pipeline Implementation
    • Learners will architect and implement automated data ingestion pipelines that orchestrate data movement across medallion architecture zones within lakehouse platforms.
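
For readers unfamiliar with the medallion architecture mentioned in Module 3, the following is a minimal sketch of the idea, not material from the course itself. It assumes the commonly used zone names (bronze for raw data, silver for cleaned data, gold for aggregated data) and uses plain in-memory Python lists in place of real lakehouse tables. The record fields and the function names `to_silver`, `to_gold`, and `run_pipeline` are hypothetical.

```python
from collections import defaultdict

# Hypothetical raw records as they might land in a "bronze" zone:
# everything is a string, and duplicates or malformed rows are possible.
BRONZE = [
    {"order_id": "1", "region": "eu", "amount": "10.50"},
    {"order_id": "2", "region": "US", "amount": "4.00"},
    {"order_id": "2", "region": "US", "amount": "4.00"},  # duplicate row
    {"order_id": "3", "region": "eu", "amount": "oops"},  # malformed amount
]


def to_silver(bronze_rows):
    """Bronze -> silver: drop malformed rows, deduplicate, normalize types."""
    seen_ids = set()
    silver = []
    for row in bronze_rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # skip rows whose amount cannot be parsed
        if row["order_id"] in seen_ids:
            continue  # skip duplicates by order_id
        seen_ids.add(row["order_id"])
        silver.append(
            {
                "order_id": int(row["order_id"]),
                "region": row["region"].upper(),
                "amount": amount,
            }
        )
    return silver


def to_gold(silver_rows):
    """Silver -> gold: aggregate cleaned rows into totals per region."""
    totals = defaultdict(float)
    for row in silver_rows:
        totals[row["region"]] += row["amount"]
    return dict(totals)


def run_pipeline(bronze_rows):
    """Run both stages in order, as an orchestrator would."""
    return to_gold(to_silver(bronze_rows))


if __name__ == "__main__":
    print(run_pipeline(BRONZE))
```

In a real lakehouse each stage would typically read from and write to tables in a format such as Delta Lake, Iceberg, or Hudi, and a scheduler would trigger the stages; the sketch only shows how data is progressively refined from one zone to the next.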

Taught by

Hurix Digital
