Modern data platforms need professionals who can bridge the gap between data lakes and data warehouses. This course teaches you to architect, implement, and optimize lakehouse platforms that combine the flexibility of a lake with the performance of a warehouse.
This Short Course was created to help data engineering professionals implement scalable data platforms using advanced SQL and lakehouse patterns.
Along the way, you'll register large file-based datasets as queryable external tables, weigh the trade-offs among the Delta Lake, Iceberg, and Hudi table formats, and automate data ingestion pipelines that keep your warehouse synchronized with your lake.
By the end of this course, you will be able to:
- Apply configurations to register file-based datasets as external tables
- Analyze the technical capabilities of different open-source table formats
- Create a data ingestion pipeline within a lakehouse architecture
This course combines hands-on SQL implementation with architectural decision-making, giving you both the technical skills and the analytical framework needed for enterprise-scale data platforms.
To be successful in this course, you should have a background in SQL, data warehousing concepts, and distributed systems fundamentals.