Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Google

Modernizing Data Lakes and Data Warehouses with Google Cloud

Google via Google Skills

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
While the traditional approaches of using data lakes and data warehouses can be effective, they have shortcomings, particularly in large enterprise environments. This course introduces the concept of a data lakehouse and the Google Cloud products used to create one. A lakehouse architecture uses open-standard data sources and combines the best features of data lakes and data warehouses, which addresses many of their shortcomings.

Syllabus

  • Course introduction
    • Course Introduction
  • Introduction to Modern Data Engineering on Google Cloud
    • Module 1: Introduction to modern data engineering on Google Cloud
    • The classics: Data lakes and data warehouses
    • The modern approach: Data lakehouse
    • Choosing the right architecture
    • Data warehouses, lakes, and lakehouses concepts
  • Building a Data Lakehouse with Cloud Storage, Open Formats, and BigQuery
    • Module 2: Building a data lakehouse with Cloud Storage, open formats, and BigQuery
    • Building a data lake foundation
    • Introduction to Apache Iceberg open table format
    • BigQuery as the central processing engine
    • Combining operational data in AlloyDB
    • Combining operational and analytical data with federated queries
    • Real world use case
    • Data lakehouse features and benefits
    • Federated query with BigQuery
  • Modernizing Data Warehouses with BigQuery and BigLake
    • Module 3: Modernizing data warehouses with BigQuery and BigLake
    • BigQuery fundamentals
    • Partitioning and clustering in BigQuery
    • Introducing BigLake and external tables
    • Querying external data and Iceberg tables
    • BigQuery and BigLake
  • Advanced Lakehouse Patterns and Data Governance
    • Module 4: Advanced lakehouse patterns and data governance
    • Data governance and security in a unified platform
    • Demo: Data Loss Prevention
    • Analytics and machine learning on the lakehouse
    • Real-world lakehouse architectures and migration strategies
    • Security and ML in the lakehouse
  • Labs and Best Practices
    • Module 5: Labs and best practices
    • Getting Started with BigQuery ML
    • Vector search with BigQuery
    • Review and best practices
  • Course Summary
    • Summary and next steps
  • Your Next Steps
    • Course Badge

Reviews

Start your review of Modernizing Data Lakes and Data Warehouses with Google Cloud

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.