- Learn fundamental data governance concepts using Unity Catalog in Azure Databricks. Explore metastore architecture, external storage integration, and federation capabilities.
In this module, you learn how to:
- Assess data governance challenges and evaluate Unity Catalog as a solution
- Analyze the Unity Catalog metastore architecture and three-level namespace structure
- Explore external storage integration and Lakehouse Federation capabilities
- Configure Unity Catalog isolation boundaries, leverage automatic lineage tracking, and migrate from Hive metastore to establish centralized data governance.
By the end of this module, you'll be able to:
- Configure Unity Catalog isolation
- Use automatic lineage tracking to analyze data flow and plan changes
- Migrate tables from the Hive metastore to Unity Catalog
- Learn about security and access control in Azure Databricks Unity Catalog
At the end of this module, you're able to:
- Describe the Unity Catalog query lifecycle and where access checks occur.
- Distinguish explicit grants from inherited privileges and state when each is appropriate.
- Identify the minimum privileges required for common tasks (querying, creating tables).
- Compare Unity Catalog authorization to a legacy Hive metastore approach.
- Apply a repeatable pattern for granting and auditing access.
- Implement Advanced Security and Data Management in Unity Catalog
At the end of this module, you're able to:
- Explain the importance of fine-grained access control and distinguish between Row and Column Security and Dynamic Views.
- Implement masking and filtering rules at the table level using Row and Column Security, and identify scenarios where Dynamic Views are appropriate.
- Use Lakehouse Monitoring to profile data, detect anomalies such as nulls, zeros, and outliers, and interpret quality and drift metrics over time.
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Syllabus
- Establish Unity Catalog fundamentals
- Introduction
- Explore data governance using Unity Catalog
- Analyze Unity Catalog architecture
- Explore external data
- Exercise: Populate and navigate the metastore
- Module assessment
- Summary
- Structure Unity Catalog for governance
- Introduction
- Implement isolation methods
- Explore lineage
- Transition from the Hive Metastore
- Module assessment
- Exercise - Transition from the Hive metastore
- Summary
- Implement Security and Access Control in Unity Catalog
- Introduction
- Understand Query Lifecycle
- Explore Azure Databricks Marketplace and Delta Sharing
- Implement Access Control Strategies
- Knowledge Check
- Summary
- Implement Advanced Security and Data Management in Unity Catalog
- Introduction
- Understand Fine-Grained Access Control
- Implement Row Filtering and Column Masking
- Exercise: Secure data access
- Understand Lakehouse Monitoring
- Implement Lakehouse Monitoring
- Interpret Data Quality Dashboard
- Knowledge Check
- Summary