- Explore Azure Databricks
In this module, you learn how to:
- Provision an Azure Databricks workspace
- Identify core workloads for Azure Databricks
- Use the data governance tools Unity Catalog and Microsoft Purview
- Describe key concepts of an Azure Databricks solution
- Perform data analysis with Azure Databricks
In this module, you learn how to:
- Ingest data using Azure Databricks.
- Use the different data exploration tools in Azure Databricks.
- Analyze data with DataFrame APIs.
- Use Apache Spark in Azure Databricks
In this module, you'll learn how to:
- Describe key elements of the Apache Spark architecture.
- Create and configure a Spark cluster.
- Describe use cases for Spark.
- Use Spark to process and analyze data stored in files.
- Use Spark to visualize data.
- Manage data with Delta Lake
In this module, you learn:
- What Delta Lake is
- How to create Delta Tables
- How to use schema versioning and time travel in Delta Lake
- How to maintain data integrity with Delta Lake
- Build Lakeflow Declarative Pipelines in Azure Databricks
In this module, you'll learn how to:
- Describe Lakeflow Declarative Pipelines
- Ingest data into Lakeflow Declarative Pipelines
- Use Lakeflow Declarative Pipelines for real-time data processing
- Deploy workloads with Lakeflow Jobs
In this module, you learn:
- What Lakeflow Jobs are
- The key components and benefits of Lakeflow Jobs
- How to deploy workloads using Lakeflow Jobs
Overview
Syllabus
- Explore Azure Databricks
  - Introduction
  - Get started with Azure Databricks
  - Identify Azure Databricks workloads
  - Understand key concepts
  - Data governance using Unity Catalog and Microsoft Purview
  - Exercise - Explore Azure Databricks
  - Module assessment
  - Summary
- Perform data analysis with Azure Databricks
  - Introduction
  - Ingest data with Azure Databricks
  - Data exploration tools in Azure Databricks
  - Data analysis using DataFrame APIs
  - Exercise - Explore data with Azure Databricks
  - Module assessment
  - Summary
- Use Apache Spark in Azure Databricks
  - Introduction
  - Get to know Spark
  - Create a Spark cluster
  - Use Spark in notebooks
  - Use Spark to work with data files
  - Visualize data
  - Exercise - Use Spark in Azure Databricks
  - Module assessment
  - Summary
- Manage data with Delta Lake
  - Introduction
  - Get started with Delta Lake
  - Create Delta tables
  - Implement schema enforcement
  - Data versioning and time travel in Delta Lake
  - Data integrity with Delta Lake
  - Exercise - Use Delta Lake in Azure Databricks
  - Module assessment
  - Summary
- Build Lakeflow Declarative Pipelines
  - Introduction
  - Explore Lakeflow Declarative Pipelines
  - Data ingestion and integration
  - Real-time processing
  - Exercise - Create a Lakeflow Declarative Pipeline
  - Module assessment
  - Summary
- Deploy workloads with Lakeflow Jobs
  - Introduction
  - What are Lakeflow Jobs?
  - Understand key components of Lakeflow Jobs
  - Explore the benefits of Lakeflow Jobs
  - Deploy workloads using Lakeflow Jobs
  - Exercise - Create a Lakeflow Job
  - Module assessment
  - Summary