Introduction to Data Engineering on AWS

Packt via Coursera

Overview

This course features Coursera Coach, an interactive tool for real-time conversations that help you test your knowledge, challenge assumptions, and deepen your understanding as you progress through the course.

In this course, you'll gain a working understanding of data engineering with AWS Glue and Amazon Redshift, two widely used services for modern data workflows. You will learn to manage and transform data at scale, from cataloging and processing with AWS Glue to data warehousing and analytics with Redshift. Hands-on tutorials cover the core concepts and practical applications needed to streamline data pipelines and optimize query performance.

As you progress through the course, you will explore AWS Glue features such as Data Catalogs, ETL development, job bookmarking, and data quality evaluation, so that you can automate data workflows and manage large datasets effectively. With Amazon Redshift, you will learn how to configure clusters, optimize queries, and work with Redshift Spectrum and Redshift Serverless to improve the scalability and efficiency of your data operations.

This course is ideal for data professionals looking to strengthen their cloud-based data engineering skills, especially those who want to integrate AWS Glue and Redshift into their existing systems. It is designed for both beginners and intermediate learners: a basic understanding of data analytics is enough to get started, and prior knowledge of AWS or data engineering concepts is beneficial. The course offers a solid foundation and practical skills that can be applied in real-world data engineering roles.

By the end of the course, you will be able to build and optimize ETL pipelines using AWS Glue, manage data workflows, configure Redshift clusters, optimize query performance, and deploy Redshift Serverless for scalable data warehousing.

Syllabus

  • Introduction: Data Is the New Oil
    • In this module, we will introduce the concept of data as the new oil and explore its growing importance in the modern digital world. You'll gain a high-level overview of the course and understand the pivotal role data plays in driving innovation and business success.
  • Know Your Trainer
    • In this module, we will introduce your trainer and provide insights into their professional background. You’ll learn what to expect from this course and how their expertise will guide you throughout your learning journey.
  • Getting Started with Data Analytics
    • In this module, we will explore the foundational concepts of data engineering, focusing on how AWS services facilitate modern data analytics. You'll also be introduced to essential terminologies to build a strong understanding of data engineering workflows.
  • AWS Glue: Catalog and Process Your Data
    • In this module, we will dive deep into AWS Glue, exploring its features for data cataloging, ETL processes, and data quality management. You’ll gain hands-on experience in setting up and orchestrating workflows that automate data transformation and processing tasks.
  • Amazon Redshift: A Data Warehouse in AWS
    • In this module, we will explore Amazon Redshift, focusing on its architecture, cluster management, and querying capabilities. You'll also learn about advanced features like Redshift Spectrum, Serverless Redshift, and materialized views to optimize your data warehousing experience.

Taught by

Packt - Course Instructors
