Data Engineering on AWS - The Complete Training

Packt via Coursera Specialization

Go to class Write review

Overview

Google, IBM & Meta Certificates – 40% Off

One plan covers every Professional Certificate on Coursera.

Unlock All Certificates

This specialization features Coursera Coach! A smarter way to learn with interactive, real-time conversations that help you test your knowledge, challenge assumptions, and deepen your understanding as you progress through the specialization. This specialization prepares you to become an AWS data engineer by teaching you to build data pipelines, process large datasets, and manage workflows using key AWS services. You’ll start with core data engineering concepts, then dive into AWS Glue for ETL and Amazon Redshift for data warehousing. Learn streaming with Kinesis and MSK, big data processing on EMR, and scalable data lakes with Lake Formation. You’ll query data using Athena, visualize insights in QuickSight, and orchestrate pipelines with Step Functions and AppFlow. Migration tools like DMS, DataSync, and Snow Family are also covered. Designed for data professionals and cloud engineers with basic AWS and data knowledge, this hands-on program is ideal for intermediate learners. By the end of the specialization, you will be able to design, build, and optimize scalable data pipelines, implement real-time analytics, manage large-scale data migrations, and deploy end-to-end data engineering workflows on AWS.

Syllabus

Course 1: Introduction to Data Engineering on AWS
Course 2: Advanced Data Processing and Analytics with AWS
Course 3: Orchestrating Data Pipelines and Advanced Data Strategies

Courses

0 reviews

View details

This course features Coursera Coach! A smarter way to learn with interactive, real-time conversations that help you test your knowledge, challenge assumptions, and deepen your understanding as you progress through the course. This course equips learners with the skills to efficiently process and analyze large volumes of data using AWS services. You will gain expertise in streaming data with Amazon Kinesis and Amazon MSK, running big data workloads on Amazon EMR, building data lakes on AWS, and querying data using Amazon Athena. The course is designed to help you develop a deep understanding of AWS tools and best practices for managing data in cloud environments. Through the course, you will explore the fundamentals of streaming data and various AWS services that support real-time analytics, such as Kinesis and MSK. You’ll also dive into building scalable data lakes using AWS Lake Formation and learn how to run big data processing workloads using Amazon EMR, along with optimizing them for cost and performance. Each module builds on the last, allowing you to master streaming, storage, and query operations seamlessly. As you progress, you will learn how to configure and optimize systems for maximum throughput. The course features hands-on exercises and best practices for using AWS tools, ensuring that you develop practical skills for real-world applications. The structure ensures that you understand the foundational concepts before advancing to complex data management and optimization techniques. This course is ideal for data engineers, cloud architects, or anyone looking to advance their skills in AWS data processing. While prior experience with cloud services is helpful, the course is designed for those with an intermediate understanding of data management and analytics. By the end of the course, you will be able to configure AWS services for real-time data processing, set up data lakes, optimize big data workloads on Amazon EMR, and query data efficiently using Amazon Athena.
0 reviews

View details

This course features Coursera Coach! A smarter way to learn with interactive, real-time conversations that help you test your knowledge, challenge assumptions, and deepen your understanding as you progress through the course. In this course, you'll gain a comprehensive understanding of data engineering using AWS Glue and Redshift, two critical tools for modern data workflows. You will be equipped with the skills to manage and transform data at scale, from cataloging and processing with AWS Glue to leveraging Redshift for powerful data warehousing and analytics. By diving into hands-on tutorials, you'll learn the core concepts and practical applications necessary to streamline data pipelines and optimize query performance. As you progress through the course, you will explore a variety of AWS Glue features such as Data Catalogs, ETL development, job bookmarking, and data quality evaluation, empowering you to automate data workflows and manage large datasets effectively. With Amazon Redshift, you will learn how to configure clusters, optimize queries, and even work with Redshift Spectrum and Serverless, improving the scalability and efficiency of your data operations. This course is ideal for data professionals looking to enhance their cloud-based data engineering skills, especially those who want to integrate AWS Glue and Redshift into their existing systems. It is suitable for learners with a basic understanding of data analytics, but prior knowledge of AWS or data engineering concepts would be beneficial. The course is designed for both beginners and intermediate learners, offering a solid foundation and practical skills that can be applied in real-world data engineering roles. By the end of the course, you will be able to build and optimize ETL pipelines using AWS Glue, manage data workflows, configure Redshift clusters, optimize query performance, and deploy serverless Redshift for scalable data warehousing solutions.
0 reviews

View details

This course features Coursera Coach! A smarter way to learn with interactive, real-time conversations that help you test your knowledge, challenge assumptions, and deepen your understanding as you progress through the course. Dive deep into modern data engineering with this comprehensive course that equips learners with practical skills in orchestrating complex data workflows and implementing advanced data strategies on AWS. You’ll gain hands-on experience with tools like AWS Step Functions, DMS, DataSync, and QuickSight to visualize, migrate, and manage data efficiently. The journey begins with mastering data visualization through Amazon QuickSight, where you’ll connect data sources and build dashboards. You'll then explore the core of the course—automating and orchestrating data pipelines using AWS tools like Glue Workflows and Step Functions, complete with real-world integrations and advanced capabilities. Next, you'll tackle large-scale data migrations, learning how to move data across platforms using AWS DMS, DataSync, and Transfer Family services. Finally, you'll go beyond standard analytics with services like Lambda, S3, RDS, EC2, and DynamoDB, enhancing your understanding of scalable cloud data architectures. This course is ideal for cloud engineers, data engineers, and IT professionals aiming to sharpen their AWS data skills. A basic understanding of AWS and data concepts is recommended. The difficulty level is Intermediate. By the end of the course, you will be able to orchestrate complex data workflows, migrate data across systems, implement cost-efficient storage strategies, and leverage serverless and cloud-native services to optimize data pipelines.