Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Coursera

Processing and Analyzing Big Data in AWS

Packt via Coursera

Overview

AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off your first 3 months — limited time.
Unlock All Certificates
This course features Coursera Coach! A smarter way to learn with interactive, real-time conversations that help you test your knowledge, challenge assumptions, and deepen your understanding as you progress through the course. This course will guide you through the essential AWS tools for processing and analyzing big data. You will learn how to leverage services such as EMR, SageMaker, Lambda, and Data Pipeline to build scalable data processing solutions. The course focuses on both the core technologies and best practices for real-time data analysis and machine learning model training in the AWS cloud. As you progress, you will dive deep into each service. You’ll set up and utilize EMR clusters with Spark, Hue, and Hive, explore machine learning workflows in SageMaker, and understand how Lambda and Glue can simplify processing and ETL jobs. Hands-on examples help you understand how to create a seamless data flow from collection to analysis. You will also be introduced to powerful tools like Elasticsearch, Athena, and Redshift for data analysis and reporting. The course is designed to equip you with the practical skills to use AWS data services effectively in production environments. Through real-world use cases, you will gain the confidence to tackle any big data challenges, from batch processing to streaming analytics. This course is ideal for data engineers, cloud developers, and IT professionals who want to enhance their data processing and analytics capabilities. A basic understanding of cloud services and programming is helpful but not required. By the end of the course, you will be able to set up data processing workflows with AWS services like EMR, SageMaker, Lambda, and Redshift, and gain proficiency in analyzing and visualizing data with Elasticsearch, Athena, and Kinesis Analytics.

Syllabus

  • Processing
    • In this module, we will dive into various AWS services for processing big data. We will cover setting up and managing EMR clusters with Spark and Hive, using SageMaker for training machine learning models, and utilizing Lambda for reactive processing. Additionally, we will explore Data Pipeline for data transformations and migrations, and discuss using Glue for ETL jobs and data catalogs.
  • Analysis
    • In this module, we will focus on AWS tools for analyzing big data. We will explore Elasticsearch and Kibana for real-time data visualization, dive into Athena for interactive querying, and examine Redshift for large-scale data warehousing. Additionally, we will cover Kinesis Analytics for processing real-time streaming data.

Taught by

Packt - Course Instructors

Reviews

Start your review of Processing and Analyzing Big Data in AWS

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.