Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Coursera

Big Data Analytics with Hive, Pig & MapReduce

EDUCBA via Coursera

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
By the end of this course, learners will be able to design Hive databases, manage complex tables, process XML data with Pig, execute MapReduce jobs, and analyze large-scale social media datasets to extract meaningful insights. The course begins with foundational concepts of Hive, including databases, partitions, and bucketing, then advances into table optimization and constraints for schema design. Learners will gain practical experience in ingesting data with Sqoop, processing it using MapReduce, and applying location- and author-based analytics to real-world datasets. Finally, the course explores Pig scripting for XML processing and Hive complex data types for advanced bookmarking dataset analysis. This course is unique because it combines two hands-on case studies: one from the telecom industry and another from social media analytics, offering a blend of foundational Hive knowledge and advanced Hadoop ecosystem tools. Designed for professionals, students, and data enthusiasts, the course emphasizes practical application over theory, ensuring learners can confidently apply big data technologies to solve real business problems.

Syllabus

  • Foundations of Hive and Big Data
    • This module introduces Apache Hive and its role in the Hadoop ecosystem. Learners will explore Hive’s basic features, database commands, table operations, and foundational concepts like external tables, partitions, and bucketing. By the end, they will have a strong foundation to query and manage data effectively in Hadoop using Hive.
  • Optimizing Data with Hive
    • This module dives deeper into advanced Hive functionality, including table constraints and complex table creation. Learners will understand how to design optimized tables and implement constraints to improve schema structure and maintainability in Hive.
  • Social Media Data Integration and Processing
    • This module focuses on importing social media data into Hadoop, processing it with MapReduce, and analyzing it for insights. Learners will practice using Sqoop for RDBMS to HDFS transfers, run MapReduce programs, and analyze datasets by location, authors, and reader preferences.
  • Social Media Insights with Pig and Hive
    • This module explores Pig and Hive for advanced social media analytics. Learners will process XML data with Pig, store and explore outputs, and utilize Hive complex data types with MapReduce for deep insights into bookmarking datasets and user interactions.

Taught by

EDUCBA

Reviews

Start your review of Big Data Analytics with Hive, Pig & MapReduce

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.