Hadoop and Spark Fundamentals: Unit 1

via Coursera

Overview

This course provides a practical introduction to the Apache Hadoop ecosystem. You will learn the basic skills needed to analyze and manage large, unstructured datasets. The course covers core concepts such as the data lake, MapReduce, and using Spark for analytics. You will install and configure Hadoop on your own computer using the Hortonworks HDP sandbox. The course includes instruction on the Hadoop Distributed File System (HDFS), its architecture, and how to use it in real-world situations. This course is suitable for beginners and those looking to expand their data analytics skills. By the end, you will understand the fundamentals of Hadoop and Spark for scalable data processing.

Syllabus

  • Hadoop and Spark Fundamentals: Unit 1
    • This module introduces the fundamentals of Hadoop and Spark, starting with core concepts and the transformative impact of Hadoop on data management. It guides learners through installing a full-featured Hadoop environment on a desktop or laptop using the Hortonworks HDP sandbox or direct installation. The module also covers the Hadoop Distributed File System (HDFS), highlighting its architecture, advantages for big data, navigation tools, and advanced features. A bonus lesson provides essential Linux command line skills for beginners.

Taught by

Pearson and Douglas Eadline, PhD
