
Johns Hopkins University

HDFS Architecture and Programming

Johns Hopkins University via Coursera

Overview

The course “HDFS Architecture and Programming” covers the architecture and components of the Hadoop Distributed File System (HDFS) along with advanced programming techniques. You will gain practical experience setting up and configuring Hadoop for Java development while learning key concepts such as file and directory CRUD operations, data compression, and serialization. By the end of the course, you will be able to use HDFS for large-scale data processing and to build scalable, high-availability solutions. What sets this course apart is its hands-on approach: you will work directly with HDFS, writing client programs and applying advanced techniques such as using Sequence and Map Files for specialized data storage. Whether you're new to Hadoop or looking to refine your existing skills, this course gives you the tools and knowledge for HDFS programming in the field of Big Data.

Syllabus

  • Course Introduction
    • This course provides a comprehensive understanding of the Hadoop Distributed File System (HDFS) architecture and its key components. You will gain hands-on experience with HDFS, learning how to set up a Java programming environment and configure Hadoop. The course covers essential topics such as the HDFS programming model, file and directory CRUD operations, and compression techniques. You will also explore serialization, deserialization, and specialized file structures like Sequence and Map Files. By the end of the course, you will be equipped to leverage HDFS for scalable, highly available big data solutions.
  • HDFS Architecture
    • In this module, we will cover the working model and architecture behind the Hadoop Distributed File System (HDFS) 1.0, and the capabilities and deficiencies of the HDFS 1.0 architecture.
  • HDFS Programming Basics
    • In this module, we will cover HDFS programming concepts, the HDFS API, and the steps to write an HDFS client program for CRUD (Create, Read, Update, and Delete) operations on files.
  • HDFS Programming Advanced
    • In this module, we will cover advanced HDFS programming concepts, such as CRUD on directories, compression, serialization and deserialization, and file-based data structures like sequence files.
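
To give a feel for the serialization and deserialization topic in the last module, here is a minimal sketch in Java. It is not taken from the course materials. The `PageViewRecord` class, its fields, and the `serialize`/`deserialize` helpers are hypothetical names chosen for illustration. The `write`/`readFields` method pair mirrors the shape of Hadoop's `Writable` interface, but the class deliberately does not implement it, so the example depends only on `java.io` and runs without a Hadoop installation.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;

// Hypothetical record type (not from the course). The write/readFields pair
// follows the same pattern as Hadoop's Writable, using only the JDK.
class PageViewRecord {
    private String url = "";
    private long hits;

    // A no-argument constructor lets a reader create an empty instance
    // and then populate it from a stream.
    PageViewRecord() {}

    PageViewRecord(String url, long hits) {
        this.url = url;
        this.hits = hits;
    }

    // Serialize the fields in a fixed order.
    public void write(DataOutput out) throws IOException {
        out.writeUTF(url);
        out.writeLong(hits);
    }

    // Deserialize the fields in the same order they were written.
    public void readFields(DataInput in) throws IOException {
        url = in.readUTF();
        hits = in.readLong();
    }

    public String getUrl() { return url; }

    public long getHits() { return hits; }

    // Helper: turn a record into a byte array held in memory.
    static byte[] serialize(PageViewRecord record) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(buffer)) {
            record.write(out);
        }
        return buffer.toByteArray();
    }

    // Helper: rebuild a record from bytes produced by serialize().
    static PageViewRecord deserialize(byte[] bytes) throws IOException {
        PageViewRecord record = new PageViewRecord();
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(bytes))) {
            record.readFields(in);
        }
        return record;
    }

    public static void main(String[] args) throws IOException {
        byte[] bytes = serialize(new PageViewRecord("/index.html", 42L));
        PageViewRecord copy = deserialize(bytes);
        System.out.println(copy.getUrl() + " " + copy.getHits());
    }
}
```

In a real HDFS client, a class like this would typically implement `org.apache.hadoop.io.Writable` so that it can be stored as a key or value in a sequence file; the in-memory byte array here simply stands in for a file stream.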

Taught by

Karthik Shyamsunder
