Georgia Institute of Technology

CSE 8803 Special Topics: Big Data

Georgia Institute of Technology via Udacity

Overview

In this course, we introduce the characteristics of medical data and the data mining challenges associated with such data. We cover various algorithms and systems.

Syllabus

  • Introduction to Big Data Systems
    • In this lesson, we will take a 30,000-foot view of Big Data and see why it is so important. We will meet the instructor and hear about the components of the course, including the final project.
  • Characteristics of Big Data
    • In this lesson, you will learn about the main characteristics of Big Data, called the 4Vs. You will also start to explore the Big Data ecosystem.
  • Ingestion, Storage and Processing Frameworks
    • In this lesson, you'll take a look at several of the layers that make Big Data possible. We will also look at some of the tools that help implement those layers.
  • NoSQL Databases
    • In this lesson, we will look at the differences between NoSQL and SQL. We will also see why and how NoSQL databases provide capabilities that allow Big Data to be possible.
  • Scalable Data Lake Architecture
    • In this lesson, we will see what a Data Lake storage implementation of Big Data looks like. In addition to the benefits, we will see what considerations, risks, and challenges organizations face when adopting one.
  • Project - Designing an Enterprise Data Lake System
    • In this lesson, we will lead you through the scenario and instructions for completing the final project, which is a proposal for an actual Data Lake architecture.

Taught by

David Joyner and Jimeng Sun
