Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

CodeSignal

Introduction to Data Cleaning with Python

via CodeSignal

Overview

This course introduces fundamental concepts of data cleaning using Python, covering essential libraries, handling missing values, detecting and removing duplicates, dealing with outliers, and normalizing data for analysis.

Syllabus

  • Unit 1: Data Handling and File Operations in Python
    • Modify Entries in a DataFrame
    • Managing Missing Data with Pandas
    • Debug the Agenda Logic
    • Mastering DataFrame Inspection
  • Unit 2: Handling Missing Data with Pandas
    • Adapting Missing Data Strategies
    • Fix Bugs in Handling Missing Data
    • Clean and Fill Your DataFrame
    • Handling Missing Values Like a Pro
  • Unit 3: Handling Duplicates in Data Using Pandas
    • Debug Duplicate Detection
    • Mastering DataFrame Duplicate Handling
    • Clearing Up Duplicate Data
  • Unit 4: Detecting Outliers in Data Using Python
    • Calculate Z-scores for Outlier Detection
    • Fix the Outlier Detection Code
    • Handle Outliers with Z-score
    • Identify and Remove Outliers Easily
  • Unit 5: Standardizing and Normalizing Data in Python
    • Changing the Scale of Data
    • Fix the Data Preprocessing Errors
    • Handling Missing Data Efficiently
    • Transform Your Data Efficiently

Reviews

Start your review of Introduction to Data Cleaning with Python

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.