Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Udacity

Scripting for Data Science

via Udacity

Overview

Learn essential Unix shell and Python scripting skills for data workflows and analytics. Navigate and manage files, inspect and monitor data efficiently, and work confidently with the filesystem. Build reusable Python scripts that accept user input, automate repetitive tasks, and process data from the command line. Develop professional command-line tools with validation and clear output, integrate operating system commands into Python workflows, and use Python libraries to manage files and directories. Apply these skills to real-world data engineering and analytics tasks to improve efficiency, automation, and reproducibility.

Syllabus

  • File Navigation and Management in the Shell
    • Navigate the filesystem, inspect file contents, and manage files and directories efficiently in the Unix shell while building practical skills for common terminal workflows.
  • Shell Scripts for Data Automation
    • Automate file handling and data workflows with shell scripts, command chaining, loops, conditionals, and text-processing tools to organize files, analyze logs, and transform datasets.
  • Python Scripting for Data Workflows
    • Develop reusable Python automation tools that process data, handle command-line inputs, integrate shell workflows, and manage files efficiently using Python's built-in modules.
  • Automating Data Cleaning with Shell and Python
    • Automate data cleaning workflows by organizing files, validating JSON data, normalizing records, and merging datasets into analysis-ready outputs with Unix shell and Python.

Taught by

Kedar Kodgire

Reviews

Start your review of Scripting for Data Science

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.