Learn essential Unix shell and Python scripting skills for data workflows and analytics. Navigate and manage files, inspect and monitor data efficiently, and work confidently with the filesystem. Build reusable Python scripts that accept user input, automate repetitive tasks, and process data from the command line. Develop professional command-line tools with validation and clear output, integrate operating system commands into Python workflows, and use Python libraries to manage files and directories. Apply these skills to real-world data engineering and analytics tasks to improve efficiency, automation, and reproducibility.
Overview
Syllabus
- File Navigation and Management in the Shell
- Navigate the filesystem, inspect file contents, and manage files and directories efficiently in the Unix shell while building practical skills for common terminal workflows.
- Shell Scripts for Data Automation
- Automate file handling and data workflows with shell scripts, command chaining, loops, conditionals, and text-processing tools to organize files, analyze logs, and transform datasets.
- Python Scripting for Data Workflows
- Develop reusable Python automation tools that process data, handle command-line inputs, integrate shell workflows, and manage files efficiently using Python's built-in modules.
- Automating Data Cleaning with Shell and Python
- Automate data cleaning workflows by organizing files, validating JSON data, normalizing records, and merging datasets into analysis-ready outputs with Unix shell and Python.
Taught by
Kedar Kodgire