

Building an Async CLI Tool for ETL Pipelines in Python

via CodeSignal

Overview

It's time to put it all together. You will build a complete, production-ready command-line ETL tool from scratch, integrating dataclasses, pattern matching, and asyncio. This project consolidates the advanced Python skills covered throughout the path into a single portfolio piece.

Syllabus

  • Unit 1: Domain Model Validation
    • Implement the Money Domain Model
    • Build a Reusable Range Validator
    • Building a Self Validating Transaction Model
    • Wiring Up the Range Descriptor
    • Handle Validation Errors Gracefully
  • Unit 2: Parsing and Normalization
    • Build a Streaming CSV Parser
    • Building a JSON Lines Parser
    • Transform Records into Valid Transactions
    • Orchestrating the Complete Data Pipeline
  • Unit 3: Routing and Orchestration
    • Building a Transaction Dispatcher
    • Pattern Guards for Smart Routing
    • Routing and Validating Refund Transactions
    • Handling the Unexpected Gracefully
  • Unit 4: Async Pipeline Backpressure
    • Building Your Queue Consumer
    • Feeding Data into the Pipeline
    • Orchestrating the Async Pipeline
    • Wire Up Your CLI Tool
  • Unit 5: CLI and Packaging Polish
    • Configuring Application Logging
    • Building the Command Line Parser
    • Bringing the CLI to Life
    • Invoking Your Pipeline Tool
