Overview
This 16-minute IBM tutorial covers techniques for scaling data pipelines: optimizing memory usage, controlling failures through checkpointing and retry logic, and building resilient data flows for AI and big data workflows. Presenter Ellie Najewicz shares practical best practices for keeping pipelines stable under very large data volumes and for scaling data infrastructure as processing needs grow. It is aimed at data engineers and architects who work with large-scale data processing systems.
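As a rough illustration of the three ideas named above (bounded memory, checkpointing, retry logic), here is a minimal Python sketch. It is not taken from the tutorial; the function names (`chunks`, `run_with_retry`, `run_pipeline`), the JSON checkpoint file, and the exponential backoff policy are all assumptions made for this example.

```python
import json
import os
import time
from itertools import islice


def chunks(iterable, size):
    """Yield lists of at most `size` items, so only one chunk is in memory."""
    it = iter(iterable)
    while True:
        batch = list(islice(it, size))
        if not batch:
            return
        yield batch


def run_with_retry(fn, attempts=3, base_delay=0.5):
    """Call fn(); on failure wait (exponential backoff) and retry.

    Re-raises the last error once `attempts` calls have failed.
    """
    for attempt in range(1, attempts + 1):
        try:
            return fn()
        except Exception:
            if attempt == attempts:
                raise
            time.sleep(base_delay * 2 ** (attempt - 1))


def load_checkpoint(path):
    """Return the index of the next chunk to process (0 if no checkpoint)."""
    if os.path.exists(path):
        with open(path) as f:
            return json.load(f)["next_chunk"]
    return 0


def save_checkpoint(path, next_chunk):
    """Write the checkpoint to a temp file, then swap it in with os.replace."""
    tmp = path + ".tmp"
    with open(tmp, "w") as f:
        json.dump({"next_chunk": next_chunk}, f)
    os.replace(tmp, path)


def run_pipeline(records, handle, checkpoint_path,
                 chunk_size=1000, attempts=3, base_delay=0.5):
    """Process `records` chunk by chunk, resuming after the last checkpoint.

    `handle` (hypothetical) is the per-chunk processing step. It should be
    idempotent, because a chunk may be retried after a partial failure.
    """
    start = load_checkpoint(checkpoint_path)
    for i, batch in enumerate(chunks(records, chunk_size)):
        if i < start:
            continue  # finished in an earlier run
        run_with_retry(lambda: handle(batch),
                       attempts=attempts, base_delay=base_delay)
        save_checkpoint(checkpoint_path, i + 1)
```

A call such as `run_pipeline(source, handle, "progress.json")` can be rerun after a crash and will skip the chunks already recorded in the checkpoint file. This assumes `source` yields records in the same order on every run.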
Syllabus
Scaling Data Pipelines: Memory Optimization & Failure Control
Taught by
IBM Technology