Building a Batch Processing Platform for Data Pipelines Using Argo and Kubernetes
CNCF [Cloud Native Computing Foundation] via YouTube
Launch Your Cybersecurity Career in 6 Months
Finance Certifications Goldman Sachs & Amazon Teams Trust
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore a conference talk detailing Intuit's development of a highly scalable batch processing platform using Kubernetes and Argo for efficient data pipeline management. Discover how this solution addresses challenges in scheduling, orchestration, and complex dependency management for over 100,000 data pipelines across hundreds of AI and Data engineering teams. Learn about the integration of Argo Events, Argo Workflow, and Kubernetes to create an effective orchestration and scheduling engine for various data processing use cases. Gain insights into the operational challenges of managing multi-cluster Kubernetes infrastructure and the integration of Argo with Kafka for zero downtime scheduling. Understand how this holistic approach eliminates silos and enhances processing effectiveness in the data lake environment.
Syllabus
Building a Batch Processing Platform... - Rakesh Subramanian Suresh & Aroop Maliakkal Padmanabhan
Taught by
CNCF [Cloud Native Computing Foundation]