Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn how to revolutionize data processing by bringing compute to where data lives rather than moving data to centralized infrastructure in this 36-minute conference talk from the Linux Foundation. Discover how organizations can address the critical challenge of exponentially growing data across distributed locations while avoiding the unsustainable costs and complexity of traditional centralized processing approaches. Explore the open-source Bacalhau project and understand how to deploy distributed processing jobs across clouds, edge devices, and on-premises infrastructure to reduce data movement costs while maintaining centralized control. Master practical techniques for ensuring compliance by processing sensitive data in place and enabling real-time analytics at the edge. Examine real-world case studies including an energy company managing 15,000 microgrids and cities processing camera feeds to understand how these solutions work in practice. Gain insights into architectural patterns, security considerations, and best practices for implementing compute-over-data architectures that can help unlock the majority of enterprise data that currently goes unused due to transfer costs, compliance issues, and network reliability problems.
Syllabus
The MODERN Modern Data Stack: Building an Open Distributed Data Warehouse Beyond... David Aronchick
Taught by
Linux Foundation