Apache Arrow DataFusion - A Fast, Embeddable, Modular Analytic Query Engine
CMU Database Group via YouTube
AI Engineer - Learn how to integrate AI into software applications
Learn the Skills Netflix, Meta, and Capital One Actually Hire For
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore the capabilities of Apache Arrow DataFusion in this comprehensive seminar from the CMU Database Group's Database Building Blocks series. Delve into the intricacies of this fast, embeddable, and modular analytic query engine as presented by speaker Andrew Lamb. Learn about the engine's architecture, performance optimizations, and its role in modern data analytics. Discover how DataFusion leverages the Apache Arrow format for efficient in-memory processing and its integration possibilities with other data systems. Gain insights into real-world applications and use cases for DataFusion in data-intensive environments. Understand the benefits of its modular design and how it enables customization for specific analytical needs. This hour-long talk provides valuable knowledge for database professionals, data engineers, and anyone interested in high-performance query processing.
Syllabus
Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine (Andrew Lamb)
Taught by
CMU Database Group