Overview
Explore efficient data exchange with Apache Arrow and pyarrow in this 41-minute conference talk from EuroPython 2025. Learn how Apache Arrow was designed to share data between systems and how the project's data exchange mechanisms have evolved over the years. The talk covers in-process data sharing through the C Data Interface, C Device Interface, C Stream Interface, and Arrow PyCapsule Interface, with practical examples showing how dataframe libraries such as pandas and Polars implement these exchange methods. It then turns to the Arrow IPC (inter-process communication) format for sharing Arrow data between processes, and to building custom network exchanges on the Arrow format with Flight RPC. Python examples illustrate each pattern, along with tips from the presenters on when and how to use pyarrow to speed up data transfer within and between your data applications.
Syllabus
Sharing is caring: Efficient Data Exchange with pyarrow — Raúl Cumplido, Rok Mihevc, Alenka Frim
Taught by
EuroPython Conference