Overview
Explore a 30-minute talk by Databricks experts Matthew Powers and Xinrong Meng on the Pandas API on Spark, which combines the simplicity of pandas with the scalability of Apache Spark. Learn how it addresses the limitations of traditional pandas by enabling distributed data processing for large datasets. Discover how to get started with the Pandas API on Spark and adapt existing pandas code to handle data volumes that are too large for pandas alone. Gain insights into using SQL and machine learning capabilities for data analysis and processing. The talk is suited to data scientists and analysts who want to scale their Python-based data workflows while keeping the familiar pandas interface.
Syllabus
Pandas on Spark: Simplicity of Pandas with Efficiency of Spark
Taught by
Databricks