Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

What's New in Apache Spark 4.0?

Databricks via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Join this 41-minute conference talk for a comprehensive exploration of Apache Sparkâ„¢ 4.0's most significant enhancements and new features. Discover the latest SQL improvements including ANSI compliance by default, scripting capabilities, SQL pipe syntax, SQL UDF functionality, session variables, and view schema evolution. Learn about the new VARIANT data type and string collation features that expand Spark's data handling capabilities. Explore Python-specific enhancements such as the new Python data source and plotting API that streamline data science workflows. Examine streaming improvements including the state store data source, state store checkpoint v2, and arbitrary state v2 for more robust real-time processing. Understand Spark Connect enhancements featuring expanded API coverage, thin client functionality, and unified Scala interface for better connectivity options. Review infrastructure improvements including better error messaging, structured logging, and support for new Java and Scala versions. Gain insights from Daniel Tenedorio, Sr. Staff Software Engineer at Databricks, and Wenchen Fan, Senior Staff Software Engineer at Databricks, as they guide you through these innovations and demonstrate how to leverage Spark 4.0's capabilities for modern data and AI pipelines, making this session valuable for both experienced Spark users and newcomers to the ecosystem.

Syllabus

What’s New in Apache Spark™ 4.0?

Taught by

Databricks

Reviews

Start your review of What's New in Apache Spark 4.0?

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.