Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

The Future of DSv2 in Apache Spark

Databricks via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore Apache Spark's next-generation Catalog API in this 28-minute conference talk that examines how DSv2 is transforming data source development by shifting complexity to Spark itself while improving connector reliability. Learn about the fundamental design principles behind DSv2 and discover how it enables advanced functionality including catalog federation, MERGE operations, storage-partitioned joins, aggregate pushdown, and stored procedures. Gain insights into the current strengths and limitations of DSv2 implementation, along with its evolving roadmap for future development. Understand how this API evolution benefits both custom-built and off-the-shelf data sources, making it essential knowledge for Spark users and developers working with various data connectors. The session provides practical guidance on leveraging DSv2's capabilities while addressing real-world challenges in data source integration and optimization within the Apache Spark ecosystem.

Syllabus

The Future of DSv2 in Apache Sparkâ„¢

Taught by

Databricks

Reviews

Start your review of The Future of DSv2 in Apache Spark

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.