From Pulsar to Lakehouse - Building a Unified Streaming Storage Engine with Ursa
StreamNative via YouTube
Overview
Learn how to build a unified streaming storage engine that bridges real-time data ingestion and analytical storage using Ursa, a high-performance streaming engine built on Apache Pulsar. Ursa's architecture combines Pulsar's scalability with the durability and structure of lakehouse storage, so event streams can be written directly into analytical storage systems. The talk covers two integration modes: Managed Tables, a lightweight option in which Ursa manages the storage internally, and External Tables, which write to an external lakehouse and register the tables in an open catalog. It also covers support for Databricks Unity Catalog, Snowflake's Open Catalog, and the open S3 Tables format, along with schema evolution, efficient file writing with optimized Parquet output, and table registration for downstream analytics. Drawing on real-world experience, the session shows how cloud-native, unified lakehouse pipelines can turn streaming data into queryable tables in real time, without a separate integration layer between the streaming and analytical systems.
Syllabus
[Use Case] From Pulsar to Lakehouse: Building a Unified Streaming Storage Engine with Ursa
Taught by
StreamNative