From Pulsar to Lakehouse - Building a Unified Streaming Storage Engine with Ursa
StreamNative via YouTube
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Learn how to build a unified streaming storage engine that bridges real-time data ingestion with analytical storage using Ursa, a high-performance streaming engine built on Apache Pulsar. Discover Ursa's architecture that combines Pulsar's scalability with lakehouse storage durability and structure, enabling seamless writes from event streams directly into analytical storage systems. Explore two integration modes: Managed Tables for lightweight, internally managed storage and External Tables for writing to external lakehouses with open catalog registration. Master support for Databricks Unity Catalog, Snowflake's Open Catalog, and Open S3Table format while handling schema evolution, efficient file writing with Parquet optimization, and table registration for downstream analytics. Gain real-world insights into building cloud-native, unified lakehouse pipelines that transform streaming data into queryable tables in real time, eliminating the need for complex integration layers between streaming and analytical systems.
Syllabus
[Use Case] From Pulsar to Lakehouse: Building a Unified Streaming Storage Engine with Ursa
Taught by
StreamNative