From Pulsar to Lakehouse - Building a Unified Streaming Storage Engine with Ursa
StreamNative via YouTube
Free courses from frontend to fullstack and AI
Power BI Fundamentals - Create visualizations and dashboards from scratch
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn how to build a unified streaming storage engine that bridges real-time data ingestion with analytical storage using Ursa, a high-performance streaming engine built on Apache Pulsar. Discover Ursa's architecture that combines Pulsar's scalability with lakehouse storage durability and structure, enabling seamless writes from event streams directly into analytical storage systems. Explore two integration modes: Managed Tables for lightweight, internally managed storage and External Tables for writing to external lakehouses with open catalog registration. Master support for Databricks Unity Catalog, Snowflake's Open Catalog, and Open S3Table format while handling schema evolution, efficient file writing with Parquet optimization, and table registration for downstream analytics. Gain real-world insights into building cloud-native, unified lakehouse pipelines that transform streaming data into queryable tables in real time, eliminating the need for complex integration layers between streaming and analytical systems.
Syllabus
[Use Case] From Pulsar to Lakehouse: Building a Unified Streaming Storage Engine with Ursa
Taught by
StreamNative