Unlocking Real-Time Insights - Apache Doris in Stream and Lakehouse Integration
StreamNative via YouTube
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore how Apache Doris enables real-time data warehousing within lakehouse architectures in this 31-minute conference talk. Discover the technical approach behind Apache Doris's lakehouse architecture and its high-performance online analytical services for both self-managed table formats and leading open lake formats including Iceberg, Hudi, and Delta Lake. Learn to construct a real-time lakehouse system using Ursa Engine, AWS S3 Tables, and Apache Doris for rapid data exploration and informed decision-making. Understand how this C++ implementation delivers superior performance compared to alternatives like Trino, offering 3-5x faster querying for lake data. Examine the hybrid model that combines data warehouse governance and performance with data lake scalability and open formats. Gain insights into deployment flexibility across Kubernetes, bare metal, and commercial solutions, while exploring integration capabilities for streaming analytics, operational intelligence, and online feature stores. The session covers practical implementation strategies for bridging traditional data warehouses with modern lakehouse requirements, emphasizing low-latency querying and real-time insights through efficient data transformation and caching techniques.
Syllabus
Unlocking Real-Time Insights: Apache Doris in Stream and Lakehouse Integration
Taught by
StreamNative