Petabyte-Scale On-Chain Insights - Real-Time Intelligence for the Next-Gen Financial Backbone
Databricks via YouTube
Start speaking a new language. It’s just 3 weeks away.
Gain a Splash of New Skills - Coursera+ Annual Nearly 45% Off
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Discover how to build a near real-time, multi-chain data lakehouse for anti-money laundering (AML) monitoring at petabyte scale in this 20-minute conference talk. Explore the complete end-to-end architecture that integrates cutting-edge open-source technologies and AI-driven analytics to seamlessly handle massive on-chain data volumes, complemented by off-chain intelligence to meet rigorous AML requirements. Learn about ChainStorage, an open-source solution originally developed by Coinbase that provides robust blockchain data ingestion and block-level serving, enhanced with Apache Spark and Arrow for high-throughput processing and efficient data serialization, backed by Delta Lake and Kafka. Understand how StarRocks delivers lightning-fast SQL analytics over vast datasets in the serving layer, and examine the implementation of machine learning and AI agents for continuous data curation and near real-time insights crucial for tackling on-chain AML challenges. Gain insights from CipherOwl Inc.'s practical experience in constructing this sophisticated financial intelligence infrastructure that serves as the backbone for next-generation financial monitoring systems.
Syllabus
Petabyte-Scale On-Chain Insights: Real-Time Intelligence for the Next-Gen Financial Backbone
Taught by
Databricks