Best Practices for Building Apache Iceberg Based Lakehouse Architectures on AWS
AWS Events via YouTube
Pass the PMP® Exam on Your First Try — Expert-Led Training
Python, Prompt Engineering, Data Science — Build the Skills Employers Want Now
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore advanced strategies for implementing Apache Iceberg-based lakehouse architectures on AWS in this comprehensive 56-minute conference talk from AWS re:Invent 2025. Discover how to leverage Amazon S3 Tables and integrate Iceberg Rest Catalog with lakehouse solutions in Amazon SageMaker for optimal data management. Learn performance optimization techniques specifically designed for Amazon Athena and Amazon Redshift queries, while mastering real-time data processing capabilities using Apache Spark. Gain practical knowledge on seamlessly integrating Apache Iceberg with Amazon EMR, AWS Glue, and Trino to create robust data processing pipelines. Delve into hands-on implementations of zero-ETL approaches, change data capture (CDC) patterns, and medallion architecture principles that form the foundation of modern data lakes. Develop enterprise-grade expertise in building scalable, high-performance lakehouse solutions that harness the full potential of Apache Iceberg on the AWS cloud platform, enabling you to architect data solutions that meet demanding business requirements while maintaining optimal performance and cost efficiency.
Syllabus
AWS re:Invent 2025 - Best practices for building Apache Iceberg based lakehouse architectures on AWS
Taught by
AWS Events