Overview
Learn how to accelerate Apache Spark jobs using Apache Gluten at enterprise scale in this 43-minute conference talk from the OLAP & Data Analysis Track. Discover the implementation strategies and performance optimizations ByteDance uses to speed up Spark job execution with Apache Gluten, a plugin that offloads Spark SQL execution to native vectorized engines. Explore the technical architecture, the integration challenges, and the performance improvements achieved when deploying Gluten in production. Gain insight into the use cases where Gluten delivers the largest gains and the operational considerations for running it at scale. Examine benchmark results, optimization techniques, and best practices for applying native execution engines to analytical workloads in large-scale data processing environments.
Syllabus
Accelerating Spark jobs with Apache Gluten at ByteDance Scale
Taught by
The ASF