Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore a comprehensive conference talk that tackles the complex challenge of matching GPS locations to roads and local government areas using large-scale geospatial datasets. Learn about the multi-step approach that leverages H3 indexing to isolate matches with single candidates, followed by advanced geospatial computational techniques for accurately matching points with multiple candidates. Discover how broadcasting and partitioning techniques impact autoscaling, memory usage, and effective data parallelization in distributed computing environments. Gain insights into real-world challenges of large-scale data engineering, Spark performance optimization strategies, and practical solutions for processing massive geospatial datasets efficiently. The presentation covers technical implementation details, scalability considerations, and performance optimization techniques essential for anyone working with geospatial data processing at scale.
Syllabus
Highways and Hexagons: Processing Large Geospatial Datasets With H3
Taught by
Databricks