Learn AI, Data Science & Business — Earn Certificates That Get You Hired
Power BI Fundamentals - Create visualizations and dashboards from scratch
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore the complexities of geospatial data analysis in Apache Spark through this informative 36-minute talk. Delve into the challenges of working with geospatial data in Spark, including projections, geometry types, indices, and storage issues. Learn about various geospatial packages available for Spark, their pros and cons, and best practices for data ingestion and long-term storage. Gain insights into spatial indexing for rapid record retrieval and discover approaches to limit errors and reduce costs when handling large-scale geospatial data. Follow along with a demonstration covering Geo JSON files, spatial disaggregation, and practical code examples to enhance your understanding of geospatial options in Apache Spark.
Syllabus
Introduction
About PNNL
Disclaimer
Challenges
Projections
Index
Finding and curating data
System libraries
Largescale joins
Demo
Steps
Geo JSON Files
Spatial De disaggregation
Code Demo
Conclusion
Taught by
Databricks