Overview
Explore entity resolution techniques for large-scale data cleaning in this 23-minute conference talk from YOW! 2019. Huon Wilson, a software engineer at CSIRO's Data61, describes the challenge of connecting duplicate and corrupted records to the single underlying entity they represent. The talk covers the solutions and lessons learned from implementing entity resolution on Apache Spark to process billions of records, including how to handle messy real-world data, scale the cleaning process, and improve data quality for later analysis and decision-making.
Syllabus
Entity Resolution at Scale • Huon Wilson • YOW! 2019
Taught by
GOTO Conferences