Overview
Learn advanced techniques for combining multiple data collections and sources in natural language processing through this comprehensive lecture from Johns Hopkins University's Center for Language and Speech Processing. Explore methodologies for integrating diverse datasets, understand the theoretical foundations of collection fusion approaches, and discover practical applications in computational linguistics and speech processing. Examine strategies for handling heterogeneous data sources, addressing inconsistencies across collections, and optimizing fusion algorithms for improved performance in language processing tasks. Gain insights into the challenges and solutions associated with merging linguistic corpora, speech databases, and other language resources while maintaining data quality and coherence.
Syllabus
David Yarowsky: Collection Fusion
Taught by
Center for Language & Speech Processing (CLSP), JHU