Segmentation Indexing and Retrieval of Multilingual Sources
Center for Language & Speech Processing(CLSP), JHU via YouTube
Most AI Pilots Fail to Scale. MIT Sloan Teaches You Why — and How to Fix It
Learn Python with Generative AI - Self Paced Online
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Learn advanced techniques for processing multilingual text data through this comprehensive lecture that explores segmentation, indexing, and retrieval methods for sources in multiple languages. Discover how to effectively break down multilingual documents into meaningful segments, create efficient indexing systems that can handle diverse linguistic structures, and implement retrieval algorithms that work across language boundaries. Examine the unique challenges posed by different writing systems, grammatical structures, and linguistic features when building cross-lingual information systems. Explore practical approaches to handling code-switching, mixed-language documents, and varying text directionality in multilingual corpora. Gain insights into evaluation metrics and methodologies for assessing the performance of multilingual segmentation and retrieval systems. Understand the computational and linguistic considerations involved in scaling these techniques to large, diverse multilingual datasets and learn about current research directions in cross-lingual natural language processing and information retrieval.
Syllabus
David Palmer: Segmentation Indexing and Retrieval of Multilingual Sources
Taught by
Center for Language & Speech Processing(CLSP), JHU