Broadening Statistical Machine Translation with Comparable Corpora and Generalized Models
Center for Language & Speech Processing(CLSP), JHU via YouTube
The Private Equity Associate Certification
Learn EDR Internals: Research & Development From The Masters
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Explore advanced techniques for expanding statistical machine translation capabilities through the use of comparable corpora and generalized methodologies in this comprehensive lecture. Learn how to leverage non-parallel text collections to improve translation quality and coverage, moving beyond traditional parallel corpus limitations. Discover innovative approaches to extracting translation knowledge from comparable documents that share similar topics or domains but are not direct translations of each other. Examine generalized mathematical frameworks that enable more robust and flexible translation models. Understand the theoretical foundations and practical applications of these advanced SMT techniques, including methods for identifying and exploiting cross-lingual similarities in comparable texts. Gain insights into how these approaches can significantly broaden the scope and effectiveness of machine translation systems, particularly for language pairs with limited parallel training data.
Syllabus
Chris Quirk: Broadening statistical machine translation with comparable corpora and generalized m...
Taught by
Center for Language & Speech Processing(CLSP), JHU