Broadening Statistical Machine Translation with Comparable Corpora and Generalized Models
Center for Language & Speech Processing(CLSP), JHU via YouTube
Free courses from frontend to fullstack and AI
PowerBI Data Analyst - Create visualizations and dashboards from scratch
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore advanced techniques for expanding statistical machine translation capabilities through the use of comparable corpora and generalized methodologies in this comprehensive lecture. Learn how to leverage non-parallel text collections to improve translation quality and coverage, moving beyond traditional parallel corpus limitations. Discover innovative approaches to extracting translation knowledge from comparable documents that share similar topics or domains but are not direct translations of each other. Examine generalized mathematical frameworks that enable more robust and flexible translation models. Understand the theoretical foundations and practical applications of these advanced SMT techniques, including methods for identifying and exploiting cross-lingual similarities in comparable texts. Gain insights into how these approaches can significantly broaden the scope and effectiveness of machine translation systems, particularly for language pairs with limited parallel training data.
Syllabus
Chris Quirk: Broadening statistical machine translation with comparable corpora and generalized m...
Taught by
Center for Language & Speech Processing(CLSP), JHU