Exploring Geometric Representational Disparities Between Multilingual and Bilingual Translation Models
Center for Language & Speech Processing(CLSP), JHU via YouTube
Learn Generative AI, Prompt Engineering, and LLMs for Free
Learn AI, Data Science & Business — Earn Certificates That Get You Hired
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Dive into a 40-minute conference talk presented by Neha Verma, a 3rd year PhD student at the Center for Language & Speech Processing (CLSP) at Johns Hopkins University. Explore the geometric differences in representations between bilingual and one-to-many multilingual translation models. Examine how these models utilize dimensions in their underlying vector space through measures of isotropy, including intrinsic dimensionality and IsoScore. Discover the findings that multilingual model decoder representations are consistently less isotropic than their bilingual counterparts for a given language pair. Investigate how this anisotropy in multilingual decoder representations is largely attributed to modeling language-specific information, potentially limiting remaining representational capacity. Gain insights into the implications of these disparities for parameter efficiency and overall performance in multilingual machine translation, particularly in one-to-many translation settings.
Syllabus
Exploring Geometric Representational Disparities Between Multilingual & Bilingual Translation Models
Taught by
Center for Language & Speech Processing(CLSP), JHU