Applications of Weighted Finite State Transducers in a Speech Recognition Toolkit
Center for Language & Speech Processing(CLSP), JHU via YouTube
Live Online Classes in Design, Coding & AI — Small Classes, Free Retakes
Learn AI, Data Science & Business — Earn Certificates That Get You Hired
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Explore the practical implementation of weighted finite state transducers (WFSTs) in speech recognition systems through this comprehensive lecture by Daniel Povey from Microsoft Research. Learn how WFSTs serve as fundamental mathematical frameworks for modeling various components of automatic speech recognition, including pronunciation dictionaries, language models, and acoustic models. Discover the theoretical foundations of finite state automata and transducers, understand their weighted variants, and examine how these mathematical structures enable efficient composition and optimization of speech recognition pipelines. Gain insights into the Kaldi speech recognition toolkit's architecture and see how WFSTs facilitate modular design, allowing researchers and developers to experiment with different model components independently. Understand the computational advantages of using WFSTs for decoding algorithms, including their role in creating compact and efficient search graphs that combine multiple knowledge sources. Examine practical examples of WFST construction, composition operations, and optimization techniques that improve both accuracy and computational efficiency in real-world speech recognition applications.
Syllabus
Daniel Povey: Applications of weighted finite state transducers in a speech recognition toolkit
Taught by
Center for Language & Speech Processing(CLSP), JHU