Applications of Weighted Finite State Transducers in a Speech Recognition Toolkit
Center for Language & Speech Processing(CLSP), JHU via YouTube
40% Off Career-Building Certificates
AI Engineer - Learn how to integrate AI into software applications
Overview
Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Explore the practical implementation of weighted finite state transducers (WFSTs) in speech recognition systems through this comprehensive lecture by Daniel Povey from Microsoft Research. Learn how WFSTs serve as fundamental mathematical frameworks for modeling various components of automatic speech recognition, including pronunciation dictionaries, language models, and acoustic models. Discover the theoretical foundations of finite state automata and transducers, understand their weighted variants, and examine how these mathematical structures enable efficient composition and optimization of speech recognition pipelines. Gain insights into the Kaldi speech recognition toolkit's architecture and see how WFSTs facilitate modular design, allowing researchers and developers to experiment with different model components independently. Understand the computational advantages of using WFSTs for decoding algorithms, including their role in creating compact and efficient search graphs that combine multiple knowledge sources. Examine practical examples of WFST construction, composition operations, and optimization techniques that improve both accuracy and computational efficiency in real-world speech recognition applications.
Syllabus
Daniel Povey: Applications of weighted finite state transducers in a speech recognition toolkit
Taught by
Center for Language & Speech Processing(CLSP), JHU