Transcribing Speech for Language Processing
Center for Language & Speech Processing(CLSP), JHU via YouTube
PowerBI Data Analyst - Create visualizations and dashboards from scratch
Python, Prompt Engineering, Data Science — Build the Skills Employers Want Now
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore the fundamental challenges and methodologies of speech transcription for natural language processing applications in this lecture by Mari Ostendorf from the University of Washington. Learn about the technical approaches used to convert spoken language into text that can be effectively processed by computational systems. Discover the complexities involved in automatic speech recognition, including handling variations in pronunciation, accent, speaking rate, and background noise. Examine the relationship between speech transcription accuracy and downstream language processing tasks such as information extraction, machine translation, and dialogue systems. Understand the trade-offs between transcription speed, accuracy, and computational resources in real-world applications. Gain insights into the state-of-the-art techniques available in 2007 for speech-to-text conversion and their limitations. Analyze how transcription errors propagate through language processing pipelines and strategies for mitigating their impact. Consider the evaluation metrics used to assess transcription quality and their relevance to different application domains.
Syllabus
Mari Ostendorf: Transcribing Speech for Language Processing
Taught by
Center for Language & Speech Processing(CLSP), JHU