New Methods to Capture and Exploit Multiscale Speech Dynamics - 2007
Center for Language & Speech Processing(CLSP), JHU via YouTube
35% Off Finance Skills That Get You Hired - Code CFI35
Earn Your CS Degree, Tuition-Free, 100% Online!
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore new methods for capturing and exploiting multiscale speech dynamics in this 2007 lecture by Patrick Wolfe from Harvard University. Delve into the variability of speech waveforms and the powerful temporal and spectral dynamics that evolve across multiple scales. Learn about advancements in formant estimation using a statistical model-based tracking approach, including a censored likelihood formulation and vector autoregression to model formant cross-correlation. Discover a novel adaptive short-time Fourier analysis-synthesis scheme for speech enhancement, featuring a modified overlap-add procedure for efficient resynthesis. Examine the potential improvements these methods offer over traditional fixed-resolution enhancement systems, supported by measurements and listening tests. Gain insights from Wolfe's extensive background in electrical engineering, statistics, and audio signal processing, and understand the applications of these techniques in high-dimensional data analysis for speech waveforms and color images.
Syllabus
New Methods to Capture and Exploit Multiscale Speech Dynamics – Patrick Wolfe (Harvard) - 2007
Taught by
Center for Language & Speech Processing(CLSP), JHU