Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Neural Target Speech and Sound Extraction

Center for Language & Speech Processing(CLSP), JHU via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore the fascinating world of neural target speech and sound extraction in this comprehensive plenary lecture delivered at JSALT 2025. Delve into the cocktail party effect and selective hearing phenomenon that allows humans to isolate desired sounds from complex acoustic environments, such as focusing on a conversation in a noisy café or identifying specific instruments in musical compositions. Learn about target speech/sound extraction (TSE) techniques that use neural networks to isolate target speakers or sounds from audio mixtures using various identifying clues including spatial information, visual cues from video, or enrollment audio samples. Discover the foundational principles behind TSE technology and examine cutting-edge research developments in neural-based approaches for both speech and arbitrary sound extraction. Gain insights from Distinguished Researcher Marc Delcroix of NTT Communication Science Laboratories, whose expertise spans speech enhancement, robust speech recognition, model adaptation, and speaker diarization, and who has contributed significantly to major challenges and conferences in the field including CHiME, REVERB, ASRU, and SLT.

Syllabus

[camera] JSALT 2025 - Plenary Talk - Marc Delcroix: Neural Target Speech and Sound Extraction

Taught by

Center for Language & Speech Processing(CLSP), JHU

Reviews

Start your review of Neural Target Speech and Sound Extraction

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.