Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Audio Signal Processing for Machine Learning

Valerio Velardo - The Sound of AI via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Master key audio signal processing concepts essential for machine learning applications through this comprehensive 10-hour course. Learn to process raw audio data and extract meaningful features to power audio-driven AI systems. Begin with fundamental concepts including sound and waveforms, intensity, loudness, and timbre before diving into audio signal analysis for machine learning contexts. Explore various types of audio features and discover practical extraction techniques using Python. Develop proficiency in time domain audio features by implementing amplitude envelope extraction from scratch and learning to extract root-mean square energy and zero-crossing rate from audio signals. Gain deep understanding of the Fourier Transform through intuitive explanations and complex number applications in audio signal processing. Master the Discrete Fourier Transform and implement Fourier Transform extraction using Python libraries. Progress to advanced topics including Short-Time Fourier Transform and spectrogram extraction from audio data. Delve into Mel spectrograms and their practical implementation, followed by comprehensive coverage of Mel-Frequency Cepstral Coefficients (MFCCs) with hands-on Python extraction techniques. Conclude with frequency-domain audio features, implementing Band Energy Ratio from scratch and extracting spectral centroid and bandwidth using Python and Librosa library.

Syllabus

Audio Signal Processing for Machine Learning
Sound and Waveforms
Intensity, Loudness, and Timbre
Understanding Audio Signals for Machine Learning
Types of Audio Features for Machine Learning
How to Extract Audio Features
Understanding Time Domain Audio Features
Extracting the amplitude envelope feature from scratch in Python
How to Extract Root-Mean Square Energy and Zero-Crossing Rate from Audio
Demystifying the Fourier Transform: The Intuition
Complex Numbers for Audio Signal Processing
Defining the Fourier Transform with Complex Numbers
Discrete Fourier Transform Explained Easily
How to Extract the Fourier Transform with Python
Short-Time Fourier Transform Explained Easily
How to Extract Spectrograms from Audio with Python
Mel Spectrograms Explained Easily
Extracting Mel Spectrograms with Python
Mel-Frequency Cepstral Coefficients Explained Easily
Extracting Mel-Frequency Cepstral Coefficients with Python
Frequency-Domain Audio Features
Implementing Band Energy Ratio in Python from Scratch
Extracting Spectral Centroid and Bandwidth with Python and Librosa

Taught by

Valerio Velardo - The Sound of AI

Reviews

Start your review of Audio Signal Processing for Machine Learning

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.