Text-to-Speech and Voice Cloning Course - How Humans Speak

Valerio Velardo - The Sound of AI via YouTube

Classroom Contents

  1. 0:00 Intro
  2. 1:11 Human vs machine speech pipeline
  3. 3:32 Language
  4. 5:31 Phonemes
  5. 8:30 International Phonetic Alphabet
  6. 13:14 English phonetic chart
  7. 14:55 Phonetic transcription
  8. 16:20 Coarticulation
  9. 18:53 Prosody
  10. 21:34 Timbre
  11. 25:19 Source-filter model of speech production
  12. 30:12 Glottal sound
  13. 33:01 More source-filter model
  14. 34:42 Formants
  15. 40:22 Emotion and expressivity
  16. 42:22 Speech is multilayered
  17. 44:35 Why is speech hard for machines?
