Streaming Speech to Text Models - Kyutai vs Whisper

Streaming Speech to Text Models - Kyutai vs Whisper

Trelis Research via YouTube Direct link

0:00 Streaming Speech to Text Demo with Kyutai TTS

1 of 14

1 of 14

0:00 Streaming Speech to Text Demo with Kyutai TTS

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Streaming Speech to Text Models - Kyutai vs Whisper

Automatically move to the next video in the Classroom when playback concludes

  1. 1 0:00 Streaming Speech to Text Demo with Kyutai TTS
  2. 2 0:42 Demo en français
  3. 3 1:05 Video Overview
  4. 4 2:42 Resources & Repo
  5. 5 3:15 Running Kyutai TTS on your Mac
  6. 6 5:15 Run streaming TTS in a notebook
  7. 7 5:58 Word timestamping
  8. 8 8:52 Text and Audio Assisted Transcription
  9. 9 11:46 Fast STREAMING TTS server with Rust
  10. 10 15:27 Streaming vs Whisper TTS vs Voxtral
  11. 11 19:53 Theory of Timestamping
  12. 12 22:55 Whisper vs Kyutai TTS architectures
  13. 13 24:34 How Kyutai is trained with whisper timestamped data
  14. 14 25:50 Wrap up

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.