Transcription Models Zero to Hero - Data Prep, Train and Serve

Transcription Models Zero to Hero - Data Prep, Train and Serve

Trelis Research via YouTube Direct link

Model training progress and high grad norm observation

11 of 18

11 of 18

Model training progress and high grad norm observation

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Transcription Models Zero to Hero - Data Prep, Train and Serve

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Introduction and overview of pipeline
  2. 2 Data requirements: audio recordings and transcripts
  3. 3 Uploading audio-text pairs and dataset preparation
  4. 4 Saving word swaps and transcribing with model for better training
  5. 5 Warning about clean text being out-of-distribution for small models
  6. 6 Setting up Hugging Face token and Weights & Biases key
  7. 7 Creating validation set using ChatGPT to rephrase text
  8. 8 Configuring training settings and advanced parameters
  9. 9 Baseline evaluation shows 7.09% word error rate
  10. 10 Training begins with falling loss and word error rate
  11. 11 Model training progress and high grad norm observation
  12. 12 Model and logs pushed to Hugging Face Hub
  13. 13 Inspecting evaluation results and specific corrections
  14. 14 Spelling improvements and regressions in fine-tuned model
  15. 15 Deploying model to endpoint with keep warm feature
  16. 16 Auto-sleep containers and API key access options
  17. 17 Testing endpoint and transcript download formats
  18. 18 Evaluation tab features and future text-to-speech plans

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.