Overview
Syllabus
00:00 Introduction to Tracking Loss in Model Training
00:50 Understanding Custom Metrics
01:22 Teacher Forced vs. Auto Regressive Decoding
01:32 Training vs. Inference Libraries
01:43 Options for Computing Custom Metrics
02:00 Challenges with Custom Metrics
03:44 Teacher Forced Decoding Explained
05:55 Auto Regressive Decoding Explained
08:19 Combining Training and Inference
09:22 Implementing Custom Metrics in Training
09:31 GRPO and Custom Metrics
12:11 Memory Management in Custom Metrics
15:44 Practical Demonstration: Setting Up the Environment
18:05 Loading and Training the Model
26:14 Setting Up the Training and Evaluation Dataset
26:31 Formatting the Data for Training
27:35 Custom Optimizer Setup
28:03 Loading and Printing Trainable Parameters
29:30 Fine-Tuning the Model
30:03 Custom Prediction Step and Metrics
31:42 Auto Aggressive Decoding and Evaluation
46:40 Handling Memory and GPU Utilization
49:55 Conclusion and Resources
Taught by
Trelis Research