Group Relative Policy Optimization (GRPO) - Formula and Implementation Tutorial

Group Relative Policy Optimization (GRPO) - Formula and Implementation Tutorial

Yacine Mahdid via YouTube Direct link

- GRPO Trainer code: 13:21

6 of 7

6 of 7

- GRPO Trainer code: 13:21

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Group Relative Policy Optimization (GRPO) - Formula and Implementation Tutorial

Automatically move to the next video in the Classroom when playback concludes

  1. 1 - Introduction: 0:00
  2. 2 - PPO vs GRPO: 1:18
  3. 3 - PPO formula overview: 4:24
  4. 4 - GRPO formula overview: 7:49
  5. 5 - GRPO pseudo code: 11:11
  6. 6 - GRPO Trainer code: 13:21
  7. 7 - Conclusion: 23:48

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.