Completed
08:31 Challenges and Rewards in GRPO
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Understanding GRPO: Group Relative Policy Optimization in Reinforcement Learning
Automatically move to the next video in the Classroom when playback concludes
- 1 00:00 Introduction to Reinforcement Learning
- 2 00:30 Understanding Supervised Fine Tuning
- 3 01:30 Exploring ORPO: Odds Ratio Preference Optimization
- 4 06:57 Diving into GRPO: Group Relative Policy Optimization
- 5 08:31 Challenges and Rewards in GRPO
- 6 14:12 History and Evolution of Policy Optimization
- 7 19:30 Trust Region Policy Optimization TRPO and Proximal Policy Optimization PPO
- 8 22:26 Simplifying PPO with GRPO
- 9 29:34 Final Thoughts on GRPO and Reinforcement Learning