Proximal Policy Optimization (PPO) and Group Relative Policy Optimization (GRPO) - Math Explained

Proximal Policy Optimization (PPO) and Group Relative Policy Optimization (GRPO) - Math Explained

Outlier via YouTube Direct link

21:15 GRPO

10 of 12

10 of 12

21:15 GRPO

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Proximal Policy Optimization (PPO) and Group Relative Policy Optimization (GRPO) - Math Explained

Automatically move to the next video in the Classroom when playback concludes

  1. 1 00:00 Introduction
  2. 2 01:17 Problem Statement
  3. 3 03:17 Intuitive Objective
  4. 4 04:07 Analytically Computable Objective
  5. 5 10:11 Return Function
  6. 6 12:07 Value Function
  7. 7 14:53 Importance Sampling
  8. 8 17:40 TRPO
  9. 9 19:16 PPO
  10. 10 21:15 GRPO
  11. 11 23:45 Summary
  12. 12 24:31 Outro

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.