DeepSeek R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

DeepSeek R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

AI Bites via YouTube Direct link

0:00 - Intro

1 of 8

1 of 8

0:00 - Intro

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

DeepSeek R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Automatically move to the next video in the Classroom when playback concludes

  1. 1 0:00 - Intro
  2. 2 2:38 - Training LLMs
  3. 3 5:05 - DeepSeek R1 Zero Training
  4. 4 5:54 - Group Relative Policy Optimization
  5. 5 8:45 - Reward Modelling
  6. 6 10:21 - Training Performance
  7. 7 11:33 - Self-evolution
  8. 8 17:20 - Results

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.