Understanding Reasoning LLMs - o1, DeepSeek-R1, Gemini Thinking, Grok 3, Claude 3.7

Understanding Reasoning LLMs - o1, DeepSeek-R1, Gemini Thinking, Grok 3, Claude 3.7

Donato Capitella via YouTube Direct link

21:55 - Limitations and challenges of reasoning LLMs

10 of 10

10 of 10

21:55 - Limitations and challenges of reasoning LLMs

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Understanding Reasoning LLMs - o1, DeepSeek-R1, Gemini Thinking, Grok 3, Claude 3.7

Automatically move to the next video in the Classroom when playback concludes

  1. 1 00:00 - Introduction
  2. 2 02:42 - What are reasoning models?
  3. 3 03:56 - The four approaches to building "reasoning" LLMs
  4. 4 04:31 - Inference-time scaling
  5. 5 06:46 - Standard LLM training pipeline
  6. 6 08:26 - Pure Reinforcement Learning DeepSeek R1-Zero
  7. 7 12:21 - Supervised Fine Tuning + Reinforcement Learning DeepSeek R1
  8. 8 17:20 - Summary of STF+RF approach DeepSeek R1
  9. 9 18:18 - Distillation
  10. 10 21:55 - Limitations and challenges of reasoning LLMs

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.