Latent Space Paper Club - AIEWF Special Edition - Test of Time, DeepSeek R1/V3
AI Engineer via YouTube
Overview
Syllabus
00:00:00 Paper Club Year in Review & Future Plans
00:08:00 DeepSeek Paper Discussion
00:09:10 DeepSeek R1 May 28th Update
00:12:40 DeepSeek Distillation
00:16:51 Original DeepSeek Model Overview DeepSeek V3 and R1
00:21:15 Development of reasoning capabilities through a pure RL process
00:24:46 DeepSeek R10
00:39:05 DeepSeek R1 four-stage training pipeline
00:35:01 Emergence of "reflection moments" and "aha moments"
00:44:15 Distillation Strategy
00:52:34 Community and Call to Action
Taught by
AI Engineer