Latent Space Paper Club - AIEWF Special Edition - Test of Time, DeepSeek R1/V3

AI Engineer via YouTube

Overview

This 54-minute conference talk, recorded at the AI Engineer World's Fair in San Francisco, covers the technical details of the DeepSeek R1 and V3 models. It examines how reasoning capabilities develop through a pure reinforcement learning process, walks through the four-stage training pipeline, and discusses the emergence of "reflection moments" and "aha moments" in model reasoning, along with the distillation strategy used to optimize model performance. The session begins with a year-in-review of the Paper Club and its future plans, then moves on to DeepSeek's architecture, training methodologies, and the evolution from the original DeepSeek models to R1-Zero and R1. The talk concludes with community engagement opportunities and next steps for practitioners interested in applying similar approaches in their own AI projects.

Syllabus

00:00:00 Paper Club Year in Review & Future Plans
00:08:00 DeepSeek Paper Discussion
00:09:10 DeepSeek R1 May 28th Update
00:12:40 DeepSeek Distillation
00:16:51 Original DeepSeek Model Overview: DeepSeek V3 and R1
00:21:15 Development of reasoning capabilities through a pure RL process
00:24:46 DeepSeek R1-Zero
00:35:01 Emergence of "reflection moments" and "aha moments"
00:39:05 DeepSeek R1 four-stage training pipeline
00:44:15 Distillation Strategy
00:52:34 Community and Call to Action

Taught by

AI Engineer
