Learn AI, Data Science & Business — Earn Certificates That Get You Hired
PowerBI Data Analyst - Create visualizations and dashboards from scratch
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
This video explores two groundbreaking AI research studies focusing on scaling reinforcement learning through structured cognitive behaviors and extended chain-of-thought reasoning. Dive into how researchers from Stanford University identified four key cognitive behaviors that enable self-improving reasoners, while teams from IN.AI, Tsinghua University, and Carnegie Mellon University work to demystify long chain-of-thought reasoning in Large Language Models. Learn how these complementary approaches create a comprehensive roadmap for developing AI systems that not only solve complex problems but can also explain their reasoning processes in both scientifically precise and intuitively accessible ways. The 34-minute presentation examines how a 3B parameter AI model can be enhanced through these techniques, offering valuable insights for anyone interested in the latest advancements in AI reasoning capabilities and reinforcement learning scaling methods.
Syllabus
Scaling RL: 3B AI w Long Chain-of-Thought & 4 Patterns
Taught by
Discover AI