Google AI Professional Certificate - Learn AI Skills That Get You Hired
The Most Addictive Python and SQL Courses
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore groundbreaking AI research that reveals how reinforcement learning fundamentally transforms large language models through a counterintuitive geometric optimization process. Discover why RLVR (Reinforcement Learning from Human Feedback) achieves dramatic reasoning improvements despite appearing to make only sparse parameter updates, resolving a major paradox in modern AI training. Learn about the revolutionary "Three-Gate Theory" that demonstrates how pre-trained models guide optimizers into low-curvature, off-principal subspaces rather than overwriting critical parameters. Understand the geometric principles underlying how advanced AI models truly learn and why traditional parameter-efficient fine-tuning methods often fail in reinforcement learning contexts. Examine research from Meta AI and the University of Texas at Austin that fundamentally challenges conventional understanding of neural network optimization and provides new insights into the physics of machine learning at scale.
Syllabus
AI Learns in Low-Curvature Subspaces (RLVR)
Taught by
Discover AI