Free courses from frontend to fullstack and AI
AI Engineer - Learn how to integrate AI into software applications
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore groundbreaking AI research that reveals how reinforcement learning fundamentally transforms large language models through a counterintuitive geometric optimization process. Discover why RLVR (Reinforcement Learning from Human Feedback) achieves dramatic reasoning improvements despite appearing to make only sparse parameter updates, resolving a major paradox in modern AI training. Learn about the revolutionary "Three-Gate Theory" that demonstrates how pre-trained models guide optimizers into low-curvature, off-principal subspaces rather than overwriting critical parameters. Understand the geometric principles underlying how advanced AI models truly learn and why traditional parameter-efficient fine-tuning methods often fail in reinforcement learning contexts. Examine research from Meta AI and the University of Texas at Austin that fundamentally challenges conventional understanding of neural network optimization and provides new insights into the physics of machine learning at scale.
Syllabus
AI Learns in Low-Curvature Subspaces (RLVR)
Taught by
Discover AI