Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

The Unreasonable Effectiveness of Mathematics in Large Scale Deep Learning - From Theory to Practice

Centre for Networked Intelligence, IISc via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore the groundbreaking mathematical theory behind infinite-width neural networks in this research lecture from Microsoft Research's Greg Yang, presented at the Centre for Networked Intelligence, IISc. Discover muTransfer, a revolutionary technology enabling efficient tuning of massive neural networks like GPT-3's 6.7 billion parameter model using only 7% of its pretraining compute resources. Learn about the Optimal Scaling Thesis and its crucial connection between infinite-size limits and practical large model design. Delve into key mathematical concepts including geometrical intuition, infinite slopes, pure mathematical arguments, and dynamical economy theorems that shape the future of AI development. Yang, a Harvard graduate and winner of prestigious mathematics awards including the Hoopes prize, presents compelling research questions whose answers could revolutionize artificial intelligence development.

Syllabus

Intro
Results
Questions
Support sample
Conclusion
Table
Prioritization
Geometrical Intuition
Infinite Slopes
Pure mathematic argument
dynamical economy theorem
sensible argument
summer trends

Taught by

Centre for Networked Intelligence, IISc

Reviews

Start your review of The Unreasonable Effectiveness of Mathematics in Large Scale Deep Learning - From Theory to Practice

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.