The Unreasonable Effectiveness of Mathematics in Large Scale Deep Learning - From Theory to Practice
Centre for Networked Intelligence, IISc via YouTube
AI Adoption - Drive Business Value and Organizational Impact
Free courses from frontend to fullstack and AI
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore the groundbreaking mathematical theory behind infinite-width neural networks in this research lecture from Microsoft Research's Greg Yang, presented at the Centre for Networked Intelligence, IISc. Discover muTransfer, a revolutionary technology enabling efficient tuning of massive neural networks like GPT-3's 6.7 billion parameter model using only 7% of its pretraining compute resources. Learn about the Optimal Scaling Thesis and its crucial connection between infinite-size limits and practical large model design. Delve into key mathematical concepts including geometrical intuition, infinite slopes, pure mathematical arguments, and dynamical economy theorems that shape the future of AI development. Yang, a Harvard graduate and winner of prestigious mathematics awards including the Hoopes prize, presents compelling research questions whose answers could revolutionize artificial intelligence development.
Syllabus
Intro
Results
Questions
Support sample
Conclusion
Table
Prioritization
Geometrical Intuition
Infinite Slopes
Pure mathematic argument
dynamical economy theorem
sensible argument
summer trends
Taught by
Centre for Networked Intelligence, IISc