Exploring the High-dimensional Random Landscapes of Data Science - 3/3
Institut des Hautes Etudes Scientifiques (IHES) via YouTube
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore the complex optimization challenges in machine learning and data science through this comprehensive lecture examining high-dimensional random landscapes. Delve into the topological complexity of random functions in very high dimensions and analyze how standard optimization algorithms like Stochastic Gradient Descent (SGD) perform in these challenging contexts. Begin with an accessible introduction to the framework covering typical machine learning tasks and neural network structures before progressing to classical SGD applications in finite dimensions. Examine the Tensor PCA model as a key example, understanding its connection to spherical spin glasses and surveying its topological properties in single spike estimation tasks. Advance to single index models and discover the concept of "effective dynamics" and "summary statistics" that operate in reduced dimensions to govern algorithm performance. Learn how systems identify these summary statistics through dynamical spectral transitions, where Gram matrices and Hessian matrices develop outliers along optimization trajectories. Master essential Random Matrix Theory tools including edge spectrum behavior and the BBP transition within broader mathematical contexts. Apply these concepts to practical machine learning examples including multilayer neural networks for Gaussian mixture classification, XOR problems, and multi-spike Tensor PCA models, drawing from cutting-edge research in optimization theory and statistical physics.
Syllabus
Gérard Ben Arous - 3/3 Exploring the High-dimensional Random Landscapes of Data Science
Taught by
Institut des Hautes Etudes Scientifiques (IHES)