Optical Interconnects for Large-Scale AI Clusters - A Meta Perspective
Open Compute Project via YouTube
AI, Data Science & Cloud Certificates from Google, IBM & Meta
Get 20% off all career paths from fullstack to AI
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore a technical presentation from Meta's Optical Engineer Andrew Alduino examining the critical role of optical interconnects in scaling AI infrastructure. Dive into the challenges of building large-scale GPU clusters like Meta's 24k GPU systems for LLama 3 training, focusing on how increasing AI workload demands impact IO requirements and accelerator package design. Learn about the limitations of electrical signaling solutions and discover how integrated optics offer promising alternatives with superior bandwidth capabilities. Understand the complex interplay between GPU hardware, system IO, rack design, power delivery, cooling technologies, and memory architectures in modern AI cluster development, with particular emphasis on optical interconnect optimization for next-generation AI infrastructure.
Syllabus
Optics in AI Clusters - Meta Perspective
Taught by
Open Compute Project