Machine Learning Infrastructure at Facebook Scale
MLOps World: Machine Learning in Production via YouTube
Python, Prompt Engineering, Data Science — Build the Skills Employers Want Now
Build AI Apps with Azure, Copilot, and Generative AI — Microsoft Certified
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore the challenges and solutions in scaling machine learning infrastructure at Facebook in this 18-minute conference talk from MLOps World: Machine Learning in Production. Gain insights into how Facebook's AI Infrastructure team reimagined their entire stack to support rapidly growing ranking models serving over a billion users. Discover the approach taken to redesign and scale the infrastructure, including the creation of specialized hardware using powerful GPUs and network devices, and the development of optimized distributed training algorithms using PyTorch. Learn from Senior AI Infra Engineer Shivam Bharuka as he shares his experience in driving performance, reliability, and efficiency-oriented designs across Facebook's AI Infrastructure components.
Syllabus
Machine Learning Infrastructure at Facebook Scale
Taught by
MLOps World: Machine Learning in Production