Overview
Explore Netflix's development of Mako, a next-generation machine learning training platform designed for large-scale AI workloads, in this 33-minute conference talk from Ray Summit 2025. Learn how Netflix engineers Avin Regmi and Matan Appelbaum evolved the company's legacy training infrastructure to handle increasingly complex models, larger datasets, and rapidly growing GPU requirements. Discover the architectural decisions behind Netflix's custom GPU scheduler that improves utilization, reduces fragmentation, and ensures efficient execution of large multi-node training jobs. Examine the key components of resource orchestration, distributed execution, and system resilience that enable Netflix to scale training across diverse workloads. Understand how Ray's flexible distributed runtime integrates into high-performance training pipelines and supports critical platform components. Gain practical insights into designing modern ML training platforms, optimizing GPU usage at enterprise scale, and leveraging distributed computing frameworks to build robust AI infrastructure capable of supporting the next generation of machine learning applications.
Syllabus
Inside Netflix’s Mako: The Next-Gen ML Training Platform | Ray Summit 2025
Taught by
Anyscale