Overview
This 16-minute conference talk from NSDI '24 presents Parcae, a system that reduces the cost of training large deep neural networks (DNNs) by running on preemptible cloud instances. Rather than reacting to preemptions after they occur, Parcae adapts proactively to predicted changes in resource availability by optimizing 'liveput', a novel metric that combines throughput and robustness. Its key components include lightweight instance migration and an availability predictor. Parcae is reported to outperform existing spot-instance DNN training systems by up to 10 times and to achieve near-optimal performance when training large DNNs under frequent preemptions, a scenario where current approaches struggle to make progress.
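To make the idea of a metric that combines throughput and robustness more concrete, here is a minimal Python sketch of one plausible reading: the expected throughput of a parallel configuration when some instances may be preempted. This is an illustration only, not Parcae's actual definition or code. The function names, the assumption that each instance is preempted independently with the same probability, and the toy throughput model (a pipeline contributes only while all of its stages survive) are all assumptions introduced here.

```python
from math import comb


def expected_throughput(throughput_by_survivors, n_instances, p_preempt):
    """Expected throughput if each of n_instances is preempted independently
    with probability p_preempt (an illustrative assumption, not from the talk).

    throughput_by_survivors: function mapping the number of surviving
    instances to the throughput the configuration can still deliver.
    """
    total = 0.0
    for k in range(n_instances + 1):  # k = number of preempted instances
        prob = comb(n_instances, k) * p_preempt**k * (1 - p_preempt) ** (n_instances - k)
        total += prob * throughput_by_survivors(n_instances - k)
    return total


def pipeline_config(depth, rate_per_pipeline):
    """Hypothetical toy model: survivors are regrouped into pipelines of
    `depth` stages, and each complete pipeline delivers rate_per_pipeline."""
    def throughput(survivors):
        return (survivors // depth) * rate_per_pipeline
    return throughput


if __name__ == "__main__":
    shallow = pipeline_config(depth=2, rate_per_pipeline=1.0)
    deep = pipeline_config(depth=4, rate_per_pipeline=2.2)
    for p in (0.0, 0.1, 0.3):
        print(p,
              round(expected_throughput(shallow, 8, p), 3),
              round(expected_throughput(deep, 8, p), 3))
```

In this toy model, the deeper configuration is faster when nothing is preempted, but it loses more work per lost instance, so comparing configurations by expected throughput under a predicted preemption rate can favor a different choice than comparing them by peak throughput alone.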
Syllabus
NSDI '24 - Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances
Taught by
USENIX