Overview
Explore a conference talk from USENIX ATC '24 that introduces Centimani, a novel performance predictor designed to streamline AI accelerator selection for DNN training. Delve into the challenges of choosing optimal hardware for deep neural network models as AI-specific accelerators proliferate. Learn how Centimani accurately predicts DNN training throughput across various accelerators, enabling informed decision-making based on objectives like performance or cost-efficiency. Discover the innovative approach involving memory estimation and decoupled performance models to determine ideal batch sizes and forecast execution times. Examine the validation results showcasing Centimani's high prediction accuracy for both single and multi-device training scenarios across multiple DNN models and accelerators.
Syllabus
USENIX ATC '24 - Centimani: Enabling Fast AI Accelerator Selection for DNN Training with a Novel...
Taught by
USENIX