Master Production-Ready Machine Learning, Step by Step
Become an AI & ML Engineer with Cal Poly EPaCE — IBM-Certified Training
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Learn how to optimize the utilization of scarce LLM accelerator resources in this 17-minute conference talk from SREcon25 Europe/Middle East/Africa. Discover strategies for maximizing the efficiency of accelerators used for serving Large Language Models, understanding that these resources are extremely limited both globally and within organizations. Explore practical approaches to demonstrate effective resource usage and justify continued access to these valuable computing assets. Gain insights from Google's experience in managing LLM infrastructure and learn why proving efficient utilization is critical for maintaining access to accelerator resources in competitive environments.
Syllabus
SREcon25 Europe/Middle East/Africa - Maximizing Utilization for LLM Accelerators
Taught by
USENIX