Which GPU Sharing Strategy Is Right for You? A Comprehensive Benchmark Study Using Dynamic Resource Allocation
CNCF [Cloud Native Computing Foundation] via YouTube
2,000+ Free Courses with Certificates: Coding, AI, SQL, and More
Build with Azure OpenAI, Copilot Studio & Agentic Frameworks — Microsoft Certified
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore a comprehensive technical conference talk that delves into Dynamic Resource Allocation (DRA) in Kubernetes and its impact on GPU sharing strategies. Learn about different GPU sharing approaches including Multi-Instance GPUs, Multi-Process Service (MPS), and CUDA Time-Slicing through detailed benchmark comparisons. Discover which applications benefit most from each sharing strategy and how to combine multiple approaches for optimal performance. Gain practical insights through real-world application demonstrations and understand potential challenges and future enhancements in GPU resource management. Master the revolutionary capabilities of DRA for managing heterogeneous GPUs in a unified and configurable way, moving beyond traditional device plugin API limitations.
Syllabus
Which GPU Sharing Strategy Is Right for You? A Comprehensive Benchmark St... Kevin Klues & Yuan Chen
Taught by
CNCF [Cloud Native Computing Foundation]