GPU-Accelerated Workloads on KubeVirt - Scaling ML/AI in Kubernetes
CNCF [Cloud Native Computing Foundation] via YouTube
Get Coursera Plus for 40% off
Power BI Fundamentals - Create visualizations and dashboards from scratch
Overview
Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Explore how to enable GPU-accelerated virtual machines in KubeVirt for demanding AI/ML workloads in this lightning talk from the Cloud Native Computing Foundation. Learn the technical foundations for running GPU-backed VMs in Kubernetes environments and discover why this approach is gaining popularity for secure, scalable, and isolated inference pipelines. Examine the key differences between container-based and VM-based GPU allocation strategies, and understand how KubeVirt seamlessly integrates with CNCF tools like Prometheus and the Kubernetes scheduler to monitor and optimize performance. Gain practical insights into pushing KubeVirt beyond typical virtual machine use cases into production-ready machine learning and artificial intelligence workloads, with technical guidance on implementation and real-world applications for scaling ML/AI infrastructure in cloud-native environments.
Syllabus
GPU‑Accelerated Workloads on KubeVirt: Scaling ML/AI in Kuberne... Amandeep Singh and Shivani Tiwari
Taught by
CNCF [Cloud Native Computing Foundation]