CafeGPT - Serving LLMs Like Coffee With Kubernetes
CNCF [Cloud Native Computing Foundation] via YouTube
Get 20% off all career paths from fullstack to AI
Google AI Professional Certificate - Learn AI Skills That Get You Hired
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Learn the fundamentals of serving Large Language Models (LLMs) using Kubernetes through an engaging coffee shop analogy in this 26-minute conference talk. Explore how Kubernetes has become the standard platform for LLM workloads while understanding the core concepts of LLM inference, efficient deployment strategies, and GPU scheduling without getting overwhelmed by the rapidly evolving ecosystem. Discover how to decouple fundamental principles from the diverse features offered by various Kubernetes-based solutions today. Master the intersection of Kubernetes and LLM inference systems through practical insights that make complex concepts accessible, all while learning parallels to running a successful cafe operation.
Syllabus
CafeGPT: Serving LLMs Like Coffee With Kubernetes - Madhav Jivrajani & Kartik Ramesh, UIUC
Taught by
CNCF [Cloud Native Computing Foundation]