The Fast and the Curious - Chasing Scalable AI Dreams with Kubernetes and k0rdent
Platform Engineering via YouTube
Overview
Learn how to deploy AI at scale on Kubernetes in this 14-minute conference talk, which addresses the challenges of GPU provisioning, cost management, and performance optimization. See how k0rdent automates the provisioning of GPU-ready clusters across cloud and on-premises environments. Explore serving AI models with KServe, scaling GPU resources dynamically with Knative autoscaling, and monitoring performance with Prometheus and Grafana. Learn strategies for maximizing compute efficiency and minimizing cost when GPUs are scarce and expensive. A live demonstration walks through the complete AI deployment workflow, from spinning up clusters to running real-time inference.
Syllabus
The fast and the curious: Chasing scalable AI dreams with Kubernetes and k0rdent - Bharath Nallapeta
Taught by
Platform Engineering