Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

The Fast and the Curious - Chasing Scalable AI Dreams with Kubernetes and k0rdent

Platform Engineering via YouTube

Overview

Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Learn how to deploy AI at scale on Kubernetes through this 14-minute conference talk that addresses the complex challenges of GPU provisioning, cost management, and performance optimization. Discover how k0rdent automates GPU-ready cluster provisioning to make AI deployment seamless across both cloud and on-premises environments. Explore techniques for serving AI models with KServe, implementing dynamic GPU resource scaling with Knative auto-scaling, and monitoring performance using Prometheus and Grafana. Master strategies for maximizing compute efficiency while minimizing costs in an environment where GPUs are both scarce and expensive. Follow along with a live demonstration that walks through the complete AI deployment workflow, from spinning up clusters to running real-time inference, and gain practical insights into streamlining AI operations with Kubernetes without the typical complexity.

Syllabus

The fast and the curious: Chasing scalable AI dreams with Kubernetes and k0rdent - Bharath Nallapeta

Taught by

Platform Engineering

Reviews

Start your review of The Fast and the Curious - Chasing Scalable AI Dreams with Kubernetes and k0rdent

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.