Serving the Future: KServe's Next Chapter Hosting LLMs and GenAI Models
CNCF [Cloud Native Computing Foundation] via YouTube
Live Online Classes in Design, Coding & AI — Small Classes, Free Retakes
Finance Certifications Goldman Sachs & Amazon Teams Trust
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Join Alexa Griffith and Tessa Pham from Bloomberg in this 29-minute conference talk exploring KServe's evolution for hosting Large Language Models (LLMs) and Generative AI models. Discover how KServe has transformed from a simple ML model deployment platform on Kubernetes to a comprehensive solution for managing advanced AI workloads at scale. Learn about KServe's latest features specifically designed for generative AI, including enhanced serving runtimes, scalability improvements, and integration strategies across hybrid environments. Gain practical insights from the speakers' dual perspective as both KServe maintainers and practitioners who implement these solutions in Bloomberg's production clusters. The presentation includes fun drawings to illustrate concepts and shares valuable real-world experiences and lessons learned when deploying and scaling generative models in production.
Syllabus
Serving the Future: KServe’s Next Chapter Hosting LLMs & GenAI Models... Alexa Griffith & Tessa Pham
Taught by
CNCF [Cloud Native Computing Foundation]