Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

CNCF [Cloud Native Computing Foundation]

Inference Awakens - Tools for the Age of GenAI

CNCF [Cloud Native Computing Foundation] via YouTube

Overview

Coursera Spring Sale
40% Off Coursera Plus Annual!
Grab it
Explore the evolution from traditional stateless microservices to modern GenAI platforms in this conference talk that addresses the challenges of serving AI inference workloads at scale. Learn how the shift from simple REST APIs to streaming tokens, prompt orchestration, and GPU-aware routing is exposing the limitations of traditional gateways and requiring new architectural approaches. Discover a real-world reference architecture built with open-source tools including Envoy AI Gateway and KServe that supports dynamic model-based routing, token-level rate limiting, secure upstream authentication, comprehensive observability, and multi-provider failover capabilities. Understand why these features have become essential requirements rather than optional enhancements for reliable AI inference systems. Gain practical insights into routing, serving, and monitoring LLM traffic while exploring how current CNCF tools are adapting to meet the demands of the GenAI era, leaving you with a concrete blueprint for implementing scalable AI inference infrastructure.

Syllabus

Inference Awakens: Tools for the Age of GenAI - Alexa Griffith, Bloomberg & Erica Hughberg, Tetrate

Taught by

CNCF [Cloud Native Computing Foundation]

Reviews

Start your review of Inference Awakens - Tools for the Age of GenAI

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.