Overview
Join this MLOps Podcast episode featuring Erica Hughberg, Community Advocate at Tetrate, as she discusses why API infrastructure needs to evolve to handle GenAI traffic. Learn why traditional API gateways, optimized for microservices, struggle with generative AI workloads, which require long-lived connections, massive payloads, and streaming responses. Explore the evolution from the C10K problem through the microservices revolution to today's AI-driven challenges, which demand new approaches to token-based rate limiting, cost-aware request shaping, and scaling AI inference traffic. Hughberg, a technical leader and maintainer of Envoy AI Gateway, shares insights on building scalable, secure application platforms while making complex technical concepts accessible through engaging narratives.
Syllabus
[00:00]
Taught by
MLOps.community