

The Missing Piece in Scalable AI Inference

Linode via YouTube

Overview

Explore how AI gateways change the way AI models are deployed and managed as APIs in this 13-minute conference talk. Learn why these gateways have become essential infrastructure for scalability, cost efficiency, versioning, and responsible AI governance. The talk covers the shift from AI training to inference, the growth in AI APIs that need robust management, and how semantic caching improves performance and reduces cost. It also examines ContentGuard's role in enforcing enterprise-grade responsible AI policies, with demonstrations of sentiment analysis and chat completions that include integrated security and efficiency features. Finally, see how Traefik's AI Gateway integrates with LKE (Linode Kubernetes Engine) to optimize AI workloads on Akamai Cloud, providing intelligent traffic management, streamlined model serving, and real-time request routing for scalable AI inference.
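
For readers new to semantic caching, the idea can be illustrated with a minimal sketch. This is not the talk's or Traefik's implementation: the `SemanticCache` class, the `cached_completion` helper, the toy bag-of-words `embed` function, and the 0.8 similarity threshold are all hypothetical, chosen only to keep the example self-contained. A production gateway would typically use a real embedding model and a vector store.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding: lowercase bag-of-words counts (stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


class SemanticCache:
    """Hypothetical cache that matches prompts by similarity, not exact text."""

    def __init__(self, threshold=0.8):
        self.threshold = threshold  # assumed cutoff for treating prompts as equivalent
        self.entries = []  # list of (embedding, response) pairs

    def lookup(self, prompt):
        """Return the cached response of the most similar prompt, or None."""
        query = embed(prompt)
        best_score, best_response = 0.0, None
        for vec, response in self.entries:
            score = cosine(query, vec)
            if score > best_score:
                best_score, best_response = score, response
        return best_response if best_score >= self.threshold else None

    def store(self, prompt, response):
        self.entries.append((embed(prompt), response))


def cached_completion(cache, prompt, call_model):
    """Serve from the cache when a similar prompt was seen; otherwise call the model."""
    hit = cache.lookup(prompt)
    if hit is not None:
        return hit, True  # cache hit: no model call, so no inference cost
    response = call_model(prompt)
    cache.store(prompt, response)
    return response, False
```

In this sketch, a repeated or lightly reworded prompt is answered from the cache without invoking the model, which is where the latency and cost savings would come from; an unrelated prompt falls below the threshold and goes to the model as usual.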

Syllabus

The Missing Piece in Scalable AI Inference

Taught by

Linode

