llm-d - Kubernetes Native Distributed Inferencing

DevConf via YouTube

Overview

Learn how to serve large language models at scale with llm-d, a Kubernetes-native solution for distributed inference that supports any model across diverse hardware accelerators. Robert Shaw presents this 33-minute conference talk from DevConf.US 2025.

Syllabus

llm-d: Kubernetes Native Distributed Inferencing - DevConf.US 2025

Taught by

DevConf
