LLM Serving - The 4 Hard Truths No One Tells You

InfoQ via YouTube

Overview

Discover the critical engineering challenges of scaling Large Language Models (LLMs) to production in this 50-minute conference talk by Charlotte Qi from Meta's AI Infrastructure team. The talk covers four hard truths that engineers face when building real-world LLM serving infrastructure, going beyond simple model deployment to address optimization for speed, reliability, and cost at scale. It draws on Meta's experience with AI infrastructure and highlights the practical considerations of serving LLMs in enterprise environments that theoretical discussions of AI deployment often overlook.

Syllabus

LLM Serving: The 4 Hard Truths No One Tells You

Taught by

InfoQ
