Coursera Plus Annual is 40% Off — Ends July 6.

View

Class Central

Rankings
Career Certificates

Subjects

View all

Technology
Business
Creative
STEM
Health & Wellness
People & Society
Personal Growth & Lifestyle

View all Subjects

Universities
The Report

Courses from 1000+ universities

Rankings

Best Courses

Best of All Time
Best of the Year

Most Popular Courses

Most Popular of All Time
Most Popular of the Year

Career Certificates

Generative AI
UX Design
Data Science
Finance
DevOps
Project Management
View all Career Certificates

Technology

Computer Science
Artificial Intelligence
Data Science
Web Development
Programming
Databases
IT & Networking
DevOps
UX/UI Design
Generative AI
Cloud Computing
Cybersecurity
Game Development
Product Management
View all Technology

Business

Management & Leadership
Finance
Entrepreneurship
Marketing
Strategic Management
Industry Specific
Project Management
Sales
Business Software
Real Estate
Professional Development
Accounting
Supply Chain Management
Business Strategy
Human Resources
View all Business

Creative

Music
Digital Media
Visual Arts
Design
Crafts
Performing Arts
AI Art
View all Creative

STEM

Mathematics
Engineering
Science
Statistics & Probability
Chemistry
Physics
Environmental Science
Astronomy
Biology
View all STEM

Health & Wellness

Nutrition & Wellness
Disease & Disorders
Public Health
Health Care
Nursing
Mental Health
Continuing Medical Education (CME)
Medicine
Wellness
Nutrition
View all Health & Wellness

People & Society

Humanities
Education & Teaching
Social Sciences
History
Literature
Sociology
Economics
Psychology
Anthropology
Political Science
Law
Language Learning
Writing
Philosophy
Religion
Sustainability
View all People & Society

Personal Growth & Lifestyle

Personal Development
Sports & Recreation
Personal Finance
Parenting & Family
Food & Drink
Self-Defense & Martial Arts
Gardening
Productivity & Time Management
Games
Study Skills
Travel
Pets & Pet Care
Beauty & Makeup
Critical Thinking
View all Personal Growth & Lifestyle

The Report

A Simplilearn Certificate Goes on LinkedIn Each Minute; Here’s How Krishna Kumar Built It From a Blog

17 years ago, Krishna Kumar started offering free PMP prep online. Today, it’s a leading digital upskilling platform that helps millions upskill in AI, cybersecurity, data science, and more.

Class Central Team Jun 22, 2026

Latest

8 Best SolidWorks Courses for 2026
The Business of Online Education: Khan Academy Tax Returns Analysis (2008–2025)
10 Best Spring Boot Courses for 2026: Beginner to Advanced
Massive List of Online Learning Platforms in China
Best SQL Courses for 2026: Top 14 from 3,300+

Write for The Report

Visit The Report

600 Free Google Certifications
Trending

Most common

Popular subjects

Computer Science
52,161 courses
Artificial Intelligence
32,440 courses
Digital Skills
437 courses

Popular courses

Introduction to HTML5
University of Michigan
Rome: A Virtual Tour of the Ancient City
University of Reading
The Modern World, Part One: Global History from 1760 to 1910
University of Virginia

Organize and share your learning with Class Central Lists.

View our Lists Showcase

0 Reviews

Start learning

Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Technology
Computer Science
Machine Learning
MLOps

Technology
Computer Science
Machine Learning
MLOps

Business
Business Strategy
Cost Reduction

Technology
Artificial Intelligence
Mistral AI

Technology
Computer Science
Machine Learning
LLM Inference

Business
Business Strategy
Cost Optimization

Exploring the Latency, Throughput, and Cost Space for LLM Inference

MLOps.community via YouTube

Write review

Start learning Write review

Details

Start learning

Provider

YouTube
Pricing

Free Video
Languages

English
Effort

30 minutes
Sessions

Self-Paced
Level

Advanced

Found in

MLOps Courses
Cost Reduction Courses
Mistral AI Courses
LLM Inference Courses
Cost Optimization Courses

Explore the intricacies of LLM inference stacks in this 30-minute conference talk by Timothée Lacroix, CTO of Mistral. Delve into the process of selecting the optimal model for specific tasks, choosing appropriate hardware, and implementing efficient inference code. Examine popular inference stacks and setups, uncovering the factors that contribute to inference costs. Gain insights into leveraging current open-source models effectively and learn about the limitations in existing open-source serving stacks. Discover the potential advancements that future generations of models may bring to the field of LLM inference.

Syllabus

Exploring the Latency/Throughput & Cost Space for LLM Inference // Timothée Lacroix // CTO Mistral

Taught by

MLOps.community

Related Courses

Ad

Become an AI & ML Engineer with Cal Poly EPaCE — IBM-Certified Training

Learn More →
Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve
Getting Started with Mistral
LLM Fine-Tuning for Modern AI Teams - How One E-Commerce Unicorn Cut Inference Cost by 90%
Scale to 0 LLM Inference: Cost Efficient Open Model Deployment on Serverless GPUs
Scaling Ultra Low Latency LLM Inference

PowerBI Data Analyst - Create visualizations and dashboards from scratch Ad
[2026] Generative AI Mastery: 900+ Courses to Develop Your AI Superpowers
[2026] Massive List of Thousands of Free Certificates and Badges
12 Best Applied AI & ML Courses for 2026
A Free Tool to Learn Languages Through Netflix and YouTube: Language Reactor Review
5 Best YouTube Marketing Courses for Business in 2026