Coursera Plus Annual is 40% Off — Ends July 6.

View

Class Central

Rankings
Career Certificates

Subjects

View all

Technology
Business
Creative
STEM
Health & Wellness
People & Society
Personal Growth & Lifestyle

View all Subjects

Universities
The Report

Courses from 1000+ universities

Rankings

Best Courses

Best of All Time
Best of the Year

Most Popular Courses

Most Popular of All Time
Most Popular of the Year

Career Certificates

Generative AI
UX Design
Data Science
Finance
DevOps
Project Management
View all Career Certificates

Technology

Computer Science
Artificial Intelligence
Data Science
Web Development
Programming
Databases
IT & Networking
DevOps
UX/UI Design
Generative AI
Cloud Computing
Cybersecurity
Game Development
Product Management
View all Technology

Business

Management & Leadership
Finance
Entrepreneurship
Marketing
Strategic Management
Industry Specific
Project Management
Sales
Business Software
Real Estate
Professional Development
Accounting
Supply Chain Management
Business Strategy
Human Resources
View all Business

Creative

Music
Digital Media
Visual Arts
Design
Crafts
Performing Arts
AI Art
View all Creative

STEM

Mathematics
Engineering
Science
Statistics & Probability
Chemistry
Physics
Environmental Science
Astronomy
Biology
View all STEM

Health & Wellness

Nutrition & Wellness
Disease & Disorders
Public Health
Health Care
Nursing
Mental Health
Continuing Medical Education (CME)
Medicine
Wellness
Nutrition
View all Health & Wellness

People & Society

Humanities
Education & Teaching
Social Sciences
History
Literature
Sociology
Economics
Psychology
Anthropology
Political Science
Law
Language Learning
Writing
Philosophy
Religion
Sustainability
View all People & Society

Personal Growth & Lifestyle

Personal Development
Sports & Recreation
Personal Finance
Parenting & Family
Food & Drink
Self-Defense & Martial Arts
Gardening
Productivity & Time Management
Games
Study Skills
Travel
Pets & Pet Care
Beauty & Makeup
Critical Thinking
View all Personal Growth & Lifestyle

The Report

A Simplilearn Certificate Goes on LinkedIn Each Minute; Here’s How Krishna Kumar Built It From a Blog

17 years ago, Krishna Kumar started offering free PMP prep online. Today, it’s a leading digital upskilling platform that helps millions upskill in AI, cybersecurity, data science, and more.

Class Central Team Jun 22, 2026

Latest

8 Best SolidWorks Courses for 2026
The Business of Online Education: Khan Academy Tax Returns Analysis (2008–2025)
10 Best Spring Boot Courses for 2026: Beginner to Advanced
Massive List of Online Learning Platforms in China
Best SQL Courses for 2026: Top 14 from 3,300+

Write for The Report

Visit The Report

600 Free Google Certifications
Trending

Most common

Popular subjects

Computer Science
52,161 courses
Artificial Intelligence
32,440 courses
Digital Skills
437 courses

Popular courses

Introduction to HTML5
University of Michigan
Rome: A Virtual Tour of the Ancient City
University of Reading
The Modern World, Part One: Global History from 1760 to 1910
University of Virginia

Organize and share your learning with Class Central Lists.

View our Lists Showcase

0 Reviews

Start learning

Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Technology
Computer Science
Machine Learning
MLOps

Technology
Artificial Intelligence
Natural Language Processing (NLP)

Technology
Computer Science
Machine Learning
MLOps

Technology
Computer Science
Machine Learning
Quantization

Technology
Computer Science
Machine Learning

Making LLM Inference Affordable - Part 2

MLOps.community via YouTube

Write review

Start learning Write review

Details

Start learning

Provider

YouTube
Pricing

Free Video
Languages

English
Effort

32 minutes
Sessions

Self-Paced
Level

Advanced

Found in

MLOps Courses
Natural Language Processing (NLP) Courses
Quantization Courses
Machine Learning Courses

Explore techniques for making large language model (LLM) inference more affordable and efficient in this 32-minute conference talk by Daniel Campos at the LLMs in Production Conference. Learn about the challenges of using foundational models and APIs, and discover alternatives like self-hosting models. Delve into methods for optimizing model performance within latency and inference budgets, including pseudo-labeling, knowledge distillation, pruning, and quantization. Gain insights from Campos' extensive experience in NLP, ranging from his work at Microsoft on Bing's ranking system to his current Ph.D. research on efficient LLM inference and robust dense retrieval at the University of Illinois Urbana Champaign.

Syllabus

Making LLM Inference Affordable // Daniel Campos // LLMs in Production Conference Part 2

Taught by

MLOps.community

Related Courses

Ad

Power BI Fundamentals - Create visualizations and dashboards from scratch

Learn More →
Machine Learning Model Optimization
LLM Inference Optimization
Efficiently Serving LLMs
Distillation, Quantization, and Pruning in Advanced NLP - Lecture 11
AWQ for LLM Quantization - Efficient Inference Framework for Large Language Models

The Private Equity Associate Certification Ad
9 Best Vector Database Courses for 2026: Build RAG Apps and Semantic Search
Write Prompts That Actually Work: ZTM’s Prompt Engineering Bootcamp Review
12 Best Applied AI & ML Courses for 2026
11 Best Embeddings & Transformer Models Courses in 2026 (Free & Paid): Word2Vec, Vector Search, and RAG
[2026] Generative AI Mastery: 900+ Courses to Develop Your AI Superpowers