The State of LLM Compression — From Research to Production

Neural Magic via YouTube

Overview

This seminar from the weekly "Random Samples" AI series surveys the state of Large Language Model (LLM) compression, connecting research results to practical implementation. It covers the challenges of managing very large LLMs and explains compression methods, including quantization and sparsity, along with their accuracy-performance tradeoffs. It also examines how academic benchmarks differ from real-world applications, which compression techniques are production-ready and which are still in the research phase, and strategies for optimizing LLM deployment across different computing environments. Session slides are included. The presentation is designed for AI developers, data scientists, and researchers who want to build more efficient generative AI systems.

Syllabus

Random Samples: The State of LLM Compression — From Research to Production

Taught by

Neural Magic
