Learn Generative AI, Prompt Engineering, and LLMs for Free
Google, IBM & Meta Certificates — 40% Off for a Limited Time
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore post-training quantization and quantization-aware training techniques for neural network quantization in this comprehensive lecture from MIT's 6.S965 course. Delve into advanced topics such as binary/ternary quantization and mixed-precision quantization. Gain insights into efficient machine learning techniques that enable powerful deep learning applications on resource-constrained devices. Learn how to overcome challenges in deploying neural networks on mobile and IoT devices, and discover methods to accelerate neural network training. Access accompanying slides and additional course materials to enhance your understanding of efficient deep learning computing and TinyML.
Syllabus
Lecture 06 - Quantization (Part II) | MIT 6.S965
Taught by
MIT HAN Lab