Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore post-training quantization and quantization-aware training techniques for neural network quantization in this comprehensive lecture from MIT's 6.S965 course. Delve into advanced topics such as binary/ternary quantization and mixed-precision quantization. Gain insights into efficient machine learning techniques that enable powerful deep learning applications on resource-constrained devices. Learn how to overcome challenges in deploying neural networks on mobile and IoT devices, and discover methods to accelerate neural network training. Access accompanying slides and additional course materials to enhance your understanding of efficient deep learning computing and TinyML.
Syllabus
Lecture 06 - Quantization (Part II) | MIT 6.S965
Taught by
MIT HAN Lab