Overview
Explore the fundamentals of Vision Transformers in this MIT graduate-level lecture delivered by Professor Song Han as part of the EfficientML.ai series (MIT 6.5940, Fall 2024). The lecture covers the architectural principles, mechanisms, and applications of Vision Transformers in computer vision tasks. Learn how they adapt the transformer architecture, originally successful in natural language processing, to visual data, and examine their key components, operational workflow, and performance characteristics. See how Vision Transformers have reshaped computer vision by offering an alternative to traditional convolutional neural networks.
Syllabus
EfficientML.ai Lecture 16 - Vision Transformer (MIT 6.5940, Fall 2024)
Taught by
MIT HAN Lab