Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Learn about the fundamental mechanisms behind transformer architectures in this technical lecture from MIT professor Ankur Moitra, delivered as part of the Simons Institute's Special Year on Large Language Models and Transformers Boot Camp. Dive deep into the inner workings of transformer models, exploring their key components, architectural design principles, and the mathematical foundations that make them so effective for natural language processing tasks. Gain valuable insights into attention mechanisms, positional encodings, and the overall structure that has made transformers the backbone of modern language models.
Syllabus
How Do Transformers Work?
Taught by
Simons Institute