Learn Excel & Financial Modeling the Way Finance Teams Actually Use Them
Learn Backend Development Part-Time, Online
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Learn about the fundamental mechanisms behind transformer architectures in this technical lecture from MIT professor Ankur Moitra, delivered as part of the Simons Institute's Special Year on Large Language Models and Transformers Boot Camp. Dive deep into the inner workings of transformer models, exploring their key components, architectural design principles, and the mathematical foundations that make them so effective for natural language processing tasks. Gain valuable insights into attention mechanisms, positional encodings, and the overall structure that has made transformers the backbone of modern language models.
Syllabus
How Do Transformers Work?
Taught by
Simons Institute