TransformerFAM and BSWA: Understanding Feedback Attention Memory and Block Sliding Window Attention
Discover AI via YouTube
AI Engineer - Learn how to integrate AI into software applications
50% OFF: In-Depth AI & Machine Learning Course
Overview
Syllabus
3 videos on infinity context length
Visualization of new transformerFAM
Pseudocode for two new transformer
Basics of Attention calculations
TransformerBSWA - Block Sliding Window Attention
TransformerFAM - Feedback Attention Memory
Symmetries in operational feedback code
Time series visualization of new FAM and BSWA
Outlook on Reasoning w/ TransformerFAM
Taught by
Discover AI