Completed
intro
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Superposition in LLM Feature Representations
Automatically move to the next video in the Classroom when playback concludes
- 1 intro
- 2 preamble
- 3 mechanistic interpretability
- 4 neural network representations
- 5 qualities of representations
- 6 decomposability
- 7 linearity
- 8 linear composition as a compression scheme
- 9 demands of linearity
- 10 the linear representation puzzle
- 11 neuron - feature requirements
- 12 experience with llms
- 13 the superposition hypothesis
- 14 sparsity
- 15 recovering features in superposition
- 16 demands of linearity
- 17 feature exploration
- 18 thanks