Attribution Graphs - Edge Weights and Pruning

UofU Data Science via YouTube

Overview

Explore advanced techniques for understanding large language model interpretability through this university lecture focusing on attribution graphs, edge weights, and automated circuit extraction methods. Learn how to analyze the internal mechanisms of LLMs by examining edge weights in neural network architectures and discover methods for automatically extracting extremely sparse feature circuits that reveal how these models process information. Delve into the mathematical foundations of attribution graphs and their role in making black-box language models more transparent and interpretable. Master techniques for pruning neural networks while maintaining performance, and understand how vignettes can be used to visualize and comprehend the sparse circuits that emerge from automated extraction processes.

Syllabus

Edge weights
Automatically extracting extremely-sparse feature circuits vignettes

Taught by

UofU Data Science
