Overview
Syllabus
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 1: Overview and Tokenization
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lec. 2: Pytorch, Resource Accounting
Stanford CS336 Lang. Modeling from Scratch | Spring 2025 | Lec. 3: Architectures, Hyperparameters
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 4: Mixture of experts
Stanford CS336 I Language Modeling from Scratch | Spring 2025 | Lecture 5: GPUs
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 6: Kernels, Triton
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 7: Parallelism 1
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 8: Parallelism 2
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 9: Scaling laws 1
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 10: Inference
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 11: Scaling laws 2
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 12: Evaluation
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 13: Data 1
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 14: Data 2
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 15: Alignment - SFT/RLHF
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 16: Alignment - RL 1
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 17: Alignment - RL 2
Taught by
Stanford Online