Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Stanford University

Stanford CS336 - Language Modeling from Scratch 2025

Stanford University via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore the complete process of building language models from the ground up in this comprehensive Stanford University course spanning over 22 hours. Master the fundamental concepts and practical skills needed to develop modern natural language processing systems by walking through every stage of language model creation. Begin with essential topics including tokenization, PyTorch implementation, and resource accounting before diving into advanced architectures, hyperparameter optimization, and mixture of experts models. Gain hands-on experience with GPU programming, kernel optimization using Triton, and various parallelism strategies for scaling model training. Understand the mathematical foundations through scaling laws and learn critical evaluation methodologies for assessing model performance. Develop expertise in data collection, preprocessing, and cleansing techniques essential for pre-training large language models. Conclude with advanced alignment techniques including supervised fine-tuning (SFT), reinforcement learning from human feedback (RLHF), and reinforcement learning approaches. Build practical skills in transformer model construction, distributed training, efficient inference optimization, and deployment strategies while gaining deep theoretical understanding of the principles underlying modern AI systems.

Syllabus

Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 1: Overview and Tokenization
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lec. 2: Pytorch, Resource Accounting
Stanford CS336 Lang. Modeling from Scratch | Spring 2025 | Lec. 3: Architectures, Hyperparameters
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 4: Mixture of experts
Stanford CS336 I Language Modeling from Scratch | Spring 2025 | Lecture 5: GPUs
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 6: Kernels, Triton
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 7: Parallelism 1
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 8: Parallelism 2
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 9: Scaling laws 1
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 10: Inference
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 11: Scaling laws 2
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 12: Evaluation
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 13: Data 1
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 14: Data 2
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 15: Alignment - SFT/RLHF
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 16: Alignment - RL 1
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 17: Alignment - RL 2

Taught by

Stanford Online

Reviews

Start your review of Stanford CS336 - Language Modeling from Scratch 2025

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.