Train an LLM from Scratch with Karpathy's Nanochat

Train an LLM from Scratch with Karpathy's Nanochat

Trelis Research via YouTube Direct link

0:00 Train an LLM from scratch with $100 Nanochat by Karpathy

1 of 12

1 of 12

0:00 Train an LLM from scratch with $100 Nanochat by Karpathy

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Train an LLM from Scratch with Karpathy's Nanochat

Automatically move to the next video in the Classroom when playback concludes

  1. 1 0:00 Train an LLM from scratch with $100 Nanochat by Karpathy
  2. 2 1:44 One-click Runpod Template: https://console.runpod.io/deploy?template=ikas3s2cii&ref=jmfkcdio
  3. 3 4:46 Running speedrun.sh script with wandb
  4. 4 7:55 Tokenizer Training Byte Pair Encoding, BPE
  5. 5 10:52 Pretraining on 10B tokens of fine-web edu
  6. 6 15:15 Pre-training evaluation and Mid-training on Smoltalk+
  7. 7 17:33 Instructions on pushing and pulling checkpoints to / from HuggingFace Hub
  8. 8 18:49 Instructions for running the chat interface / UI
  9. 9 21:15 Post-training on benchmark training/aux splits
  10. 10 23:11 Demo of running the chat interface
  11. 11 25:29 Pushing the model, optimizer, report and tokenizer to hub
  12. 12 28:20 Resources: Model and Repo

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.