Small Language Models - When and When NOT to Use Them + Mistral 3.1 & Gemma-3 Comparison

Small Language Models - When and When NOT to Use Them + Mistral 3.1 & Gemma-3 Comparison

Oxen via YouTube Direct link

34:45 o3-mini, Mistral Small-3.1, and Gemma-3’s Eval on Rust

15 of 19

15 of 19

34:45 o3-mini, Mistral Small-3.1, and Gemma-3’s Eval on Rust

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Small Language Models - When and When NOT to Use Them + Mistral 3.1 & Gemma-3 Comparison

Automatically move to the next video in the Classroom when playback concludes

  1. 1 0:00 Welcome to Arxiv Dive
  2. 2 1:12 $ whois
  3. 3 1:59 $ whoami
  4. 4 3:18 What is Oxen.ai
  5. 5 4:24 Lets Talk Smol Lms4:35 Benefits of Smol Lms
  6. 6 6:33 When Not to Use Smol LMs
  7. 7 7:47 What is a Data Flywheel
  8. 8 9:01 Why Smol LMs Are Important Now
  9. 9 13:42 Did I Use a Framework for SFT or RL
  10. 10 14:09 Only Your Data and Criteria Matters
  11. 11 16:18 Gemma-3 vs. Mistral-3.1 Evals
  12. 12 16:41 How to Evaluate a Model
  13. 13 26:49 o3-mini, Mistral Small-3.1, and Gemma-3 on SimpleQA
  14. 14 28:17 Training a Model to Program in Rust
  15. 15 34:45 o3-mini, Mistral Small-3.1, and Gemma-3’s Eval on Rust
  16. 16 38:17 Questions
  17. 17 43:36 What About Smol Multimodal Models?
  18. 18 48:56 Test a Homemade Phi-4 Multimodal Chatbot
  19. 19 58:45 QR Code for Free Compute Credits

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.