Reinforcement Learning and Fine-Tuning on TPUs - The Agent Factory Podcast

Google Cloud Tech via YouTube

Classroom Contents

  1. Introduction: Gemini 3 and the rise of TPUs
  2. Why fine-tune? Specialization and privacy
  3. What is fine-tuning? SFT and RL explained
  4. What is RL and why do we need it?
  5. The added value in RL
  6. Industry pulse: Why 2025 is the year of RL: DeepSeek-R1, Grok 4, Gemini 3
  7. The challenges of RL: Infrastructure, algorithms, and orchestration
  8. Factory floor: How TPUs are designed for scale
  9. [Demo] Reinforcement Learning (GRPO) with MaxText 2.0 on TPUs
  10. Scaling to 1000+ chips and season wrap-up
