AlphaGo - Mastering the Game of Go with Deep Neural Networks and Tree Search - RL Paper Explained
Aleksa Gordić - The AI Epiphany via YouTube
Learn Generative AI, Prompt Engineering, and LLMs for Free
Free courses from frontend to fullstack and AI
Overview
Syllabus
Intro
Context behind the game of Go
High-level overview of components - SL policies
RL policy network
The value network
Going deeper
Details around value network
Understanding the search MTCS
Evaluation of AlphaGo
Older techniques
Even more detailed explanation of APV-MTCS
Virtual loss
Engineering
Neural networks and symmetries
Taught by
Aleksa Gordić - The AI Epiphany