Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Reinforcement Learning 101

CodeEmporium via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Learn the fundamentals of reinforcement learning through this comprehensive video series covering essential concepts from basic elements to advanced techniques. Explore multi-armed bandits as an introduction to reinforcement learning principles, then progress to Markov Decision Processes for systematic problem-solving approaches. Master the Bellman Equation and its applications in value function optimization. Dive into temporal difference learning as the foundation for Q-learning algorithms, followed by detailed explanations of Q-learning implementation and the distinctions between on-policy and off-policy algorithms. Understand Monte Carlo methods in reinforcement learning contexts and advance to Deep Q-Networks for handling complex state spaces. Discover Proximal Policy Optimization, the algorithm powering ChatGPT, and conclude with Reinforcement Learning through Human Feedback (RLHF) techniques that enable AI systems to learn from human preferences and feedback.

Syllabus

Elements of Reinforcement Learning
Multi Armed Bandits - Reinforcement Learning Explained!
How to solve problems with Reinforcement Learning | Markov Decision Process
Bellman Equation - Explained!
Foundation of Q-learning | Temporal Difference Learning explained!
Q-learning - Explained!
Reinforcement Learning: on-policy vs off-policy algorithms
Monte Carlo in Reinforcement Learning
Deep Q-Networks Explained!
Proximal Policy Optimization | ChatGPT uses this
Reinforcement Learning through Human Feedback - EXPLAINED! | RLHF

Taught by

CodeEmporium

Reviews

Start your review of Reinforcement Learning 101

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.