Model Based Reinforcement Learning - Policy Iteration, Value Iteration, and Dynamic Programming
Steve Brunton via YouTube
Launch a New Career with Certificates from Google, IBM & Microsoft
Stuck in Tutorial Hell? Learn Backend Dev the Right Way
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore dynamic programming as a fundamental concept in model-based reinforcement learning. Delve into policy iteration and value iteration techniques, leading to an understanding of the quality function and Q-learning. Learn how these methods form the basis for solving reinforcement learning problems. Gain insights from examples and explanations provided in this 27-minute lecture, which is part of a comprehensive series on reinforcement learning based on the new Chapter 11 from the 2nd edition of "Data-Driven Science and Engineering: Machine Learning, Dynamical Systems, and Control" by Brunton and Kutz.
Syllabus
REINFORCEMENT LEARNING
VALUE FUNCTION
DYNAMIC PROGRAMMING!
VALUE ITERATION
POLICY ITERATION
QUALITY FUNCTION
Taught by
Steve Brunton