N-Step Temporal Difference Learning with Optimal n
International Centre for Theoretical Sciences via YouTube
Get 50% Off Udacity Nanodegrees — Code CC50
Gain a Splash of New Skills - Coursera+ Annual Nearly 45% Off
Overview
Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore n-step temporal difference learning algorithms and discover methods for determining the optimal value of n in this 57-minute conference talk. Learn about advanced reinforcement learning techniques that bridge the gap between Monte Carlo methods and one-step temporal difference learning. Understand how varying the number of steps n affects learning performance and convergence properties in temporal difference algorithms. Examine theoretical foundations and practical considerations for selecting the optimal n parameter to maximize learning efficiency. Gain insights into the mathematical framework underlying n-step methods and their applications in various reinforcement learning scenarios. Discover how optimal n selection can improve sample efficiency and reduce variance in value function estimation, making this essential knowledge for researchers and practitioners working with temporal difference learning algorithms in machine learning and artificial intelligence applications.
Syllabus
N-Step Temporal Difference Learning with Optimal n by Shalabh Bhatnagar
Taught by
International Centre for Theoretical Sciences