N-Step Temporal Difference Learning with Optimal n
International Centre for Theoretical Sciences via YouTube
Overview
AI, Data Science & Cloud Certificates from Google, IBM & Meta — 40% Off
One plan covers every Professional Certificate on Coursera. 40% off Coursera Plus Annual.
Unlock All Certificates
Explore n-step temporal difference learning algorithms and discover methods for determining the optimal value of n in this 57-minute conference talk. Learn about advanced reinforcement learning techniques that bridge the gap between Monte Carlo methods and one-step temporal difference learning. Understand how varying the number of steps n affects learning performance and convergence properties in temporal difference algorithms. Examine theoretical foundations and practical considerations for selecting the optimal n parameter to maximize learning efficiency. Gain insights into the mathematical framework underlying n-step methods and their applications in various reinforcement learning scenarios. Discover how optimal n selection can improve sample efficiency and reduce variance in value function estimation, making this essential knowledge for researchers and practitioners working with temporal difference learning algorithms in machine learning and artificial intelligence applications.
Syllabus
N-Step Temporal Difference Learning with Optimal n by Shalabh Bhatnagar
Taught by
International Centre for Theoretical Sciences