N-Step Temporal Difference Learning with Optimal n
International Centre for Theoretical Sciences via YouTube
Learn Backend Development Part-Time, Online
Build the Finance Skills That Lead to Promotions — Not Just Certificates
Overview
Google, IBM & Meta Certificates — All 10,000+ Courses at 40% Off
One annual plan covers every course and certificate on Coursera. 40% off for a limited time.
Get Full Access
Explore n-step temporal difference learning algorithms and discover methods for determining the optimal value of n in this 57-minute conference talk. Learn about advanced reinforcement learning techniques that bridge the gap between Monte Carlo methods and one-step temporal difference learning. Understand how varying the number of steps n affects learning performance and convergence properties in temporal difference algorithms. Examine theoretical foundations and practical considerations for selecting the optimal n parameter to maximize learning efficiency. Gain insights into the mathematical framework underlying n-step methods and their applications in various reinforcement learning scenarios. Discover how optimal n selection can improve sample efficiency and reduce variance in value function estimation, making this essential knowledge for researchers and practitioners working with temporal difference learning algorithms in machine learning and artificial intelligence applications.
Syllabus
N-Step Temporal Difference Learning with Optimal n by Shalabh Bhatnagar
Taught by
International Centre for Theoretical Sciences