Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

A Bit of Freedom Goes a Long Way - Quantum and Classical Algorithms for Online Learning of MDPs under a Generative Model

Centre for Quantum Technologies via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore a conference talk presenting novel classical and quantum online algorithms for learning Markov Decision Processes (MDPs) in both finite-horizon and infinite-horizon average-reward settings. Discover how researchers Andris Ambainis, Joao F. Doriguello, and Debbie Huey Chih Lim developed algorithms based on a hybrid exploration-generative reinforcement learning model that allows agents to interact with environments through generative sampling or "simulator" access. Learn how these approaches avoid traditional reinforcement learning paradigms like "optimism in the face of uncertainty" and "posterior sampling" by computing and using optimal policies directly, resulting in superior regret bounds compared to previous work. Understand how the quantum algorithm for finite-horizon MDPs achieves regret bounds that depend only logarithmically on the number of time steps T, breaking the classical O(√T) barrier while improving dependence on state space size S and action space size A parameters. Examine the infinite-horizon MDP results where both classical and quantum bounds maintain Õ(√T) dependence but with enhanced S and A factors, and discover the novel regret measure for infinite-horizon MDPs that enables the quantum algorithm to achieve poly-logarithmic T regret, exponentially outperforming classical algorithms. Gain insights into how these results extend to compact continuous state spaces, presented at the Quantum Techniques in Machine Learning (QTML) 2025 conference in Singapore.

Syllabus

QTML 2025: A Bit of Freedom Goes a Long Way: Quantum and Classical Algorithms for Online Learning

Taught by

Centre for Quantum Technologies

Reviews

Start your review of A Bit of Freedom Goes a Long Way - Quantum and Classical Algorithms for Online Learning of MDPs under a Generative Model

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.