MOPO - Model-Based Offline Policy Optimization

Simons Institute via YouTube

Details

Provider: YouTube
Pricing: Free Video
Languages: English
Effort: 38 minutes
Sessions: Self-Paced
Level: Advanced


A deep reinforcement learning talk on Model-Based Offline Policy Optimization (MOPO), delivered by Tengyu Ma of Stanford University at the Simons Institute. Topics include distributional domain shift, answer identification, and improved sketch techniques, presented in the context of offline reinforcement learning. The talk addresses the challenges of learning effective policies from pre-collected datasets, without direct interaction with the environment, and the solutions MOPO offers.