Overview
Watch a deep reinforcement learning presentation on Model-Based Offline Policy Optimization (MOPO), delivered by Tengyu Ma of Stanford University at the Simons Institute. The talk covers distributional domain shift, answer identification, and improved sketch techniques in the context of offline reinforcement learning, where effective policies must be learned from pre-collected datasets without direct interaction with the environment.
Syllabus
Introduction
Workshop Overview
Presentation
Distributional Domain Shift
Answer Identification
Improved Sketch
Summary
Discussion
Taught by
Simons Institute