Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Scaling Policy Gradients for Reinforcement Learning in Robotics - Part 1

Montreal Robotics via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
This lecture from the Montreal Robotics series explores reinforcement learning (RL) in robotics contexts, with a deep focus on policy gradients and their real-world applications. Learn about autonomous cleaning robots as practical examples of scaled RL implementation. Discover how policy gradients optimize reward functions without explicitly modeling environmental dynamics, and examine different policy distributions including cross-entropy for discrete actions and Gaussian distributions for continuous actions through interactive demonstrations. Gain insights into the mathematical foundations of policy gradients, understanding their sampling-based approach and how policy parameters are optimized to maximize expected returns. The lecture includes a comprehensive walkthrough of a robotics dataset homework assignment using Google Colab, covering essential techniques for data processing, standardization, and model training, with specific guidance on challenges like action dimension scaling. Access the accompanying materials on GitHub to practice implementing these concepts.

Syllabus

RobotLearning: Scaling PolicyGradients Part 1

Taught by

Montreal Robotics

Reviews

Start your review of Scaling Policy Gradients for Reinforcement Learning in Robotics - Part 1

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.