Reinforcement Learning - Spring 2018

Reinforcement Learning - Spring 2018

Pascal Poupart via YouTube Direct link

CS885 Lecture 1a: Course Introduction

1 of 41

1 of 41

CS885 Lecture 1a: Course Introduction

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Reinforcement Learning - Spring 2018

Automatically move to the next video in the Classroom when playback concludes

  1. 1 CS885 Lecture 1a: Course Introduction
  2. 2 CS885 Lecture 1b: Markov Processes
  3. 3 CS885 Lecture 2a: Markov Decision Processes
  4. 4 CS885 Lecture 2b: Value Iteration
  5. 5 CS885 Lecture 3a: Policy Iteration
  6. 6 CS885 Lecture 3b: Introduction to RL
  7. 7 CS885 Lecture 4a: Deep Neural Networks
  8. 8 CS885 Lecture 4b: Deep Q-Networks
  9. 9 CS885 Lecture 5: Conversational Agents (Nabiha Asghar)
  10. 10 CS885 Lecture 6a: OpenAI Environments (Mike Rudd)
  11. 11 CS885 Lecture 6b: DQN and TensorFlow (Timmy Tse)
  12. 12 CS885 Lecture 7a: Policy Gradient
  13. 13 CS885 Lecture 7b: Actor Critic
  14. 14 CS885 Lecture 8a: Multi-armed bandits
  15. 15 CS885 Lecture 8b: Bayesian and Contextual Bandits
  16. 16 CS885 Lecture 9: Model-based RL
  17. 17 CS885 Lecture 10: Bayesian RL
  18. 18 CS885 Lecture 11a: Hidden Markov Models
  19. 19 CS885 Lecture 11b: Partially Observable RL
  20. 20 CS885 Lecture 12: Deep Recurrent Q-Networks
  21. 21 CS885 Lecture 13a: Playing FPS Games with Deep RL (presenter: Mark Iwanchyshyn)
  22. 22 CS885 Lecture 13b: Lifelong Learning in Minecraft (Presenter: Yetian Wang)
  23. 23 CS885 Lecture 13c: Adversarial Search
  24. 24 CS885 Lecture 14a: Mastering the Game of Go (Presenter: Henry Chen)
  25. 25 CS885 Lecture 14b: Mastering Chess and Shogi (Presenter: Kira Selby)
  26. 26 CS885 Lecture 14c: Trust Region Methods
  27. 27 CS885 Lecture 15a: Trust Region Policy Optimization (Presenter: Shivam Kalra)
  28. 28 CS885 Lecture 15b: Proximal Policy Optimization (Presenter: Ruifan Yu)
  29. 29 CS885 Lecture 15c: Semi-Markov Decision Processes
  30. 30 CS885 Lecture 16a: The Option-Critic Architecture (Presenter: Zebin Kang)
  31. 31 CS885 Lecture 16b: FeUdal Networks for Hierarchical RL (Presenter: Rene Bidart)
  32. 32 CS885 Lecture 17a: Target-Driven Visual Navigation (Presenter: James Cagalawan)
  33. 33 CS885 Lecture 17b: Control of a Quadrotor (Presenter Nicole McNabb)
  34. 34 CS885 Lecture17c: Inverse Reinforcement Learning
  35. 35 CS885 Lecture 18a: Safe multi-agent RL for autonomous driving (Presenter: Ashish Gaurav)
  36. 36 CS885 Lecture 19a: End-to-end LSTM based dialog control (Presenter: Hamidreza Shahidi)
  37. 37 CS885 Lecture 19b: Learning cooperative visual dialog agents (Presenter: Nalin Chhibber)
  38. 38 CS885 Lecture 19c: Memory Augmented Networks
  39. 39 CS885 Lecture 20a: Neural map: structured memory for deep RL (Presenter: Andreas Stöckel)
  40. 40 CS885 Lecture 20b: Memory augmented control networks (Presenter: Aravind Balakrishnan)
  41. 41 CS885 Lecture 18b: Learning Driving Styles for Autonomous Vehicles (Presenter: Marko Ilievski)

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.