Completed
CS885 Lecture 1a: Course Introduction
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Reinforcement Learning - Spring 2018
Automatically move to the next video in the Classroom when playback concludes
- 1 CS885 Lecture 1a: Course Introduction
- 2 CS885 Lecture 1b: Markov Processes
- 3 CS885 Lecture 2a: Markov Decision Processes
- 4 CS885 Lecture 2b: Value Iteration
- 5 CS885 Lecture 3a: Policy Iteration
- 6 CS885 Lecture 3b: Introduction to RL
- 7 CS885 Lecture 4a: Deep Neural Networks
- 8 CS885 Lecture 4b: Deep Q-Networks
- 9 CS885 Lecture 5: Conversational Agents (Nabiha Asghar)
- 10 CS885 Lecture 6a: OpenAI Environments (Mike Rudd)
- 11 CS885 Lecture 6b: DQN and TensorFlow (Timmy Tse)
- 12 CS885 Lecture 7a: Policy Gradient
- 13 CS885 Lecture 7b: Actor Critic
- 14 CS885 Lecture 8a: Multi-armed bandits
- 15 CS885 Lecture 8b: Bayesian and Contextual Bandits
- 16 CS885 Lecture 9: Model-based RL
- 17 CS885 Lecture 10: Bayesian RL
- 18 CS885 Lecture 11a: Hidden Markov Models
- 19 CS885 Lecture 11b: Partially Observable RL
- 20 CS885 Lecture 12: Deep Recurrent Q-Networks
- 21 CS885 Lecture 13a: Playing FPS Games with Deep RL (presenter: Mark Iwanchyshyn)
- 22 CS885 Lecture 13b: Lifelong Learning in Minecraft (Presenter: Yetian Wang)
- 23 CS885 Lecture 13c: Adversarial Search
- 24 CS885 Lecture 14a: Mastering the Game of Go (Presenter: Henry Chen)
- 25 CS885 Lecture 14b: Mastering Chess and Shogi (Presenter: Kira Selby)
- 26 CS885 Lecture 14c: Trust Region Methods
- 27 CS885 Lecture 15a: Trust Region Policy Optimization (Presenter: Shivam Kalra)
- 28 CS885 Lecture 15b: Proximal Policy Optimization (Presenter: Ruifan Yu)
- 29 CS885 Lecture 15c: Semi-Markov Decision Processes
- 30 CS885 Lecture 16a: The Option-Critic Architecture (Presenter: Zebin Kang)
- 31 CS885 Lecture 16b: FeUdal Networks for Hierarchical RL (Presenter: Rene Bidart)
- 32 CS885 Lecture 17a: Target-Driven Visual Navigation (Presenter: James Cagalawan)
- 33 CS885 Lecture 17b: Control of a Quadrotor (Presenter Nicole McNabb)
- 34 CS885 Lecture17c: Inverse Reinforcement Learning
- 35 CS885 Lecture 18a: Safe multi-agent RL for autonomous driving (Presenter: Ashish Gaurav)
- 36 CS885 Lecture 19a: End-to-end LSTM based dialog control (Presenter: Hamidreza Shahidi)
- 37 CS885 Lecture 19b: Learning cooperative visual dialog agents (Presenter: Nalin Chhibber)
- 38 CS885 Lecture 19c: Memory Augmented Networks
- 39 CS885 Lecture 20a: Neural map: structured memory for deep RL (Presenter: Andreas Stöckel)
- 40 CS885 Lecture 20b: Memory augmented control networks (Presenter: Aravind Balakrishnan)
- 41 CS885 Lecture 18b: Learning Driving Styles for Autonomous Vehicles (Presenter: Marko Ilievski)