The Fastest Way to Become a Backend Developer Online
Most AI Pilots Fail to Scale. MIT Sloan Teaches You Why — and How to Fix It
Overview
Syllabus
Intro
Sequential Decision Making in Complex Environments
Data from the Existing Simulation Environments
Procedural Generation of New Environments
Benchmarking RL Generalization
Benchmarking Safe Reinforcement Learning
Benchmarking Multi-Agent Reinforcement Learning
Real2Sim: Learning to generate traffic scenarios
Pretraining Policy Representation with Real World Data
Self-supervised Learning through Contrastive Learning
Policy Pretraining with Human Actions
Action-conditioned Contrastive Learning
Pretrained Representation for Imitation Learning
Human-in-the-loop Reinforcement Learning
Human-Al Copilot Optimization (HACO)
Demo Video: Learning to drive in CARLA environment
Policy Dissection through Frequency Analysis
Taught by
Bolei Zhou