Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Reinforcement Learning with LLMs - A New Era of AI Agents

Shaw Talebi via YouTube

Overview

Coursera Flash Sale
40% Off Coursera Plus for 3 Months!
Grab it
Explore three cutting-edge approaches to training large language models and AI agents using reinforcement learning in this 21-minute educational video. Learn how reinforcement learning fundamentals apply to modern LLM training, starting with an overview of how LLMs are traditionally trained before diving into three specific methodologies. Discover Reinforcement Learning from Human Feedback (RLHF), which uses human preferences to guide model behavior, followed by Reinforcement Learning from AI Feedback (RLAIF), where AI systems provide the training signals instead of humans. Examine Reinforcement Learning from Verifiable Rewards (RLVR), a newer approach that leverages verifiable outcomes for training. Understand the current limitations of these approaches and gain insights into future developments in the field. The presentation includes detailed timestamps for easy navigation and references to recent research papers, making it valuable for both beginners seeking to understand RL applications in AI and practitioners looking to implement these techniques.

Syllabus

Introduction -
Reinforcement Learning RL -
RL with LLMs -
How LLMs are Trained -
3 Ways to RL with LLMs -
Way 1: RLHF -
Way 2: RLAIF -
Way 3: RLVR -
Limitations -
What's Next? -

Taught by

Shaw Talebi

Reviews

Start your review of Reinforcement Learning with LLMs - A New Era of AI Agents

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.