How to Fine-tune LLMs with RLVR - OpenAI's RFT API

How to Fine-tune LLMs with RLVR - OpenAI's RFT API

Shaw Talebi via YouTube Direct link

Introduction -

1 of 13

1 of 13

Introduction -

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

How to Fine-tune LLMs with RLVR - OpenAI's RFT API

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Introduction -
  2. 2 RL with LLMs -
  3. 3 RLVR -
  4. 4 SFT vs RLVR -
  5. 5 Example: HDFS Classification with RLVR -
  6. 6 Step 0: Imports -
  7. 7 Step 1: Train-Validation Split -
  8. 8 Step 2: Format Data -
  9. 9 Step 3: Create Grader -
  10. 10 Step 4: Fine-tune Model -
  11. 11 Step 5: Evaluate Model -
  12. 12 Limitations -
  13. 13 What's Next? -

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.