Page 2 - 50+ RLHF Online Courses for 2026 | Explore Free Courses & Certifications

Yacine Mahdid

Exploring GRPO Through the RAFT Algorithm - RLHF and RLVR

LLMs from Scratch - Practical Engineering from Base Model to PPO RLHF

Amazon Web Services; Amazon

AWS SimuLearn: Fine-Tune a Base Model with RLHF

Serrano.Academy

Reinforcement Learning for Large Language Models - RLHF, PPO, DPO, and GRPO

Career Certificate

LLM Application Engineering and Development Certification

Alberta Machine Intelligence Institute Career Certificate

Generative AI Essentials

Whizlabs

NVIDIA: Large Language Models and Generative AI Deployment

Career Certificate

Quick Start Guide to Large Language Models (LLMs)

Board Infinity

Mastering DeepSeek: From Architecture to Application

Quick Start Guide to Large Language Models (LLMs): Unit 3

Discover AI

Coding RLHF on LLama 2 with LoRA, 4-bit Quantization, TRL and DPO

Mastering ChatGPT (AI) and PowerPoint presentation

LLM Fine Tuning Fundamentals + Fine tune OpenAI GPT model

Discover AI

CriticGPT: Understanding RLHF and Force Sampling Beam Search Optimization

Generative AI on AWS

Generative AI and Large Language Models: From GPT-3 to ChatGPT with RLHF

