0
Skip to Content
Serrano.Academy
Home
Learning Hub
About
Speaking
Contact
Serrano.Academy
Home
Learning Hub
About
Speaking
Contact
Home
Learning Hub
About
Speaking
Contact
Progress
Reinforcement Learning for LLMs
Complete & Continue Next Lesson Learn More
Chapter 1
5 Lessons
Lesson 1: Deep Reinforcement Learning and Policy Gradients
Lesson 2: Reinforcement Learning with Human Feedback (RLHF)
Lesson 3: Principal Policy Optimization (PPO)
Lesson 4: Direct Preference Optimization (DPO)
Lesson 5: Group Relative Policy Optimization (GRPO)
Reinforcement Learning for LLMs
Complete & Continue Next Lesson Learn More
Add a video
Chapter 1

Lesson 5: Group Relative Policy Optimization (GRPO)

Complete & Continue Next Lesson Learn More

LEARN

Learning Hub

Supervised Learning

Unsupervised Learning

Deep Learning

Large Language Models

Reinforcement Learning for LLMs

FOR ORGANIZATIONS

Speaking & Consulting

Let’s Connect

RESOURCES

Grokking Machine Learning Book

YouTube Channel

FOLLOW