Add TRL example notebook to RLHF docs (#26346)

Signed-off-by: sergiopaniego <sergiopaniegoblanco@gmail.com>
This commit is contained in:
Sergio Paniego Blanco 2025-10-07 13:31:28 +02:00 committed by GitHub
parent e1098ced95
commit 883b42896a
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194

View File

@ -12,4 +12,5 @@ See the following basic examples to get started if you don't want to use an exis
See the following notebooks showing how to use vLLM for GRPO:
- [Efficient Online Training with GRPO and vLLM in TRL](https://huggingface.co/learn/cookbook/grpo_vllm_online_training)
- [Qwen-3 4B GRPO using Unsloth + vLLM](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen3_(4B)-GRPO.ipynb)