opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
Organizes research papers, codebases, datasets, and blogs spanning foundational RLHF techniques (Inverse Reinforcement Learning, Apprenticeship Learning, Interactive Machine Learning) through modern LLM alignment applications. Tracks papers chronologically from 2020-present with structured metadata including authors, keywords, code repositories, and experimental environments, enabling systematic tracking of RLHF research frontiers.
4,325 stars.
Stars
4,325
Forks
250
Language
—
License
Apache-2.0
Category
Last pushed
Dec 09, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/opendilab/awesome-RLHF"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
hud-evals/hud-python
OSS RL environment + evals toolkit
hiyouga/EasyR1
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
OpenRL-Lab/openrl
Unified Reinforcement Learning Framework
sail-sg/oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning,...
NVlabs/GDPO
Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for...