opendilab/awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

/ 100

Established

Organizes research papers, codebases, datasets, and blogs spanning foundational RLHF techniques (Inverse Reinforcement Learning, Apprenticeship Learning, Interactive Machine Learning) through modern LLM alignment applications. Tracks papers chronologically from 2020-present with structured metadata including authors, keywords, code repositories, and experimental environments, enabling systematic tracking of RLHF research frontiers.

4,325 stars.

No Package No Dependents

Maintenance 6 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 18 / 25

How are scores calculated?

Stars

4,325

Forks

250

Language

—

License

Apache-2.0

Related tools

hud-evals/hud-python

OSS RL environment + evals toolkit

hiyouga/EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

OpenRL-Lab/openrl

Unified Reinforcement Learning Framework

sail-sg/oat

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning,...

NVlabs/GDPO

Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for...

Explore LLM Tools

All categories Trending LLM Tool directory Insights