rllm-org/rllm

Democratizing Reinforcement Learning for LLMs

71
/ 100
Verified

Provides automatic LLM call tracing through a decorator pattern, with a dual-backend training system supporting both distributed GPU training (verl) and single-machine setups (tinker). Implements a unified agent pipeline that transparently captures token IDs and logprobs via LiteLLM proxy, enabling reward computation and policy updates using algorithms like GRPO and REINFORCE without modifying agent code.

5,219 stars. Actively maintained with 104 commits in the last 30 days.

No Package No Dependents
Maintenance 25 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 20 / 25

How are scores calculated?

Stars

5,219

Forks

515

Language

Python

License

Apache-2.0

Last pushed

Mar 13, 2026

Commits (30d)

104

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/rllm-org/rllm"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.