rllm-org/rllm
Democratizing Reinforcement Learning for LLMs
71
/ 100
Verified
Provides automatic LLM call tracing through a decorator pattern, with a dual-backend training system supporting both distributed GPU training (verl) and single-machine setups (tinker). Implements a unified agent pipeline that transparently captures token IDs and logprobs via LiteLLM proxy, enabling reward computation and policy updates using algorithms like GRPO and REINFORCE without modifying agent code.
5,219 stars. Actively maintained with 104 commits in the last 30 days.
No Package
No Dependents
Maintenance
25 / 25
Adoption
10 / 25
Maturity
16 / 25
Community
20 / 25
Stars
5,219
Forks
515
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 13, 2026
Commits (30d)
104
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/rllm-org/rllm"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.