rllm-org/rllm

Democratizing Reinforcement Learning for LLMs

/ 100

Verified

Provides automatic LLM call tracing through a decorator pattern, with a dual-backend training system supporting both distributed GPU training (verl) and single-machine setups (tinker). Implements a unified agent pipeline that transparently captures token IDs and logprobs via a LiteLLM proxy, enabling reward computation and policy updates with algorithms such as GRPO and REINFORCE without modifying agent code.
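The two ideas in the description above, decorator-based call tracing and group-relative policy updates, can be illustrated with a minimal sketch. This is not rllm's actual API: `traced`, `TRACE`, `fake_llm_call`, and `group_relative_advantages` are hypothetical names invented for the example, and the advantage formula is the commonly described GRPO-style normalization (reward minus group mean, divided by group standard deviation), assumed here rather than taken from the project.

```python
import functools
import statistics

# Hypothetical in-memory trace store; a real system would persist these records.
TRACE = []


def traced(fn):
    """Record every call to fn, with its arguments and result, in TRACE."""
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        result = fn(*args, **kwargs)
        TRACE.append({
            "name": fn.__name__,
            "args": args,
            "kwargs": kwargs,
            "result": result,
        })
        return result
    return wrapper


@traced
def fake_llm_call(prompt):
    """Stand-in for a model call; token IDs and logprobs are made up."""
    tokens = prompt.split()
    return {
        "text": prompt.upper(),
        "token_ids": list(range(len(tokens))),
        "logprobs": [-0.1] * len(tokens),
    }


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize rewards within one group of sampled responses.

    Assumed GRPO-style form: (r - mean) / (std + eps), using the
    population standard deviation of the group.
    """
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)
    return [(r - mean) / (std + eps) for r in rewards]
```

The agent function itself stays unchanged apart from the decorator, which is the point of the pattern: the captured records (here, the fake token IDs and logprobs) can later be paired with rewards to compute advantages for a policy update.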

5,219 stars. Actively maintained with 104 commits in the last 30 days.

No Package · No Dependents

Maintenance 25 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25


Stars 5,219

Forks 515

Language Python

License Apache-2.0

Category reinforcement-learning-frameworks

Last pushed Mar 13, 2026

Commits (30d) 104


Reinforcement Learning Frameworks · 1 tool

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/rllm-org/rllm"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
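The same request can be made from Python with only the standard library. The sketch below reuses the endpoint from the curl command; the helper names `quality_url` and `fetch_quality` are hypothetical, and the assumption that the endpoint returns a JSON body is not confirmed by this page.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/rag"


def quality_url(owner, repo):
    """Build the quality endpoint URL for an owner/repo pair."""
    path = "/".join(urllib.parse.quote(part, safe="") for part in (owner, repo))
    return f"{BASE_URL}/{path}"


def fetch_quality(owner, repo, timeout=10.0):
    """Fetch the quality record; assumes the response body is JSON."""
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("rllm-org", "rllm")` performs the same GET as the curl command and counts against the daily request limit.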
