liuxiaotong/knowlyr-gym
Gymnasium-style RL framework for LLM agent training — MDP environments, three-layer process reward & SFT/DPO/GRPO policy optimization. CLI + MCP ready.
Stars
3
Forks
—
Language
Python
License
—
Category
Last pushed
Mar 15, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/liuxiaotong/knowlyr-gym"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
SALT-NLP/collaborative-gym
Framework and toolkits for building and evaluating collaborative agents that can work together...
ai4co/reevo
[NeurIPS 2024] ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution
Gen-Verse/LatentMAS
Latent Collaboration in Multi-Agent Systems
lean-dojo/LeanCopilot
LLMs as Copilots for Theorem Proving in Lean
sethkarten/LLM-Economist
Official repository of the 2025 paper, LLM Economist: Large Population Models and Mechanism...