AMAP-ML/Tree-GRPO
[ICLR 2026] Tree Search for LLM Agent Reinforcement Learning
This project helps AI researchers and developers improve how large language models (LLMs) answer complex questions. By using a 'tree-search' approach instead of simpler methods, it makes LLM agents more accurate and efficient. You input an LLM and question-answering datasets, and it outputs a more capable, optimized LLM agent for various QA tasks.
304 stars.
Use this if you are developing or fine-tuning LLM agents for complex question-answering and want to achieve better performance with fewer computational resources.
Not ideal if you are looking for a plug-and-play solution for basic LLM applications or do not have experience with reinforcement learning and LLM agent training.
Stars
304
Forks
25
Language
Python
License
Apache-2.0
Category
Last pushed
Jan 26, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/AMAP-ML/Tree-GRPO"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related models
xrsrke/toolformer
Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools
Auromix/ROS-LLM
ROS-LLM is a framework designed for embodied intelligence applications in ROS. It allows natural...
xingyaoww/code-act
Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao...
MozerWang/AMPO
[ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents
WxxShirley/GNN4TaskPlan
[NeurIPS 2024] Official implementation for paper "Can Graph Learning Improve Planning in...