AMAP-ML/Tree-GRPO

[ICLR 2026] Tree Search for LLM Agent Reinforcement Learning

/ 100

Emerging

This project helps AI researchers and developers improve how large language models (LLMs) answer complex questions. By using a 'tree-search' approach instead of simpler methods, it makes LLM agents more accurate and efficient. You input an LLM and question-answering datasets, and it outputs a more capable, optimized LLM agent for various QA tasks.

304 stars.

Use this if you are developing or fine-tuning LLM agents for complex question-answering and want to achieve better performance with fewer computational resources.

Not ideal if you are looking for a plug-and-play solution for basic LLM applications or do not have experience with reinforcement learning and LLM agent training.

LLM-agent-training reinforcement-learning natural-language-processing AI-research question-answering-systems

No Package No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 15 / 25

Community 14 / 25

How are scores calculated?

Stars

304

Forks

Language

Python

License

Apache-2.0

Related models

xrsrke/toolformer

Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools

Auromix/ROS-LLM

ROS-LLM is a framework designed for embodied intelligence applications in ROS. It allows natural...

xingyaoww/code-act

Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao...

MozerWang/AMPO

[ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents

WxxShirley/GNN4TaskPlan

[NeurIPS 2024] Official implementation for paper "Can Graph Learning Improve Planning in...

Explore Transformer Models

All categories Trending Transformer directory Insights