NVlabs/ToolOrchestra

ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.

/ 100

Established

Trains small orchestrator models via multi-turn reinforcement learning to coordinate diverse tools and specialist LLMs, optimizing jointly for outcome, efficiency, and preference rewards. Includes automatic synthetic data generation pipeline (ToolScale dataset) for scaling RL training across complex multi-step tasks. Integrates with web search, code interpreters, and multiple expert models (GPT-4o, Claude, Llama variants) through unified tool-calling interface, evaluated on GAIA, HLE, FRAMES, and τ²-Bench benchmarks.

677 stars. Actively maintained with 2 commits in the last 30 days.

No Package No Dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 13 / 25

Community 21 / 25

How are scores calculated?

Stars

677

Forks

Language

Python

License

Apache-2.0

Related agents

Pipelex/pipelex

Declarative language for composable Al workflows. Devtool for agents and mere humans.

lobehub/lobehub

The ultimate space for work and life — to find, build, and collaborate with agent teammates that...

lemony-ai/cascadeflow

Cascading runtime for AI agents. Optimize cost, latency, quality, and policy decisions inside...

strands-agents/sdk-typescript

A model-driven approach to building AI agents in just a few lines of code.

agents-flex/agents-flex

Agents-flex is A Lightweight Java AI Application Development Framework.

Explore AI Agents

All categories Trending AI Agent directory Insights