NVlabs/ToolOrchestra

ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.

57
/ 100
Established

Trains small orchestrator models via multi-turn reinforcement learning to coordinate diverse tools and specialist LLMs, optimizing jointly for outcome, efficiency, and preference rewards. Includes automatic synthetic data generation pipeline (ToolScale dataset) for scaling RL training across complex multi-step tasks. Integrates with web search, code interpreters, and multiple expert models (GPT-4o, Claude, Llama variants) through unified tool-calling interface, evaluated on GAIA, HLE, FRAMES, and τ²-Bench benchmarks.

677 stars. Actively maintained with 2 commits in the last 30 days.

No Package No Dependents
Maintenance 13 / 25
Adoption 10 / 25
Maturity 13 / 25
Community 21 / 25

How are scores calculated?

Stars

677

Forks

92

Language

Python

License

Apache-2.0

Last pushed

Jan 27, 2026

Commits (30d)

2

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/NVlabs/ToolOrchestra"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.