OSU-NLP-Group/TravelPlanner

[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

52
/ 100
Established

Comprises a multi-constraint planning benchmark with three evaluation modes: a two-stage tool-use setup where agents search for information before planning, a sole-planning mode providing pre-gathered data, and strategy variants (direct, chain-of-thought, ReAct, reflexion). The environment includes a structured database (JSON format), GPT-4-based postprocessing for natural language-to-JSON conversion, and comprehensive evaluation metrics that assess adherence to environment, commonsense, and hard constraints. Supports multiple LLM backends (GPT-3.5/4, Gemini, Mistral, Mixtral) with fine-tuned model variants available via HuggingFace and LLaMA-Factory integration.

491 stars.

No Package No Dependents
Maintenance 6 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 20 / 25

How are scores calculated?

Stars

491

Forks

73

Language

Python

License

MIT

Last pushed

Nov 07, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/OSU-NLP-Group/TravelPlanner"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.