OSU-NLP-Group/TravelPlanner
[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
A multi-constraint planning benchmark with two evaluation modes: a two-stage tool-use mode, in which agents search for information before planning, and a sole-planning mode that provides pre-gathered data. Planning strategies include direct prompting, chain-of-thought, ReAct, and Reflexion. The environment includes a structured database (JSON format), GPT-4-based postprocessing that converts natural-language plans to JSON, and evaluation metrics that assess adherence to environment, commonsense, and hard constraints. Supports multiple LLM backends (GPT-3.5/4, Gemini, Mistral, Mixtral), with fine-tuned model variants available via HuggingFace and LLaMA-Factory integration.
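To make the three constraint categories concrete, here is a minimal sketch of how per-constraint check results might be rolled up into per-category verdicts. It is not the repository's evaluation code: the `ConstraintResult` type, the `summarize` function, and the example constraint names are all hypothetical, and the rule that a plan passes overall only when every category passes is an assumption made for illustration.

```python
from dataclasses import dataclass

# Category labels taken from the description above.
CATEGORIES = ("environment", "commonsense", "hard")


@dataclass
class ConstraintResult:
    """Outcome of one constraint check (hypothetical structure)."""
    category: str  # one of CATEGORIES
    name: str      # illustrative constraint name, not from the benchmark
    passed: bool


def summarize(results):
    """Return a pass flag per category plus an overall 'final' flag.

    Assumption: a category passes only if all of its checks pass, and the
    plan passes overall only if every category passes. A category with no
    checks counts as passed, because all() of an empty sequence is True.
    """
    summary = {}
    for category in CATEGORIES:
        checks = [r for r in results if r.category == category]
        summary[category] = all(r.passed for r in checks)
    summary["final"] = all(summary[c] for c in CATEGORIES)
    return summary


example = [
    ConstraintResult("environment", "entities_exist_in_database", True),
    ConstraintResult("commonsense", "no_repeated_restaurant", True),
    ConstraintResult("hard", "within_budget", False),
]
print(summarize(example))
```

In this example the plan satisfies the environment and commonsense checks but fails the single hard constraint, so the overall flag is false.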
Stars: 491
Forks: 73
Language: Python
License: MIT
Last pushed: Nov 07, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/OSU-NLP-Group/TravelPlanner"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
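The same request can be issued from Python using only the standard library. This is a sketch under stated assumptions: the helper names `build_url` and `fetch_quality` are hypothetical, the URL pattern is inferred from the single curl example above, and the response is assumed to be JSON, which the listing does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def build_url(owner, repo):
    """Build the endpoint URL for a repository (pattern inferred from the curl example)."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner, repo, timeout=10.0):
    """Fetch the record for a repository.

    Assumption: the endpoint returns a JSON body. No API key is sent, so
    the keyless daily limit mentioned above applies.
    """
    with urllib.request.urlopen(build_url(owner, repo), timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    print(build_url("OSU-NLP-Group", "TravelPlanner"))
```

Calling `fetch_quality("OSU-NLP-Group", "TravelPlanner")` performs the network request; it is kept out of the example's main path so the block runs offline.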
Related tools
LAMDASZ-ML/ChinaTravel
ChinaTravel: A Real-World Benchmark for Language Agents in Chinese Travel Planning
1937983507/ai-tourism
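ai-tourism is an intelligent travel planning system whose backend is built on Spring Boot, LangChain4j, MySQL, MyBatis, Sa-Token, and related technologies, integrating multiple AI capabilities (such as AI...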
AdritPal08/TravelPlanner-CrewAi-Agents-Streamlit
Generate personalized travel itineraries based on user preferences.
YihongT/ITINERA
[EMNLP 2024 Industry Track & KDD UrbComp 2024 Best Paper Award] ITINERA: Integrating Spatial...
RobertoCorti/gptravel
Travel planning Streamlit web-app based on OpenAI API