mcp-tool-bench/MCPToolBenchPP

MCPToolBench++ MCP Model Context Protocol Tool Use Benchmark on AI Agent and Model Tool Use Ability

/ 100

Emerging

Comprehensive benchmark for evaluating LLM tool-use capabilities across 45+ MCP server categories (browser automation, file systems, search, maps, payments, finance) with 4k+ instances covering single and multi-step tool calls. Evaluation uses standardized metrics (AST and Pass@K) with an LLM-as-judge approach, supporting major models like GPT-4o, Qwen, and Claude across multilingual scenarios. Integrates with MCP ecosystem servers and the OneKey MCP Router for simplified API access to commercial services like Google Maps and Perplexity.

No License No Package No Dependents

Maintenance 6 / 25

Adoption 7 / 25

Maturity 7 / 25

Community 17 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Featured in

Your Agent Doesn't Have an Email Address (Yet)

Higher-rated alternatives

toolsdk-ai/toolsdk-mcp-registry

MCPSDK.dev(ToolSDK.ai)'s Awesome MCP Servers and Packages Registry and Database with Structured...

Dicklesworthstone/mcp_agent_mail

Asynchronous coordination layer for AI coding agents: identities, inboxes, searchable threads,...

last9/last9-mcp-server

Last9 MCP Server

burugo/one-mcp

A centralized reverse-proxy platform for MCP servers — manage, group, and export as Skills from...

LSTM-Kirigaya/openmcp-client

All in one vscode plugin for mcp developer

Explore MCP Servers

All categories Trending MCP Server directory Insights