CommissarSilver/PrismBench

PrismBench: A comprehensive framework for evaluating Large Language Model capabilities through Monte Carlo Tree Search. Systematically maps model strengths, automatically discovers challenging concept combinations, and provides detailed performance analysis with containerized deployment and OpenAI-compatible API support.

/ 100

Experimental

No License No Package No Dependents

Maintenance 10 / 25

Adoption 3 / 25

Maturity 7 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

—

Higher-rated alternatives

rynfar/meridian

Use your Claude Max subscription with OpenCode. Proxy that bridges Anthropic's official SDK to...

a-tokyo/aiworkspace

🧑‍💻 Set up and manage AI agent skills and configs for Cursor, Claude Code, Codex and more across...

calesthio/OpenMontage

World's first open-source, agentic video production system. 11 pipelines, 49 tools, 400+ agent...

cruzyjapan/Gemini-CLI-UI

A responsive web-based UI that provides an intuitive interface for Google's Gemini CLI, enabling...

AntonOsika/gpt-engineer

CLI platform to experiment with codegen. Precursor to: https://lovable.dev

Explore AI Coding Tools

All categories Trending AI Coding directory Insights