CommissarSilver/PrismBench
PrismBench: A comprehensive framework for evaluating Large Language Model capabilities through Monte Carlo Tree Search. Systematically maps model strengths, automatically discovers challenging concept combinations, and provides detailed performance analysis with containerized deployment and OpenAI-compatible API support.
Stars
3
Forks
—
Language
Python
License
—
Category
Last pushed
Mar 06, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ai-coding/CommissarSilver/PrismBench"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
rynfar/meridian
Use your Claude Max subscription with OpenCode. Proxy that bridges Anthropic's official SDK to...
a-tokyo/aiworkspace
🧑💻 Set up and manage AI agent skills and configs for Cursor, Claude Code, Codex and more across...
calesthio/OpenMontage
World's first open-source, agentic video production system. 11 pipelines, 49 tools, 400+ agent...
cruzyjapan/Gemini-CLI-UI
A responsive web-based UI that provides an intuitive interface for Google's Gemini CLI, enabling...
AntonOsika/gpt-engineer
CLI platform to experiment with codegen. Precursor to: https://lovable.dev