CommissarSilver/PrismBench

PrismBench: A comprehensive framework for evaluating Large Language Model capabilities through Monte Carlo Tree Search. Systematically maps model strengths, automatically discovers challenging concept combinations, and provides detailed performance analysis with containerized deployment and OpenAI-compatible API support.

20 / 100
Experimental
No License, No Package, No Dependents
Maintenance 10 / 25
Adoption 3 / 25
Maturity 7 / 25
Community 0 / 25


Stars: 3
Forks: (not listed)
Language: Python
License: (not listed)
Category: code-editor
Last pushed: Mar 06, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ai-coding/CommissarSilver/PrismBench"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
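
The same request can be made from Python. The following is a minimal sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, the `collection` parameter is just a label for the `ai-coding` path segment seen in the curl command, and the assumption that the endpoint returns a JSON body is not confirmed by this page.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(owner: str, repo: str, collection: str = "ai-coding") -> str:
    """Build the quality-score URL for one repository.

    `collection` names the path segment that precedes the owner in the
    curl example ("ai-coding"); what it represents is an assumption.
    """
    # Percent-encode each segment so unusual characters cannot break the path.
    parts = [quote(part, safe="") for part in (collection, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(owner: str, repo: str, collection: str = "ai-coding",
                  timeout: float = 10.0):
    """Fetch the quality data and decode it.

    Assumes (unverified) that the response body is JSON. No API key is
    sent, so this falls under the keyless daily limit mentioned above.
    """
    url = build_quality_url(owner, repo, collection)
    request = urllib.request.Request(url, headers={"Accept": "application/json"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("CommissarSilver", "PrismBench")` issues the same GET request as the curl command. Because keyless access is limited to 100 requests per day, it may be worth caching the result rather than calling it in a loop.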