the-crypt-keeper/can-ai-code

Self-evaluating interview for AI coders

42 / 100 (Emerging)

Instead of relying on fixed test cases, the benchmark uses parametric difficulty scaling to generate unlimited unique problems along two dimensions (length and depth), and it measures how far each model climbs the difficulty ramp rather than recording a binary pass/fail. The framework reports three metrics: height (maximum difficulty achieved), efficiency (tokens consumed), and constrained performance (capability under resource limits). Together these reveal distinct cognitive fingerprints across model families, for example OpenAI's reasoning strength versus Llama's balanced baseline. Benchmarks automatically escalate difficulty when models cluster near ceiling performance, which preserves their power to discriminate as capabilities advance, and the approach supports any domain where difficulty can be parameterized.
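The ramp idea can be illustrated with a minimal sketch. This is not the repository's code: the nested-sum task, the names `make_problem` and `climb`, the mapping from level to (length, depth), and the 80% pass threshold are all assumptions made for illustration. It covers only the height and efficiency metrics and does not model constrained performance.

```python
import random
from typing import Callable, Tuple

# Hypothetical solver interface: prompt -> (answer, tokens used).
Solver = Callable[[str], Tuple[int, int]]

def make_problem(length: int, depth: int, rng: random.Random) -> Tuple[str, int]:
    """Build a nested sum: `length` operands per group, `depth` nesting levels."""
    if depth == 0:
        n = rng.randint(1, 9)
        return str(n), n
    parts = [make_problem(length, depth - 1, rng) for _ in range(length)]
    text = "(" + " + ".join(p[0] for p in parts) + ")"
    return text, sum(p[1] for p in parts)

def climb(solver: Solver, max_level: int = 4, trials: int = 5,
          pass_rate: float = 0.8, seed: int = 0) -> Tuple[int, int]:
    """Return (height, tokens): highest level passed and total tokens spent."""
    rng = random.Random(seed)
    height, tokens = 0, 0
    for level in range(1, max_level + 1):
        correct = 0
        for _ in range(trials):
            # Assumed ramp: both dimensions grow together with the level.
            prompt, expected = make_problem(level + 1, level, rng)
            answer, used = solver(prompt)
            tokens += used
            correct += answer == expected
        if correct / trials < pass_rate:
            break  # the model stopped climbing; height stays at the last pass
        height = level
    return height, tokens
```

Because problems are drawn from a seeded generator rather than a fixed list, raising `max_level` extends the ramp without writing new test cases, which is the property the description attributes to the benchmark.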

602 stars. No commits in the last 6 months.

Stale 6m · No Package · No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 14 / 25

Stars: 602
Forks: 34
Language: Python
License: MIT
Last pushed: Jun 21, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/the-crypt-keeper/can-ai-code"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
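The same request can be made from Python. The sketch below uses only the standard library and rests on two assumptions: that the endpoint returns JSON (the response schema is not documented here, so the result is returned unparsed beyond decoding), and that the `transformers` path segment is a category that may differ for other repositories. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(owner: str, repo: str, category: str = "transformers") -> str:
    """Build the endpoint URL; `category` mirrors the path segment in the curl example."""
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return BASE_URL + "/" + "/".join(parts)

def fetch_quality(owner: str, repo: str, category: str = "transformers",
                  timeout: float = 10.0):
    """Fetch the quality record. Assumes a JSON body; raises on HTTP errors."""
    with urllib.request.urlopen(quality_url(owner, repo, category),
                                timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_quality("the-crypt-keeper", "can-ai-code")` issues one request, so a script that polls many repositories without a key should stay under the 100 requests/day limit.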