the-crypt-keeper/can-ai-code

Self-evaluating interview for AI coders

/ 100

Emerging

Parametric difficulty scaling generates unlimited unique problems across two dimensions (length and depth) rather than fixed test cases, measuring how far each model climbs a difficulty ramp instead of binary pass/fail. The framework evaluates three metrics—height (max difficulty achieved), efficiency (tokens consumed), and constrained performance (resource-limited capability)—revealing distinct cognitive fingerprints across model families (OpenAI's reasoning strength vs. Llama's balanced baseline). Benchmarks automatically escalate difficulty when models cluster near ceiling performance, maintaining discrimination power as capabilities advance while supporting any domain where difficulty can be parameterized.

602 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 14 / 25

How are scores calculated?

Stars

602

Forks

Language

Python

License

MIT

Higher-rated alternatives

balisujohn/localwriter

A LibreOffice Writer extension that adds local-inference generative AI features.

ChanithaAbey/AI-Agent-for-Stock-Prediction

An AI Agent for stock data analysis, news rerieval, and prediction; powered by yfinance,...

its-kumar-yash/deep-study-ai-agent

DeepStudy AI automates research, refines queries dynamically, and generates high-quality...

luiskugel/AI-Writing-Assistant-for-Thunderbird

A Thunderbird extension that helps improve your email writing using various AI models (LLMs) and...

hemangjoshi37a/hjAlgos

AI based algorithmic trading platform for zerodha users

Explore Transformer Models

All categories Trending Transformer directory Insights