the-crypt-keeper/can-ai-code
Self-evaluating interview for AI coders
Parametric difficulty scaling generates unlimited unique problems across two dimensions (length and depth) rather than fixed test cases, measuring how far each model climbs a difficulty ramp instead of binary pass/fail. The framework evaluates three metrics—height (max difficulty achieved), efficiency (tokens consumed), and constrained performance (resource-limited capability)—revealing distinct cognitive fingerprints across model families (OpenAI's reasoning strength vs. Llama's balanced baseline). Benchmarks automatically escalate difficulty when models cluster near ceiling performance, maintaining discrimination power as capabilities advance while supporting any domain where difficulty can be parameterized.
602 stars. No commits in the last 6 months.
Stars
602
Forks
34
Language
Python
License
MIT
Category
Last pushed
Jun 21, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/the-crypt-keeper/can-ai-code"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
balisujohn/localwriter
A LibreOffice Writer extension that adds local-inference generative AI features.
ChanithaAbey/AI-Agent-for-Stock-Prediction
An AI Agent for stock data analysis, news rerieval, and prediction; powered by yfinance,...
its-kumar-yash/deep-study-ai-agent
DeepStudy AI automates research, refines queries dynamically, and generates high-quality...
luiskugel/AI-Writing-Assistant-for-Thunderbird
A Thunderbird extension that helps improve your email writing using various AI models (LLMs) and...
hemangjoshi37a/hjAlgos
AI based algorithmic trading platform for zerodha users