AmariahAK/arp
Extremely hard, multi-turn, open-source-grounded coding evaluations that reliably break every current frontier model (Claude, GPT, Grok, Gemini, Llama, etc.) on numerical stability, zero-allocation, autograd, SIMD, and long-chain correctness.
Stars: —
Forks: —
Language: —
License: MIT
Category:
Last pushed: Jan 27, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ai-coding/AmariahAK/arp"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
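The same request can be made from a script. The following is a minimal sketch in Python using only the standard library. It assumes the endpoint returns a JSON body, which this page does not confirm, and the helper names build_url and fetch_quality are hypothetical, chosen for illustration. How an API key is supplied is not documented here, so the sketch covers only the keyless request.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/ai-coding"


def build_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual names cannot break the path.
    owner_part = urllib.parse.quote(owner, safe="")
    repo_part = urllib.parse.quote(repo, safe="")
    return f"{BASE_URL}/{owner_part}/{repo_part}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for owner/repo.

    Assumption: the response body is JSON. If it is not, json.loads
    raises a ValueError and the raw body should be inspected instead.
    """
    with urllib.request.urlopen(build_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request and counts against the daily limit):
# data = fetch_quality("AmariahAK", "arp")
```

Because keyless access is limited to 100 requests/day, it is worth caching the result locally rather than calling fetch_quality in a loop.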
Higher-rated alternatives
github/spec-kit
💫 Toolkit to help you get started with Spec-Driven Development
nold-ai/specfact-cli
The “swiss knife” CLI for agile DevOps teams. Keep backlog, specs, tests, and code in sync....
ossature/ossature
An open-source harness for spec-driven code generation.
888888888881/spec-kit-chinese
🇨🇳 Spec-Kit Chinese edition | Complete Chinese localization of the GitHub Spec-Driven Development toolkit | Chinese Localization of GitHub Spec-Kit
speq-ai/speq
The SPEQ specification: a declarative format for AI-assisted development