ShabnamAtf/ScenarioBench
Trace-grounded compliance benchmark for Text-to-SQL and RAG
17 / 100
Experimental
No commits in the last 6 months.
Stale 6m
No Package
No Dependents
Maintenance
2 / 25
Adoption
0 / 25
Maturity
15 / 25
Community
0 / 25
Stars
—
Forks
—
Language
Python
License
—
Category
Last pushed
Oct 06, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/ShabnamAtf/ScenarioBench"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
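The same request can be made from Python. The sketch below is a minimal illustration using only the standard library. The helper names `quality_url` and `fetch_quality` are hypothetical. The path layout (`/quality/<category>/<owner>/<repo>`) is inferred from the single curl example above, and the JSON response format is an assumption, since this page does not show a response body.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository.

    Assumption: the three path segments are category, owner, and repo,
    as suggested by ".../quality/rag/ShabnamAtf/ScenarioBench".
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record without an API key.

    Assumption: the endpoint returns a JSON body; the schema is not
    documented on this page, so the parsed value is returned as-is.
    """
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("rag", "ShabnamAtf", "ScenarioBench")` issues the same GET request as the curl command. Each call counts against the 100 requests/day keyless limit, so caching the result locally is a reasonable precaution.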
Higher-rated alternatives
TJ-Neary/AI_Eval
Comprehensive LLM evaluation framework comparing local and cloud models with hardware-aware...
24
masaakisakamoto/memory-os
Deterministic continuity for AI systems. Detect and repair inconsistencies across sessions — not...
23
dahlinomine/local-llm-rag-bench
Python tool for benchmarking local LLM performance on specific RAG datasets.
22
VectoringAI/ai-engineering
Practical tutorials to build AI Engineering skills
22
priyanshus/evaliphy
E2E RAG Testing Tool
22