ShabnamAtf/ScenarioBench

Trace-grounded compliance benchmark for Text-to-SQL and RAG

— / 100

Experimental

No commits in the last 6 months.

Stale 6m · No Package · No Dependents

Maintenance 2 / 25

Adoption 0 / 25

Maturity 15 / 25

Community 0 / 25

Stars

—

Forks

—

Language

Python

License

—

Category

Last pushed

Oct 06, 2025

Commits (30d)

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/ShabnamAtf/ScenarioBench"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
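The same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the helper names `quality_url` and `fetch_quality` are hypothetical, the path layout (category, owner, repo) is inferred from the single curl example above, and the response is assumed to be JSON, which this page does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build an endpoint URL shaped like the curl example.

    Assumption: the path is <category>/<owner>/<repo>, inferred from
    the one URL shown on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record and decode it.

    Assumption: the body is JSON; the schema is not documented here,
    so the decoded value is returned as-is.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("rag", "ShabnamAtf", "ScenarioBench")` would issue the same GET as the curl command. Unauthenticated use counts against the 100 requests/day limit, so caching the result locally is a reasonable choice.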

Higher-rated alternatives

TJ-Neary/AI_Eval

Comprehensive LLM evaluation framework comparing local and cloud models with hardware-aware...

masaakisakamoto/memory-os

Deterministic continuity for AI systems. Detect and repair inconsistencies across sessions — not...

dahlinomine/local-llm-rag-bench

Python tool for benchmarking local LLM performance on specific RAG datasets.

VectoringAI/ai-engineering

Practical tutorials to build AI Engineering skills

priyanshus/evaliphy

E2E RAG Testing Tool