vectara/mirage-bench

Repository for Multilingual Generation, RAG evaluations, and surrogate judge training for Arena RAG leaderboard (NAACL'25)

/ 100

Emerging

No commits in the last 6 months. Available on PyPI.

Maintenance 2 / 25

Adoption 5 / 25

Maturity 18 / 25

Community 7 / 25

Stars

Forks

Language

Python

License

Apache-2.0

Category

Last pushed

Apr 10, 2025

Commits (30d)

Dependencies

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/vectara/mirage-bench"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
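
The curl request above can also be issued from Python. The following is a minimal sketch using only the standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, and the code assumes the endpoint returns a JSON body, which this page does not confirm. No API key is sent, so calls would count against the keyless 100 requests/day limit.

```python
import json
import urllib.request

# Base path copied from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/rag"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    return f"{BASE_URL}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for a repository.

    Assumption: the response body is JSON; the page does not document
    the response format, so adjust the parsing if it differs.
    """
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("vectara", "mirage-bench")` performs the same GET request as the curl command. Because the response fields are not listed here, inspect the returned object before relying on any particular key.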

Featured in

HZYAI/RagScore

⚡️ The "1-Minute RAG Audit" — Generate QA datasets & evaluate RAG systems in Colab, Jupyter, or...

vectara/open-rag-eval

RAG evaluation without the need for "golden answers"

DocAILab/XRAG

XRAG: eXamining the Core - Benchmarking Foundational Component Modules in Advanced...

AIAnytime/rag-evaluator

A library for evaluating Retrieval-Augmented Generation (RAG) systems (The traditional ways).

microsoft/benchmark-qed

Automated benchmarking of Retrieval-Augmented Generation (RAG) systems