rungalileo/hallucination-index
Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.
Evaluates 22 models across three RAG scenarios (short/medium/long context) using ChainPoll—a multi-polling chain-of-thought technique—to quantify hallucinations and contextual adherence. Tests both open and closed-source models against variable context lengths (5k-100k tokens) and prompting strategies like Chain-of-Note. Includes custom LLM-based evaluation for factual accuracy and position-bias analysis across 10,000 domain-specific documents.
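The ChainPoll idea described above — polling a chain-of-thought LLM judge several times and aggregating the votes — can be sketched as follows. This is a minimal illustration, not Galileo's implementation: the `judge` callable stands in for an LLM API call, and the toy substring-based judge below is purely hypothetical.

```python
def chainpoll_score(question, context, answer, judge, n_polls=5):
    """ChainPoll-style scoring (sketch): poll a judge n times on whether
    the answer is grounded in the retrieved context, then return the
    fraction of polls that flagged a hallucination (0.0 = clean, 1.0 =
    always flagged). In the real technique, `judge` is an LLM prompted
    to reason step by step before voting; here it is any callable
    returning True when it detects a hallucination."""
    votes = [judge(question, context, answer) for _ in range(n_polls)]
    return sum(votes) / n_polls


def naive_judge(question, context, answer):
    # Toy stand-in for an LLM judge: flag the answer as hallucinated
    # if it does not appear verbatim in the context. Deterministic,
    # so repeated polls agree; a real LLM judge is stochastic, which
    # is why ChainPoll averages multiple polls.
    return answer not in context


context = "Paris is the capital of France."
grounded = chainpoll_score("Capital of France?", context, "Paris", naive_judge)
ungrounded = chainpoll_score("Capital of France?", context, "Lyon", naive_judge)
print(grounded, ungrounded)  # 0.0 1.0
```

With a stochastic LLM judge the score becomes a fractional hallucination probability per sample, which the index then averages over a dataset to rank models.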
116 stars. No commits in the last 6 months.
Stars: 116
Forks: 9
Language: —
License: —
Category: —
Last pushed: Jul 28, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/rungalileo/hallucination-index"
Open to everyone: 100 requests/day with no key required; a free key raises the limit to 1,000/day.
Higher-rated alternatives
onestardao/WFGY
WFGY: open-source reasoning and debugging infrastructure for RAG and AI agents. Includes the...
KRLabsOrg/verbatim-rag
Hallucination-prevention RAG system with verbatim span extraction. Ensures all generated content...
iMoonLab/Hyper-RAG
"Hyper-RAG: Combating LLM Hallucinations using Hypergraph-Driven Retrieval-Augmented Generation"...
frmoretto/clarity-gate
Stop LLMs from hallucinating your guesses as facts. Clarity Gate is a verification protocol for...
chensyCN/LogicRAG
Source code of LogicRAG at AAAI'26.