Hallucination Detection RAG RAG Tools
Tools and systems specifically designed to detect, mitigate, verify, and prevent hallucinations in RAG pipelines through claim extraction, evidence retrieval, and factuality validation. Does NOT include general RAG quality monitoring, broader fact-checking systems outside RAG context, or hallucination research in non-RAG LLM applications.
There are 40 hallucination detection rag tools tracked. 3 score above 50 (established tier). The highest-rated is onestardao/WFGY at 64/100 with 1,620 stars. 1 of the top 10 are actively maintained.
Get all 40 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=rag&subcategory=hallucination-detection-rag&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
onestardao/WFGY
WFGY: open-source reasoning and debugging infrastructure for RAG and AI... |
|
Established |
| 2 |
KRLabsOrg/verbatim-rag
Hallucination-prevention RAG system with verbatim span extraction. Ensures... |
|
Established |
| 3 |
iMoonLab/Hyper-RAG
"Hyper-RAG: Combating LLM Hallucinations using Hypergraph-Driven... |
|
Established |
| 4 |
frmoretto/clarity-gate
Stop LLMs from hallucinating your guesses as facts. Clarity Gate is a... |
|
Emerging |
| 5 |
anulum/director-ai
Real-time LLM hallucination guardrail — NLI + RAG fact-checking with... |
|
Emerging |
| 6 |
project-miracl/nomiracl
NoMIRACL: A multilingual hallucination evaluation dataset to evaluate LLM... |
|
Emerging |
| 7 |
chensyCN/LogicRAG
Source code of LogicRAG at AAAI'26. |
|
Emerging |
| 8 |
anlp-team/LTI_Neural_Navigator
"Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case... |
|
Experimental |
| 9 |
Betswish/MIRAGE
Easy-to-use MIRAGE code for faithful answer attribution in RAG applications.... |
|
Experimental |
| 10 |
rafay123321/embedding-hallucinations
This repo shows how foundational model hallucinates and how we can fix such... |
|
Experimental |
| 11 |
rungalileo/hallucination-index
Initiative to evaluate and rank the most popular LLMs across common task... |
|
Experimental |
| 12 |
amitgambhir/rag-auditor
Open source RAG evaluation platform — automatically score faithfulness,... |
|
Experimental |
| 13 |
MukundaKatta/RAGGuard
RAG hallucination detection — verify LLM responses are grounded in source... |
|
Experimental |
| 14 |
aryan-bhadana/rag-debugger
A production-style RAG debugger with hybrid retrieval, failure detection,... |
|
Experimental |
| 15 |
TECHKNOWMAD-LABS/ground-truth
Hallucination detection for RAG pipelines. |
|
Experimental |
| 16 |
scasella/adaptive_rag_rlm
A verifiers RLM environment for testing whether adaptive recursive search... |
|
Experimental |
| 17 |
renataennes/rag-hallucination-detector
RAG pipeline with bilingual EN/PT hallucination detection |
|
Experimental |
| 18 |
lechmazur/confabulations
Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes... |
|
Experimental |
| 19 |
tarekmasryo/rag-qa-logs-and-corpus
Multi-table RAG QA telemetry + decision-grade RAG Ops notebook for retrieval... |
|
Experimental |
| 20 |
metawake/raglint
pytest-native quality checks for RAG systems. Catches hallucinated entities,... |
|
Experimental |
| 21 |
onurcandonmezer/rag-quality-monitor
RAG quality monitoring and assurance platform |
|
Experimental |
| 22 |
PolarisLiu1/LAT
Look As You Think: Unifying Reasoning and Visual Evidence Attribution for... |
|
Experimental |
| 23 |
GreyCatVP/raft-canon
Architectural canon for production-grade RAFT / RAG systems: evaluation,... |
|
Experimental |
| 24 |
kareem2002-k/clara-vs-rag-comparison
🔬 Compare CLaRa (latent compression) vs RAG (prompt stuffing) for document... |
|
Experimental |
| 25 |
hemanthballa07/HALO-RAG
Self-Verification Chains for Hallucination-Free Retrieval-Augmented... |
|
Experimental |
| 26 |
bdeva1975/hallucinationbench
Detect hallucinations in your RAG pipeline output — in two lines of Python. |
|
Experimental |
| 27 |
Padraigobrien08/model-failure-lab
Toolkit for discovering, classifying, and debugging failure modes in LLM and... |
|
Experimental |
| 28 |
alp-oz/cautious-rag
A RAG system that knows when not to answer using concentration inequalities |
|
Experimental |
| 29 |
nickhuang99/Intent-Aware-RAG
Why Pure Vector Search is a "False Proposition" for RAG? |
|
Experimental |
| 30 |
yuvaraj949/Dynamic-Uncertainty-Aware-Attribution-RAG
Token-level hallucination detection for RAG systems using Contextual... |
|
Experimental |
| 31 |
samuel-isr/VeritasRAG
A hallucination-resistant Retrieval-Augmented Generation (RAG) system. |
|
Experimental |
| 32 |
usal-research/rag_ctxdq
Implementation prototype for and executable context-aware data quality assessment |
|
Experimental |
| 33 |
Kanisha-Shah/Hallucination-Mitigation-Using-RAG
A Columbia University capstone project focused on mitigating hallucinations... |
|
Experimental |
| 34 |
emory-irlab/conqret-rag
Controversial Questions for Argumentation and Retrieval |
|
Experimental |
| 35 |
Sakshi3027/rag-handbook-qa
A production-ready RAG system with citations and hallucination prevention |
|
Experimental |
| 36 |
qualigenai/rag-learning
Production-ready RAG system with evaluation framework — zero hallucination,... |
|
Experimental |
| 37 |
Arnav-Ajay/rag-failure-modes
Failure-first analysis of retrieval-augmented and agentic systems, focused... |
|
Experimental |
| 38 |
Arnav-Ajay/rag-systems-foundations
A systems-level analysis of static RAG pipelines, isolating ingestion,... |
|
Experimental |
| 39 |
khaledahmed-Tech/rag-patterns-in-production
RAG reliability patterns: failure modes, observability, and quality loops. |
|
Experimental |
| 40 |
apatni24/VisionQA
Context-aware tool for automated BDD test generation and execution using... |
|
Experimental |