LLM Hallucination Mitigation LLM Tools
Tools and techniques for detecting, measuring, and correcting hallucinations in large language models across text and multimodal outputs. Does NOT include general LLM evaluation, factuality benchmarks, or non-hallucination-specific safety measures.
There are 46 llm hallucination mitigation tools tracked. 1 score above 50 (established tier). The highest-rated is vectara/hallucination-leaderboard at 62/100 with 3,122 stars. 1 of the top 10 are actively maintained.
Get all 46 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=llm-hallucination-mitigation&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
vectara/hallucination-leaderboard
Leaderboard Comparing LLM Performance at Producing Hallucinations when... |
|
Established |
| 2 |
amir-hameed-mir/Sirraya_LSD_Code
Layer-wise Semantic Dynamics (LSD) is a model-agnostic framework for... |
|
Emerging |
| 3 |
Amirhosein-gh98/Gnosis
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits |
|
Emerging |
| 4 |
PKU-YuanGroup/Hallucination-Attack
Attack to induce LLMs within hallucinations |
|
Emerging |
| 5 |
MemTensor/HaluMem
HaluMem is the first operation level hallucination evaluation benchmark... |
|
Emerging |
| 6 |
NishilBalar/Awesome-LVLM-Hallucination
up-to-date curated list of state-of-the-art Large vision language models... |
|
Emerging |
| 7 |
intuit/sac3
Official repo for SAC3: Reliable Hallucination Detection in Black-Box... |
|
Emerging |
| 8 |
HillZhang1999/llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper:... |
|
Experimental |
| 9 |
OpenMOSS/HalluQA
Dataset and evaluation script for "Evaluating Hallucinations in Chinese... |
|
Experimental |
| 10 |
Mattbusel/LLM-Hallucination-Detection-Script
A comprehensive toolkit for detecting potential hallucinations in LLM... |
|
Experimental |
| 11 |
hongcheki/sweet-watermark
Official repository of the paper: Who Wrote this Code? Watermarking for Code... |
|
Experimental |
| 12 |
plll4zzx/Awesome-LLM-Watermark
A collection list for Large Language Model (LLM) Watermark |
|
Experimental |
| 13 |
VITA-MLLM/Woodpecker
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models |
|
Experimental |
| 14 |
hzy312/Awesome-LLM-Watermark
UP-TO-DATE LLM Watermark paper. 🔥🔥🔥 |
|
Experimental |
| 15 |
10nc0/Nyan-Protocol
Hallucination guard for AI — one invariant, any model, no training required. |
|
Experimental |
| 16 |
18907305772/KCA
EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud |
|
Experimental |
| 17 |
hallucinatemd/hallucinate.md
The open standard for telling AI not to hallucinate. |
|
Experimental |
| 18 |
amazon-science/THRONE
Code release for THRONE, a CVPR 2024 paper on measuring object... |
|
Experimental |
| 19 |
oumi-ai/halloumi-demo
Try out HallOumi, a state-of-the-art claim verification model in a simple UI! |
|
Experimental |
| 20 |
destination-earth/DestinE_ESA_DUA
DestinE Usage Assessment (DUA) framework services provides a system capable... |
|
Experimental |
| 21 |
hongbinye/Cognitive-Mirage-Hallucinations-in-LLMs
Repository for the paper "Cognitive Mirage: A Review of Hallucinations in... |
|
Experimental |
| 22 |
zjunlp/FactCHD
[IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection |
|
Experimental |
| 23 |
utkukose/llm_persona_hallucination_study
Code for the study on persona vectors in controling / understanding... |
|
Experimental |
| 24 |
WindJammer6/37.-A-Hallucination-Mitigation-Scheme-in-Security-Policy-Generation-with-Large-Language-Models
Source code for the paper: A Hallucination Mitigation Scheme in Security... |
|
Experimental |
| 25 |
llaraspata/HallucinationDetection
Analyzing the correlation between Hallucinations and Knowledge Conflicts in... |
|
Experimental |
| 26 |
Sayar-212/TheLucidityBound
A Unified Framework for Hallucination Benchmarking and Mitigation in Large... |
|
Experimental |
| 27 |
MoNejjar/echo-hallucination-detect
Mitigating Hallucination Potential in User Prompts Through AI-Guided... |
|
Experimental |
| 28 |
Intelligent-Computing-Research-Group/HaVen
[DATE 2025] haven: hallucination-mitigated llm for verilog code generation... |
|
Experimental |
| 29 |
schipp/hallucination_guarantees
A pipeline that gives probabilistic guarantees for reducing contextual... |
|
Experimental |
| 30 |
stbiadmin/diu-halu
Framework for evaluating LLM hallucination detection in Department of... |
|
Experimental |
| 31 |
KhoiBui16/UIT_CS221_Basic_Natural_Language_Processing
The project focuses on classifying hallucinations in Vietnamese LLM outputs... |
|
Experimental |
| 32 |
lilakk/PostMark
Official repository for "PostMark: A Robust Blackbox Watermark for Large... |
|
Experimental |
| 33 |
rkhokhla/kakeya
When AI makes $10M decisions, hallucinations aren't bugs—they're business... |
|
Experimental |
| 34 |
IAAR-Shanghai/ICSFSurvey
Explore concepts like Self-Correct, Self-Refine, Self-Improve,... |
|
Experimental |
| 35 |
141forever/UncerSema4HalluDetec
This is the repository for the paper 'Enhancing Uncertainty Modeling with... |
|
Experimental |
| 36 |
tranhoangtu-it/halluciguard-api
HalluciGuard API — AI Hallucination Firewall as a Service. Detect and filter... |
|
Experimental |
| 37 |
strayfear/HalluWorld
🌍 Explore the HalluWorld project, a benchmark for understanding and defining... |
|
Experimental |
| 38 |
DegenAI-Labs/HalluWorld
Repository for the paper "A Unified Definition of Hallucination: It’s The... |
|
Experimental |
| 39 |
ruisizhang123/REMARK-LLM
[USENIX Security'24] REMARK-LLM: A robust and efficient watermarking... |
|
Experimental |
| 40 |
kjgpta/WhoDunIt-Evaluation_benchmark_for_culprit_detection_in_mystery_stories
WHODUNIT is a benchmark repository for evaluating large language models'... |
|
Experimental |
| 41 |
lasithadilshan/Hallucination-Detector-App
A Hallucination Detection Tool powered by UQML, designed to identify whether... |
|
Experimental |
| 42 |
akborsusom/watermark-ai-analysis
Reproduction and attack analysis of LLM text watermarking (Kirchenbauer et... |
|
Experimental |
| 43 |
Xieyangxinyu/Watermark-in-the-Classroom
This repository reproduces the results reported in Watermark in the... |
|
Experimental |
| 44 |
AInnovateLab/watermark-collision
[NAACL'25 Findings] Lost in Overlap: Exploring Logit-based Watermark... |
|
Experimental |
| 45 |
serhanylmz/pas2
PAS2: A Python-based hallucination detection system that evaluates AI... |
|
Experimental |
| 46 |
pranav-kural/llm-hallucination-detection-service
Build your own open-source REST API endpoint to detect hallucination in LLM... |
|
Experimental |