Local PDF RAG Systems Vector Databases

Complete RAG implementations for querying PDF documents using local/offline infrastructure (Ollama, FAISS, ChromaDB, etc.). Does NOT include cloud-hosted solutions, non-PDF document types as primary focus, or general RAG frameworks without PDF-specific examples.

There are 65 local pdf rag systems tools tracked. 1 score above 50 (established tier). The highest-rated is VectifyAI/PageIndex at 65/100 with 21,374 stars. 1 of the top 10 are actively maintained.

Get all 65 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=vector-db&subcategory=local-pdf-rag-systems&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 VectifyAI/PageIndex

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

65
Established
2 thearpankumar/GPUaccelerated-multilingual-RAG

GPU - vector DB - AI-powered document processing platform for financial...

33
Emerging
3 praj2408/RAG-Enhanced-NCERT-Tutor

RAG-Enhanced-NCERT-Tutor is an AI-powered tutor for NCERT curriculum, using...

28
Experimental
4 justine-george/ai-markdown-llm-retrieval

AI-powered document query system using LangChain, ChromaDB, and OpenAI for...

26
Experimental
5 Vikas-ai56/Contextual_RAG

An Advanced RAG system using Python and Langgraph for intelligent, stateful...

24
Experimental
6 BerkCpro/academic-pdf-rag

RAG system for answering questions from academic PDFs using FAISS, BGE...

22
Experimental
7 RITIK1442840127/Enterprise-PDF-Q-A-System-RAG-LLM-

AI-powered Enterprise PDF Management System using RAG + LLM for semantic...

22
Experimental
8 Ashish4144/pageindex

Build hierarchical document indexes using LLM reasoning for intuitive...

22
Experimental
9 eugen-goebel/smart-doc-qa

RAG system to chat with PDF, DOCX, and TXT documents with source-grounded answers

22
Experimental
10 altafpinjari2001/rag-document-qa

Production-ready RAG pipeline for intelligent document Q&A with LangChain,...

22
Experimental
11 KayraBulbul/NSW-Crime-RAG-System

A RAG application for analysing NSW crime statistics using LangChain, OpenAI...

20
Experimental
12 0x5h31d0n/Bajaj-Hackrx

A RAG model that takes document input and answers query related pertaining...

19
Experimental
13 baranozgurtas/Academic-RAG-Assistant

End-to-end RAG pipeline for academic PDFs with citation-grounded QA and...

19
Experimental
14 keyvar/paperbrace

Local-first PDF literature navigator with RAG: index your library, ask...

19
Experimental
15 vishalkumar-swe/documind-rag-endee

RAG-powered document Q&A system built using Endee Vector Database and FastAPI

19
Experimental
16 abhijithj12/Legal-Compliance-RAG-Assistant

Domain-restricted Legal & Compliance RAG Assistant built with LangChain,...

19
Experimental
17 intersystems-ib/workshop-llm

🧠 Hands-on RAG workshop using InterSystems IRIS & LLMs - Build PDF Q&A...

18
Experimental
18 dulhara79/Research_Assistant_for_PDFs

Research Assistant for PDFs is a full-stack prototype that helps you upload,...

17
Experimental
19 kbhujbal/KnowledgeAssist-Retrieval-Augmented-Generation-RAG-Document-QA-System

A full-stack RAG application that enables intelligent document Q&A. Upload...

17
Experimental
20 zmwzmo11130/neuraldocs

Demo RAG API (FastAPI, OpenAI, ChromaDB, Docker) automatically generated...

16
Experimental
21 Akshitha0118/ContextCore-Domain-Aware-RAG-Chatbot

RAG-based Question Answering system using LangChain, Groq LLM, and ChromaDB...

15
Experimental
22 SahilKhan101/smartdocs

Production-ready RAG documentation assistant built with FastAPI, LangChain,...

15
Experimental
23 RomanRosa/docschat-rag

Advanced RAG system for technical documentation with source citation, hybrid...

15
Experimental
24 perlathebian/document-chatbot-rag

RAG-based document chatbot — upload PDFs, ask questions, get answers with...

15
Experimental
25 gondar-tech/langchain-assessment

Langchain assessment - Docuemnt Q&A Agent

15
Experimental
26 SKcoder6344/insurance-rag-endee

RAG-based Insurance Policy Q&A using Endee Vector DB + OpenAI GPT | FastAPI | Docker

15
Experimental
27 th3akash/PDF-RAG-AI

A local Retrieval-Augmented Generation (RAG) application built with FastAPI,...

15
Experimental
28 asaeles/manual-master

Manual Master: CLI tool to build local RAG knowledge bases from docs (PDF,...

15
Experimental
29 muqadasejaz/PDF-QA-RAG-System

A PDF Question-Answering App built with RAG (Retrieval-Augmented...

15
Experimental
30 aimaster-dev/SmartRAG

SmartRAG is a terminal-based RAG system using LangGraph. It processes...

15
Experimental
31 pavankethavath/PDF_question_answering_chatbot_using_RAG

A PDF Question Answering System leveraging Retrieval-Augmented Generation...

14
Experimental
32 SakuraPuare/Delphi

可离线部署的本地知识库系统 — 导入代码仓库、技术文档、音视频,基于 RAG 的智能问答

14
Experimental
33 sothulthorn/RAG-Maritime-Safety

A Retrieval-Augmented Generation application for maritime safety. Ask...

14
Experimental
34 marianaciccone1989-pixel/corporate-rag-assistant

Local RAG system for querying PDF documents using FAISS, LangChain, and...

14
Experimental
35 Purnachander-Konda/RAG-Based-PDF-Assistant

RAG-based Research Paper Assistant - Upload PDFs and ask questions using...

14
Experimental
36 Namanpaliyal/KardiaFlow

🚀 Build and deploy a lightweight RAG demo using FastAPI, Chroma, and Ollama...

14
Experimental
37 killboy7/rag-pdf-qa

📘 Enable natural-language querying of PDFs with a local RAG system using...

14
Experimental
38 bhatt-aditya03/DocuMind

RAG-based Legal & Research Document Analyzer — LangChain, ChromaDB, Groq...

14
Experimental
39 IvayloP0709/langchain-pdf-rag

Retrieval-Augmented Generation (RAG) system for PDF documents using...

14
Experimental
40 Exoshiva/Ask-PDFi

Containerized RAG SaaS prototype featuring local document vectorization,...

14
Experimental
41 mattialoszach/RISCV-GPT

RAG powered tool that answers technical questions about the official RISC-V ISA

13
Experimental
42 Varunv003/langchain-palm2-rag_application

DocChat: Langchain Retrieval System, seamlessly navigate and converse with...

12
Experimental
43 davidenegri0/LLaMASearchDocs

AI-based document search engine based on LLaMa2 by Meta, early development version

12
Experimental
44 Jimlibo/custom-rag-app

A streamlit app that allows the user to upload his/her own PDFs, and uses...

12
Experimental
45 gregorydearing/AI-Handbook-Generator

AI-powered system that generates comprehensive handbooks from PDF documents...

12
Experimental
46 AyushSingh360/Enterprise-RAG-System

A robust RAG backend featuring semantic chunking, embedding caching, and a...

12
Experimental
47 sangramh007/pdf-llm

Local PDF Question-Answering system using Ollama, Phi-3, embeddings, and...

11
Experimental
48 makers10/PDF-RAG-BOT

A state-of-the-art Retrieval-Augmented Generation (RAG) system with hybrid...

11
Experimental
49 A7medElsharkawy/DocQA

This project leverages LayoutLMv2, a state-of-the-art model for document...

11
Experimental
50 francis-rf/RAG-document-qa

RAG-powered document Q&A system using LangGraph and FAISS vector store....

11
Experimental
51 MuskanPaliwal/rag-tool-zenml

A ZenML-based RAG system for document Q&A with multi-format support. My...

11
Experimental
52 Varshakaleeswaran/RAG_Project

AI PDF Assistant using RAG, ChromaDB, and Streamlit for intelligent document...

11
Experimental
53 nenosoft131/rag-app-using-ollama

A modern Retrieval-Augmented Generation (RAG) application with a cleanly...

11
Experimental
54 JeevanB1111/rag-interview-assistant

Production-ready Retrieval-Augmented Generation (RAG) system built with...

11
Experimental
55 24f2000058/pdf_knowledge_tool

Enterprise-ready, local-first RAG system that converts complex PDFs into...

11
Experimental
56 GowriPriyanka27/local-rag-system

Multi-document Retrieval-Augmented Generation system using FastAPI, FAISS,...

11
Experimental
57 ai-prasanth/sample-claude-rag-system

AI-powered PDF Q&A system using RAG with Claude API, Qdrant, and...

11
Experimental
58 22CB006/AI-Knowledge-Copilot

Production-grade RAG system for querying PDFs and web content with semantic...

11
Experimental
59 moubarak1ezzyani/RAG-IT-Support-Assistant

A reliable RAG-powered assistant for IT technicians to query procedures,...

11
Experimental
60 Queen-esther01/RAG-Langchain

RAG-powered PDF Q&A app built with LangChain, OpenAI GPT-4o, ChromaDB and...

11
Experimental
61 gunjitnegi/Smart_Doc_RAG

GPU-Accelerated Retrieval-Augmented Generation (RAG) system for long-form...

11
Experimental
62 SaiKrishnaRaoAnugu/SmartDoc-AI

Local RAG-based document intelligence assistant using Mistral, FAISS, and Streamlit.

11
Experimental
63 Hassi34/document-portal

A FastAPI-powered application to analyze, compare, and chat with documents...

11
Experimental
64 shraddhag97/genai-doc-qa

Document Question Answering using Retrieval-Augmented Generation

11
Experimental
65 ADHAYA-Technos/Automated-Information-Retrieval-and-Summarization-for-Academic-Research-Articles

Developing an OCR + LLM-powered tool that helps researchers, students, and...

10
Experimental

Comparisons in this category