weiwill88/Local_Pdf_Chat_RAG
🧠 A RAG framework implemented in pure native Python | FAISS + BM25 hybrid retrieval | Supports Ollama / SiliconFlow | Suitable for beginners getting started
Implements a complete RAG pipeline as modular components, breaking document loading, text chunking, embedding, vector storage (FAISS), and LLM generation into separate stages that can be studied one at a time. Combines dense vector retrieval with BM25 sparse retrieval, adds cross-encoder reranking and recursive retrieval to improve answer accuracy, and provides a Gradio interface for interactive learning. Supports pluggable LLM backends by auto-detecting a local Ollama instance or a SiliconFlow API endpoint.
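To make the hybrid retrieval idea concrete, here is a minimal sketch of one common way to merge a dense (vector) ranking with a BM25 ranking: reciprocal rank fusion. This is an illustration only. The description above does not say which fusion method the repository uses, and the function name, the constant `k=60`, and the chunk IDs are all assumptions introduced for the example.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several best-first lists of document IDs into one ranking.

    rankings: list of lists, each ordered from most to least relevant.
    k: smoothing constant (60 is a commonly used default; an assumption here).
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            # Each list contributes more for items it ranks higher.
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID so output is deterministic.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical result orders, standing in for FAISS and BM25 output.
dense_hits = ["chunk_3", "chunk_1", "chunk_7"]
bm25_hits = ["chunk_1", "chunk_9", "chunk_3"]

fused = reciprocal_rank_fusion([dense_hits, bm25_hits])
print(fused)
```

Chunks that appear near the top of both lists rise to the front of the fused ranking, which is the general motivation for combining dense and sparse retrieval before a reranking step.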
842 stars. Actively maintained with 1 commit in the last 30 days.
Stars: 842
Forks: 159
Language: Python
License: not listed
Category:
Last pushed: Mar 18, 2026
Commits (30d): 1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/weiwill88/Local_Pdf_Chat_RAG"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
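The same request can be sketched in Python using only the standard library. The `category/owner/repo` path layout is inferred from the single URL shown above, and the assumption that the endpoint returns JSON is not confirmed by this page; `build_quality_url` and `fetch_quality` are hypothetical helper names.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    # Percent-encode each path segment so unusual characters stay valid.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category, owner, repo, timeout=10):
    # Assumption: the response body is JSON; adjust if the API differs.
    url = build_quality_url(category, owner, repo)
    with urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


url = build_quality_url("rag", "weiwill88", "Local_Pdf_Chat_RAG")
print(url)
```

Calling `fetch_quality("rag", "weiwill88", "Local_Pdf_Chat_RAG")` performs the network request and counts against the daily request limit mentioned above.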
Related tools
Future-House/paper-qa
High accuracy RAG for answering questions from scientific documents with citations
shubham0204/OnDevice-RAG-Android
A custom RAG pipeline for multi-document QA from PDF/DOCX documents, in Android
EarthlyAlien/Document-Assistant
RAG based Document Assistant for Search
dev-it-with-me/RagUltimateAdvisor
A complete Retrieval-Augmented Generation (RAG) application that demonstrates modern AI...
souradipp76/ReadMeReady
Tool for auto-generating README documentation for code repositories