Ozgur-al/local-rag-server
Privacy-first Local RAG Server: Chat with PDF & DOCX using GGUF models via llama.cpp and Qdrant. A lightweight, standalone FastAPI server with a clean HTML UI. High-performance, fully offline document intelligence. No Ollama, no cloud, no API keys.
Stars
2
Forks
—
Language
Python
License
MIT
Category
Last pushed
Feb 24, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/Ozgur-al/local-rag-server"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
undreamai/LLMUnity
Create characters in Unity with LLMs!
Mintplex-Labs/anythingllm-docs
Documentation of AnythingLLM by Mintplex Labs Inc.
bloodworks-io/phlox
Open source, local first AI medical scribe for desktop and web.
mamei16/LLM_Web_search
An extension for oobabooga/text-generation-webui that enables the LLM to search the web
snexus/llm-search
Querying local documents, powered by LLM