dmayboroda/minima
On-premises conversational RAG with configurable containers
Supports four deployment modes—fully local with Ollama, custom OpenAI-compatible LLM servers (vLLM, TGI, LocalAI), ChatGPT via custom GPT integration, and Anthropic Claude via MCP—with containerized architecture using Docker Compose. Implements semantic search with Sentence Transformer embeddings and Qdrant vector storage, optionally adding HuggingFace CrossEncoder reranking in Ollama mode, while custom LLM mode uses function calling for intelligent retrieval. Provides web UI at localhost:3000 and Electron desktop app, indexing PDF, Excel, DOCX, TXT, Markdown, and CSV documents from configurable local or cloud directories.
1,039 stars.
Stars
1,039
Forks
104
Language
Python
License
MPL-2.0
Category
Last pushed
Jan 22, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/dmayboroda/minima"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
vitali87/code-graph-rag
The ultimate RAG for your monorepo. Query, understand, and edit multi-language codebases with...
stevereiner/flexible-graphrag
Flexible GraphRAG: Python, LlamaIndex, Docker Compose: 8 Graph dbs, 10 Vector dbs, OpenSearch,...
christopherkarani/Wax
Lightening fast RAG on Apple Silicon. On-Device. No Server. No API. One File. Pure Swift
ggozad/haiku.rag
Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling
shredEngineer/Archive-Agent
Find your files with natural language and ask questions.