dmayboroda/minima

On-premises conversational RAG with configurable containers

/ 100

Established

Supports four deployment modes: fully local with Ollama, custom OpenAI-compatible LLM servers (vLLM, TGI, LocalAI), ChatGPT via custom GPT integration, and Anthropic Claude via MCP. The architecture is containerized with Docker Compose. Implements semantic search with Sentence Transformer embeddings and Qdrant vector storage; Ollama mode can optionally add HuggingFace CrossEncoder reranking, while custom LLM mode uses function calling for retrieval. Provides a web UI at localhost:3000 and an Electron desktop app, and indexes PDF, Excel, DOCX, TXT, Markdown, and CSV documents from configurable local or cloud directories.
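The retrieve-then-rerank flow described above can be illustrated with a minimal, dependency-free sketch. This is not Minima's code: the bag-of-words `embed` function stands in for a Sentence Transformer model, the in-memory list stands in for Qdrant, and `overlap_score` is a hypothetical stand-in for a cross-encoder reranker. All function names and sample documents are assumptions made for illustration.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for a sentence embedding: lowercase bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, docs, top_k=3):
    """Stage 1: rank all documents by embedding similarity, keep the top_k."""
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:top_k]


def overlap_score(query, doc):
    """Hypothetical reranker: fraction of query terms that appear in the doc."""
    q_terms = set(query.lower().split())
    d_terms = set(doc.lower().split())
    return len(q_terms & d_terms) / len(q_terms) if q_terms else 0.0


def rerank(query, candidates, score_pair):
    """Stage 2: reorder the shortlist using a (query, document) pair scorer."""
    return sorted(candidates, key=lambda d: score_pair(query, d), reverse=True)


# Sample documents invented for this sketch.
DOCS = [
    "Qdrant stores vectors for semantic search",
    "Docker Compose starts the containers",
    "Ollama runs the language model locally",
    "Reranking reorders search results by relevance",
]

QUERY = "how are search results reordered"
candidates = retrieve(QUERY, DOCS, top_k=2)
best = rerank(QUERY, candidates, overlap_score)
```

The two-stage split reflects a common design choice: a cheap vector similarity narrows the corpus to a shortlist, and a slower scorer that looks at the query and document together reorders only that shortlist.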

1,039 stars.

No Package

No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 19 / 25


Stars 1,039

Forks 104

Language Python

License MPL-2.0

Related tools

vitali87/code-graph-rag

The ultimate RAG for your monorepo. Query, understand, and edit multi-language codebases with...

stevereiner/flexible-graphrag

Flexible GraphRAG: Python, LlamaIndex, Docker Compose: 8 Graph dbs, 10 Vector dbs, OpenSearch,...

christopherkarani/Wax

Lightning fast RAG on Apple Silicon. On-Device. No Server. No API. One File. Pure Swift

ggozad/haiku.rag

Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling

shredEngineer/Archive-Agent

Find your files with natural language and ask questions.
