BjornMelin/docmind-ai-llm

DocMind AI is a powerful, open-source Streamlit application leveraging LlamaIndex, LangGraph, and local Large Language Models (LLMs) via Ollama, LMStudio, llama.cpp, or vLLM for advanced document analysis. Analyze, summarize, and extract insights from a wide array of file formats, securely and privately, all offline.

50
/ 100
Established

Combines hybrid retrieval (dense vector + sparse BM25) with optional GraphRAG for entity/relationship extraction, routed through a LangGraph supervisor coordinating five specialized agents (router, planner, retrieval, synthesis, validation). Ingestion uses LlamaIndex pipelines with Unstructured readers, spaCy NLP enrichment, and optional title extraction; retrieval includes BGE cross-encoder reranking and SigLIP visual reranking for image-rich PDFs. Stores page images as content-addressed artifacts with optional AES-GCM encryption, exports knowledge graphs as JSONL/Parquet, and includes DuckDB-backed snapshot caching with deterministic ingestion manifests for reproducibility.

100 stars.

No Package No Dependents
Maintenance 10 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 15 / 25

How are scores calculated?

Stars

100

Forks

14

Language

Python

License

MIT

Last pushed

Feb 05, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/BjornMelin/docmind-ai-llm"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.