David-Lolly/ViewRAG

图文并茂的 PDF RAG 系统:支持版式感知分块、图表深度理解与精准视觉溯源。 Multimodal PDF RAG: Features layout-aware chunking, visual chart understanding, and precise inline image citations.

40
/ 100
Emerging

Implements a multimodal RAG pipeline combining PaddleX layout-aware PDF parsing with vision LLM understanding of charts and images, storing structured semantic descriptions in pgvector for retrieval. The system uses OpenAI-compatible APIs for flexible model selection (Qwen, DeepSeek, GLM, Ollama) and integrates MinIO for image storage, enabling inline image citations in LLM responses with precise PDF page/section tracing through a custom reference attribution system.

No Package No Dependents
Maintenance 10 / 25
Adoption 6 / 25
Maturity 9 / 25
Community 15 / 25

How are scores calculated?

Stars

21

Forks

4

Language

Python

License

MIT

Last pushed

Feb 27, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/David-Lolly/ViewRAG"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.