Arterning/DeepParseX

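DeepParseX is a multimodal document parsing and knowledge management platform. It supports intelligent parsing of PDF, Word, Excel, PPT, image, video, audio, and other file formats, automatically extracts key information, and builds Retrieval-Augmented Generation (RAG) and Knowledge Graph systems for intelligent retrieval and reasoning over structured data.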

51 / 100 (Established)

Uses ParadeDB for vector-based semantic search in its RAG pipelines and prompts LLMs (GPT, Llama, Claude) to extract entities and relations for knowledge graph construction. Built on FastAPI with a modular architecture that supports custom parsers and embedding models, and exposes a REST API and a Python SDK for enterprise integration. Handles unstructured content through integrated OCR, ASR, and NLP services across text, tables, images, audio, and video.

No package, no dependents
Maintenance 10 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 17 / 25


Stars: 56
Forks: 11
Language: Python
License: MIT
Last pushed: Feb 21, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/Arterning/DeepParseX"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
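
The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns a JSON body and that the `rag` path segment is a category name; neither is stated on this page, and the helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(owner, repo, category="rag"):
    """Build the quality endpoint URL for a repository.

    Assumption: "rag" is a category segment; only this value appears in the source.
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(owner, repo, category="rag", timeout=10.0):
    """Fetch the quality data without an API key.

    Assumption: the response body is JSON. Its fields are not documented here,
    so the parsed object is returned as-is rather than unpacked.
    """
    url = build_quality_url(owner, repo, category)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("Arterning", "DeepParseX")` issues the same GET request as the curl command. Keyless access is limited to 100 requests/day, so caching the result locally is a sensible default.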