atpuxiner/docsloader

This is a documents loader. (文档解析加载器,rag文档解析,rag知识库构建)

41
/ 100
Emerging

Supports 9+ file formats (txt, csv, md, html, xlsx, pptx, docx, pdf, images) with modular optional dependencies and an `AutoLoader` that automatically detects file type. Built on async/await patterns for non-blocking document processing, enabling efficient batch ingestion for RAG pipelines. Designed as a flexible loader component that can be integrated into knowledge base construction workflows.

129 stars. Used by 1 other package. Available on PyPI.

Maintenance 6 / 25
Adoption 11 / 25
Maturity 24 / 25
Community 0 / 25

How are scores calculated?

Stars

129

Forks

Language

Python

License

MIT

Last pushed

Dec 16, 2025

Commits (30d)

0

Dependencies

5

Reverse dependents

1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/atpuxiner/docsloader"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.