datallmhub/ragctl

A powerful CLI tool to manage, test, and optimize RAG pipelines. Streamline your Retrieval-Augmented Generation workflows from terminal.

54
/ 100
Established

Supports advanced document ingestion with OCR cascading (EasyOCR → PaddleOCR → pytesseract fallback) and intelligent semantic chunking via LangChain, outputting metadata-rich chunks across JSON/JSONL/CSV formats. Built for production with batch processing, automatic retry with exponential backoff, error recovery modes, and optional Qdrant vector store integration for complete RAG pipeline automation.

Available on PyPI.

Maintenance 10 / 25
Adoption 10 / 25
Maturity 18 / 25
Community 16 / 25

How are scores calculated?

Stars

18

Forks

7

Language

Python

License

MIT

Last pushed

Jan 12, 2026

Monthly downloads

33

Commits (30d)

0

Dependencies

31

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/datallmhub/ragctl"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.