datallmhub/ragctl
A powerful CLI tool to manage, test, and optimize RAG pipelines. Streamline your Retrieval-Augmented Generation workflows from terminal.
Supports advanced document ingestion with OCR cascading (EasyOCR → PaddleOCR → pytesseract fallback) and intelligent semantic chunking via LangChain, outputting metadata-rich chunks across JSON/JSONL/CSV formats. Built for production with batch processing, automatic retry with exponential backoff, error recovery modes, and optional Qdrant vector store integration for complete RAG pipeline automation.
Available on PyPI.
Stars
18
Forks
7
Language
Python
License
MIT
Category
Last pushed
Jan 12, 2026
Monthly downloads
33
Commits (30d)
0
Dependencies
31
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/datallmhub/ragctl"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
Bessouat40/RAGLight
RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to...
superagent-ai/super-rag
Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters...
McKern3l/RAGdrag
RAG pipeline security testing toolkit - 27 techniques across 6 kill chain phases, mapped to MITRE ATLAS
feld-m/rag_blueprint
A modular framework for building and deploying Retrieval-Augmented Generation (RAG) systems with...
mburaksayici/RAG-Boilerplate
RAG boilerplate with semantic/propositional chunking, hybrid search (BM25 + dense), LLM...