notadev-iamaura/OneRAG
Production-ready RAG Framework (Python/FastAPI). 1-line config swaps: 6 Vector DBs (Weaviate, Pinecone, Qdrant, ChromaDB, pgvector, MongoDB), 5 LLMs (Gemini, OpenAI, Claude, Ollama, OpenRouter). OpenAI-compatible API. 2100+ tests.
Supports hybrid search (dense + BM25), GraphRAG for knowledge graph reasoning, and pluggable rerankers (6 options including Jina and Cohere) through a modular pipeline architecture. Includes built-in PII detection/masking, semantic/Redis caching layers, and query routing that classifies requests before retrieval. Designed for gradual complexity—start with basic vector search and layer in advanced features like agents and tool execution without refactoring the codebase.
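To make the hybrid search idea concrete, here is a minimal sketch of one common way to merge a dense (vector) ranking with a BM25 ranking: reciprocal rank fusion. This is an assumption for illustration only. The listing does not say which fusion method OneRAG uses, and the function name, the document IDs, and the constant `k=60` are all hypothetical.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of document IDs into one ranking.

    Each document scores 1 / (k + rank) for every list it appears in,
    with rank starting at 1. Illustrative only; not OneRAG's actual code.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by document ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from the two retrievers, best match first.
dense_hits = ["doc_a", "doc_b", "doc_c"]
bm25_hits = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([dense_hits, bm25_hits])
print(fused)
```

Documents that both retrievers return (`doc_a`, `doc_b`) rise above documents found by only one of them, which is the usual motivation for combining dense and keyword retrieval.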
Stars: 113
Forks: 35
Language: Python
License: MIT
Category:
Last pushed: Mar 09, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/notadev-iamaura/OneRAG"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
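The same request can be made from Python using only the standard library. This is a sketch under stated assumptions: the helper names `quality_url` and `fetch_quality` are hypothetical, the response is returned as raw text because the listing does not document the response format, and no API key is sent because the listing does not say how a key is passed.

```python
import urllib.request
from urllib.parse import quote

# Endpoint prefix copied from the curl example in the listing.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/vector-db"


def quality_url(repo: str) -> str:
    """Build the request URL for an 'owner/name' repository string."""
    # Keep the slash between owner and name; escape anything else unusual.
    return f"{BASE_URL}/{quote(repo, safe='/')}"


def fetch_quality(repo: str, timeout: float = 10.0) -> str:
    """Fetch the quality data and return the response body as text.

    The body is returned undecoded beyond UTF-8 because its schema is
    not documented in the listing.
    """
    with urllib.request.urlopen(quality_url(repo), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


# Example call (performs a network request, so it is left commented out):
# print(fetch_quality("notadev-iamaura/OneRAG"))
```

Since unauthenticated access is limited to 100 requests/day, it is worth caching the returned text rather than calling `fetch_quality` repeatedly for the same repository.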
Related tools
pinecone-io/canopy
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
teilomillet/raggo
A lightweight, production-ready RAG (Retrieval Augmented Generation) library in Go.
electricpipelines/barq
Dabarqus is incredibly fast RAG that runs everywhere.
MERakram/Advanced-RAG-monorepo
🚀 Production-ready modular RAG monorepo: Local LLM inference (vLLM) • Hybrid retrieval with...
balavenkatesh3322/rag-doctor
🩺 Agentic RAG pipeline failure diagnosis tool. Tells you why your RAG failed — chunk...