mohameddmansurr/kubernetes-rag-ops
A production-ready MLOps pipeline for Retrieval Augmented Generation (RAG). Features a FastAPI inference service and Qdrant vector database, orchestrated on Kubernetes with StatefulSets, resource scaling, and self-healing capabilities.
Stars
—
Forks
—
Language
Python
License
—
Category
Last pushed
Jan 08, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/mohameddmansurr/kubernetes-rag-ops"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
notadev-iamaura/OneRAG
Production-ready RAG Framework (Python/FastAPI). 1-line config swaps: 6 Vector DBs (Weaviate,...
pinecone-io/canopy
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
balavenkatesh3322/rag-doctor
🩺 Agentic RAG pipeline failure diagnosis tool. Tells you why your RAG failed — chunk...
MERakram/Advanced-RAG-monorepo
🚀 Production-ready modular RAG monorepo: Local LLM inference (vLLM) • Hybrid retrieval with...
teilomillet/raggo
A lightweight, production-ready RAG (Retrieval Augmented Generation) library in Go.