HaseebKhalid1507/VelociRAG

Lightning-fast RAG for AI agents. ONNX-powered, 4-layer fusion, MCP server. No PyTorch.

41
/ 100
Emerging

Combines four independent retrieval methods—vector embeddings (FAISS), full-text search (SQLite FTS5), knowledge graph traversal, and metadata filtering—fused via reciprocal rank fusion with cross-encoder reranking, all running on ONNX Runtime without PyTorch. Provides MCP server integration for Claude/Cursor/Windsurf agents, a Unix socket daemon maintaining warm model state for sub-10ms searches, and incremental graph updates that detect file changes and only rebuild affected nodes. Targets developers building AI agents that need fast, multi-modal retrieval without external APIs or GPU dependencies.

3 stars and 1,078 monthly downloads. Available on PyPI.

Maintenance 13 / 25
Adoption 10 / 25
Maturity 18 / 25
Community 0 / 25

How are scores calculated?

Stars

3

Forks

Language

Python

License

MIT

Last pushed

Mar 28, 2026

Monthly downloads

1,078

Commits (30d)

0

Dependencies

10

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/mcp/HaseebKhalid1507/VelociRAG"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.