RAG Pipeline Optimization Vector Databases

Tools for benchmarking, evaluating, and optimizing RAG pipeline components (chunking, embedding, retrieval methods). Includes frameworks for testing configurations, comparing techniques, and improving retrieval quality. Does NOT include full RAG applications, domain-specific implementations, or vector database backends themselves.

There are 33 rag pipeline optimization tools tracked. 1 score above 50 (established tier). The highest-rated is danny-avila/rag_api at 60/100 with 772 stars. 1 of the top 10 are actively maintained.

Get all 33 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=vector-db&subcategory=rag-pipeline-optimization&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 danny-avila/rag_api

ID-based RAG FastAPI: Integration with Langchain and PostgreSQL/pgvector

60
Established
2 mburaksayici/smallevals

smallevals — CPU-fast, GPU-blazing fast offline retrieval evaluation for RAG...

35
Emerging
3 naxoc/riffrag

A local RAG builder for code with a Claude Code skills creator

30
Emerging
4 GoparapukethaN/rag-forge

Modular RAG framework with hybrid retrieval, intelligent chunking, and...

23
Experimental
5 kxgst228/rag-forge

Modular RAG framework with hybrid retrieval, intelligent chunking, and...

23
Experimental
6 Dyinu/rag-forge

Benchmark multiple chunking, embedding, and retrieval combinations for RAG...

22
Experimental
7 hiatamaworkshop/dcp-rag

Data Cost Protocol encoder for system→AI data injection — converts...

22
Experimental
8 hamittokay/context-window

A simple RAG toolkit.

22
Experimental
9 AlbertMein/rag-document-processing

RAG pipeline components: document loaders, chunking, vector stores, retrieval

22
Experimental
10 oguzhankir/omnichunk

Structure-aware text chunking library for code, prose, and markup files....

22
Experimental
11 Arthrocentesisgenusphylloxera328/rag-forge

Benchmark RAG pipeline configurations by testing chunking, embedding, and...

22
Experimental
12 ai-agents-buzz/rag-chunking-playground

Visual tool to compare 6 RAG chunking strategies side-by-side with grading...

22
Experimental
13 irfanalidv/ragfallback

ragfallback is a Python library that prevents silent RAG failures — chunk...

22
Experimental
14 metawake/chunkweaver

RAG chunker that respects document structure.

22
Experimental
15 jvorndran/Unravel

A visual sandbox for experimenting with RAG configurations. Interactive...

20
Experimental
16 ThanhHung2112/Semantic_chunking

Semantic Chunking is a Python library for segmenting text into meaningful...

20
Experimental
17 oegozutok/RAG-Visualizer

An explainable AI tool that visualizes vector similarity and embedding...

19
Experimental
18 Maki-Grz/lumen-rag

A modular, database-agnostic RAG framework for Rust supporting MongoDB and Qdrant.

19
Experimental
19 Tek233/RAG

Multi-model RAG

19
Experimental
20 NuTerraLabs/ContextTape

File-based RAG storage: Zero-infrastructure vector database alternative for...

19
Experimental
21 Jogesh6895/chromadb-rag-system-python

⚡ Complete RAG pipeline implementation with ChromaDB vector database....

19
Experimental
22 naman20sharma/Chunking-Policies-in-RAG

RAG system with advanced chunking strategies (Late Chunking, Meta-Chunking,...

16
Experimental
23 arturoburigo/bfc_script_RAG

RAG for a Domain-Specific-language, using vectorDB and semantic search with...

16
Experimental
24 patteg21/pigeon-evals

A End-To-End RAG Pipeline that includes Evaluations, iterations, and...

16
Experimental
25 alex3ai/rag-benchmark-core

🚀 High-performance RAG Benchmarking Suite for Milvus. Measures Latency...

15
Experimental
26 SK7Cosmo/rag-playground

A hands-on sandbox for experimenting with Retrieval-Augmented Generation...

14
Experimental
27 lilhuss26/ProofRAG

Compare RAG techniques: simple vs. proposition-based embedding, standard vs....

14
Experimental
28 Ri-yan/RAGForge

A generic, production-grade Retrieval-Augmented Generation pipeline exposed...

14
Experimental
29 Heron4gf/rag-notes

Manage clipboard and notes in a vector database

13
Experimental
30 naaas94/rag-light-demo

A local-first RAG demo that emphasizes production-grade patterns:...

11
Experimental
31 olneyjR/pace-wise-RAG

RAG system aggregating peer running experiences for marathon training guidance.

11
Experimental
32 gurre/chunker

Text chunking library for splitting strings into size-limited segments with overlap.

11
Experimental
33 gidea/chunkpad

Chunkpad is designed to prepare documents for Retrieval-Augmented Generation...

10
Experimental