Document Chunking Embedding Pipelines Vector Databases
There are 65 document chunking embedding pipelines tools tracked. The highest-rated is Siddhant-K-code/distill at 46/100 with 136 stars.
Get all 65 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=vector-db&subcategory=document-chunking-embedding-pipelines&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
Siddhant-K-code/distill
Reliable LLM outputs start with clean context. Deterministic deduplication,... |
|
Emerging |
| 2 |
louisbrulenaudet/ragoon
High level library for batched embeddings generation, blazingly-fast... |
|
Emerging |
| 3 |
pesu-dev/ask-pesu
A RAG pipeline for question answering about PES University |
|
Emerging |
| 4 |
namtroi/RAGBase
Open Source RAG ETL Platform. Turns PDFs, Docs & Slides into queryable... |
|
Emerging |
| 5 |
B-A-M-N/FlockParser
Distributed document RAG system with intelligent GPU/CPU orchestration.... |
|
Emerging |
| 6 |
aws-samples/rag-with-amazon-postgresql-using-pgvector-and-sagemaker
Question Answering application with Large Language Models (LLMs) and Amazon... |
|
Emerging |
| 7 |
PerciValXIII/CAFB-food-wise-ai
AI-powered content automation tool for the Capital Area Food Bank (CAFB),... |
|
Experimental |
| 8 |
aws-samples/rag-with-amazon-opensearch-and-sagemaker
Question Answering Generative AI application with Large Language Models... |
|
Experimental |
| 9 |
CarlosManuelDiaz/rag-ready-extractor
Stop indexing noise. Turn messy websites and PDFs into clean, structured... |
|
Experimental |
| 10 |
libraryofcelsus/LLM_File_Parser
AutoML/Unstructured Data Processing for RAG and LLM Dataset Creation. ... |
|
Experimental |
| 11 |
Daddy-Myth/D-RAGon_System
Local Retrieval-Augmented Generation (RAG) system for PDF question answering... |
|
Experimental |
| 12 |
devangvyas-it/fastapi-rag-starter
Lightweight, self-contained RAG application built with FastAPI. It enables... |
|
Experimental |
| 13 |
mpessis/rag-doc-search
Semantic search over technical documentation using natural language. RAG... |
|
Experimental |
| 14 |
tuitige/fijian-rag-app
Public-benefit GenAI platform for the Fijian language — combining Claude +... |
|
Experimental |
| 15 |
Abdellatif404/Eigen-Field
A local Retrieval-Augmented Generation (RAG) system for agricultural... |
|
Experimental |
| 16 |
Amayes985-stack/Mimir
Privacy-first RAG pipeline application that transforms personal documents... |
|
Experimental |
| 17 |
himaenshuu/Multi_modal_rag-application
A powerful, easy-to-use platform for question answering over documents, web... |
|
Experimental |
| 18 |
noaman680/rag-from-scratch
Production-ready RAG (Retrieval Augmented Generation) system built from... |
|
Experimental |
| 19 |
bharghavaram/rag-knowledge-assistant
A lightweight Retrieval-Augmented Generation (RAG) system for answering... |
|
Experimental |
| 20 |
tanmay271/RAG-Qdrant-AI
High-performance RAG pipeline engineered to eliminate LLM hallucinations... |
|
Experimental |
| 21 |
neehanthreddym/doc_query_rag
A basic RAG pipeline which uses gpt-oss-20b model to answer the user query... |
|
Experimental |
| 22 |
pashpashpash/python-rag-scaffold
A comprehensive RAG FastAPI service that handles document uploads and... |
|
Experimental |
| 23 |
josephsenior/Microbione
Multimodal RAG system for microbiome data analysis with cross-modal search,... |
|
Experimental |
| 24 |
Ashish-Abraham/DocWhisperer-Qdrant
A Retrieval-Augmented Generation (RAG) System for PDF Chat using Qdrant... |
|
Experimental |
| 25 |
Debasish-87/rag-based-document-qa
rag-based-document-qa is a Retrieval-Augmented Generation (RAG) based... |
|
Experimental |
| 26 |
johnIT56/STAR-RAG
STAR-RAG is a self-reflective, retrieval-augmented question answering system... |
|
Experimental |
| 27 |
B-A-M-N/FlockParser-legacy
Legacy version of FlockParser PDF processing system |
|
Experimental |
| 28 |
RoodyCode/rag
A modular, self-hosted RAG pipeline for building a private, searchable... |
|
Experimental |
| 29 |
ajitsingh98/Building-RAG-System-with-Deepseek-R1-Locally
This repository contains an end-to-end Retrieval-Augmented Generation (RAG)... |
|
Experimental |
| 30 |
olexmal/ragu
RAGU - Retrieval-Augmented Generation Universal. A privacy-focused RAG... |
|
Experimental |
| 31 |
LEADisDEAD/Vector-Forge
Production-style Retrieval-Augmented Generation (RAG) system with... |
|
Experimental |
| 32 |
gurbaj5124871/rag-app-deepseek
A RAG (Retrieval-Augmented Generation) application which combines... |
|
Experimental |
| 33 |
SrijanShovit/HomeoRAG
A RAG application to search documents for homeopathic remedies based on... |
|
Experimental |
| 34 |
rithunkp/RAG-Codebase
Retrieval-Augmented Generation (RAG) assistant that lets users ask natural... |
|
Experimental |
| 35 |
RijuSaha-01/RAG-Document-Assistant-with-Azure-Cosmos-DB
A RAG pipeline implementation using Azure Cosmos DB (MongoDB vCore) and... |
|
Experimental |
| 36 |
smoothemerson/ragscope
Q&A over documents using RAG (FastAPI + ChromaDB + Ollama + MLflow) |
|
Experimental |
| 37 |
RAK0152/doc-watch-rag
Async document watcher that keeps your RAG index hot. Automatically ingests... |
|
Experimental |
| 38 |
Mohamed-samy2/Arabic-Islamic-Assessment
This repository implements a compact, efficient Retrieval-Augmented... |
|
Experimental |
| 39 |
ramyasri-m/RAG_Property_Document_Pipeline
A RAG pipeline for property documents using Weaviate, sentence-transformers,... |
|
Experimental |
| 40 |
Boney-massiveness357/ragscope
Build a Q&A API that indexes PDFs and text using RAG, logging queries with... |
|
Experimental |
| 41 |
ashankgupta/rag-flow
A visual, node-based RAG (Retrieval-Augmented Generation) pipeline builder... |
|
Experimental |
| 42 |
razevedo1994/paper-rag-pipeline
A complete RAG ingestion pipeline for scientific papers. |
|
Experimental |
| 43 |
QuantumDrizzy/rag-scientific-papers
Full RAG pipeline over 30 seminal AI/ML papers · FAISS vector store · ReAct... |
|
Experimental |
| 44 |
felix-dowl/ResearchPal
Basic RAG pipeline for uploading documents and making natural language queries |
|
Experimental |
| 45 |
Abs01ute000/policymind-rag-showcase
Semantic search and RAG showcase built with FastAPI, ChromaDB,... |
|
Experimental |
| 46 |
Vaibhavii3/AI-Knowlendge-Base-RAG
Built a Retrieval-Augmented Generation system that allows users to upload... |
|
Experimental |
| 47 |
Selam1431/Rag-Document-Search
AI-powered document search system using Retrieval-Augmented Generation (RAG)... |
|
Experimental |
| 48 |
alunoshacker-beep/ragscope
Build an offline Q&A API using RAG to query PDFs and texts, with automated... |
|
Experimental |
| 49 |
tahamohmadf19-dev/rag-document-search
Document search with retrieval-augmented generation using FastAPI, Qdrant... |
|
Experimental |
| 50 |
jy02140251/rag-document-loader
Load documents for RAG pipelines: PDF, DOCX, HTML, Markdown. Smart chunking,... |
|
Experimental |
| 51 |
srinivas-sateesh/RAG-query-classifier
Smart Query Classifier to earn user trust and save $$$ |
|
Experimental |
| 52 |
shubham5027/RAG-Qwen-2.5-72b-instruct
I built a production-style RAG system focused on grounded generation, not... |
|
Experimental |
| 53 |
sjlewis25/rag-pipeline
Hybrid RAG pipeline with local/cloud LLM support for semantic document... |
|
Experimental |
| 54 |
ankit123nag/pdf-rag-assistant
Production-grade RAG backend for document ingestion and semantic retrieval... |
|
Experimental |
| 55 |
PrinceKay145/multiDocRAG
Multi-Document RAG System with source attribution and query logging |
|
Experimental |
| 56 |
raza242k5-sys/rag-ai-system
Retrieval-Augmented Generation (RAG) based Intelligent QA System using... |
|
Experimental |
| 57 |
Powerostad/talk_to_github
A Retrieval-Augmented Generation (RAG) system enabling natural language... |
|
Experimental |
| 58 |
DRJ-14/context-aware-email-assistant-RAG
RAG system to query Gmail Takeout (.mbox) with semantic search + local LLM... |
|
Experimental |
| 59 |
thendralmagudapathi/RAG-for-NCERT
A professional-grade Retrieval-Augmented Generation (RAG) system designed... |
|
Experimental |
| 60 |
GowriPriyanka27/adaptive-rag-auto-optimizer
Adaptive Retrieval-Augmented Generation (RAG) system with dynamic... |
|
Experimental |
| 61 |
bijay-odyssey/Personal-Knowledge-Base-RAG-API
Personal Knowledge Base RAG API – FastAPI-based RAG system for querying... |
|
Experimental |
| 62 |
AbhashK1/Verbo
RAG based document query system that performs OCR(Tesseract) for text... |
|
Experimental |
| 63 |
Farhaj499/RAG_with_Weaviate_DB
This project implements a Retrieval Augmented Generation (RAG) system that... |
|
Experimental |
| 64 |
tolios/XPL
A simple cli tool for RAG on documents |
|
Experimental |
| 65 |
daviaraujocc/rag-docs
A simple project about implementing RAG (Retrieval-Augmented Generation) for... |
|
Experimental |