Document Processing Platforms Vector Databases
Tools for converting, parsing, and indexing unstructured documents (PDFs, Word, PowerPoint, images, audio) into searchable, queryable formats with vector embeddings and semantic search. Does NOT include general chatbots, code documentation generators, or vector database infrastructure itself.
There are 33 document processing platforms tools tracked. The highest-rated is AmadeusITGroup/docs2vecs at 40/100 with 6 stars.
Get all 33 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=vector-db&subcategory=document-processing-platforms&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
AmadeusITGroup/docs2vecs
CLI that helps with docs splitting, embedding and exposing them in a seamless manner |
|
Emerging |
| 2 |
in-c0/updAPI
Free, open-source collection of latest public API documentations - Update... |
|
Emerging |
| 3 |
AlexisBalayre/RagDocs
An AI-powered search engine to interact with documentation using RAG and... |
|
Experimental |
| 4 |
CodebyKumar/QueryWise
AI Document assistant |
|
Experimental |
| 5 |
dhruvkshah75/docstream
Turn static PDF archives into an interactive, searchable AI knowledge base |
|
Experimental |
| 6 |
LikithMeruvu/Framework-Docs-AI
Framework Docs AI is a powerful SaaS solution for managing framework... |
|
Experimental |
| 7 |
lh0x00/docsifer
Docsifer is a powerful tool for converting various data formats into... |
|
Experimental |
| 8 |
Surya-Hariharan/DocuQueryAI
Built for HackRx 6.0 β Bajaj Finservβs Annual Hackathon, this backend system... |
|
Experimental |
| 9 |
humanhady/DocMine
π Transform documents into queryable knowledge with exact recall and entity... |
|
Experimental |
| 10 |
SOHAIL-IQB/DocQuerry
AI-powered document Q&A platform built with a Retrieval-Augmented Generation... |
|
Experimental |
| 11 |
AvishkaGihan/documind-ai
π§ Secure AI Assistant to chat with your documents. Isolated vector data... |
|
Experimental |
| 12 |
Milesdexter/docschat-rag
π Enhance technical documentation queries with DocsChat RAG, a robust... |
|
Experimental |
| 13 |
existential-birds/pearl
Open-source DeepWiki alternative: AI-generated documentation and natural... |
|
Experimental |
| 14 |
jlorenzo681/documind
DocuMind is an intelligent document processing platform that uses 6... |
|
Experimental |
| 15 |
bcfeen/DocMine
Knowledge-centric document ingestion with stable IDs, provenance, entities,... |
|
Experimental |
| 16 |
imtiaj-007/docugenie-backend
Your AI genie for PDFs and documents - A FastAPI backend service for... |
|
Experimental |
| 17 |
heyshivamjaiswal/Folio
RAG-powered knowledge library β save articles, YouTube videos, PDFs & text,... |
|
Experimental |
| 18 |
AbdulRehman393/DocuMind-Nexus
π§ DocuMind Nexus β A docs-first RAG assistant (FastAPI + Streamlit +... |
|
Experimental |
| 19 |
Janmesh23/sidequest
SideQuest is an AI assistant that helps query big documents/pdfs/files with... |
|
Experimental |
| 20 |
MITHILESHK11/IntelProject
Intel Nexus β An enterprise-grade document intelligence platform that... |
|
Experimental |
| 21 |
AIAfterDark/AI-URL-Read
A Python-based documentation assistant that uses local LLMs to crawl... |
|
Experimental |
| 22 |
hishamzargar/DocumentAgent
AI agent API (Python/FastAPI) to upload documents (PDF/TXT) and answer... |
|
Experimental |
| 23 |
pritom169/documind-ai
AI-powered document analysis platform with multi-agent RAG, hybrid vector... |
|
Experimental |
| 24 |
oaht9412/DocSage
Process unstructured documents intelligently using DocSage, a serverless... |
|
Experimental |
| 25 |
Thamizh0206/DocuMind-AI
DocuMind AI is a RAG-powered application that lets users chat with multiple... |
|
Experimental |
| 26 |
dineshjsd/smart-doc-ai
A production-ready RAG system built with Next.js and Node.js. Uses MongoDB... |
|
Experimental |
| 27 |
y-randhal/smartdoc-analyst
RAG (Retrieval Augmented Generation) app for document analysis. Upload PDFs,... |
|
Experimental |
| 28 |
rishirochan/DocVaultAI
A privacy-centric document intelligence platform designed for secure, local... |
|
Experimental |
| 29 |
rajj28/DocPilot
π€ AI-powered browser extension that summarizes documentation pages and... |
|
Experimental |
| 30 |
ChanikyaSaiL/AI-Document-Search
This project is an AI-powered Document Intelligence System that enables... |
|
Experimental |
| 31 |
franjofranjic27/knomi
knomi is a CLI tool that indexes your documents into a vector database and... |
|
Experimental |
| 32 |
emanuelrechsteiner/DocScraper
Documentation Scraper & Post-Processor |
|
Experimental |
| 33 |
noelmarior/arivagam-cloud-rag
A Full Stack, RAG application which acts as a workspace for students to... |
|
Experimental |