Document Processing Platforms (Vector Databases)

Tools for converting, parsing, and indexing unstructured documents (PDFs, Word, PowerPoint, images, audio) into searchable, queryable formats with vector embeddings and semantic search. Does NOT include general chatbots, code documentation generators, or vector database infrastructure itself.

There are 33 document processing platform tools tracked. The highest-rated is AmadeusITGroup/docs2vecs, at 40/100 with 6 stars.

Get all 33 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=vector-db&subcategory=document-processing-platforms&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
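For scripted access, the same request can be sketched in Python with only the standard library. The endpoint and query parameters are copied from the curl command above; the helper names `build_url` and `fetch_projects` are illustrative, and the response schema is an unknown here, so the parsed JSON is returned without being unpacked.

```python
import json
import urllib.parse
import urllib.request

# Endpoint copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="vector-db",
              subcategory="document-processing-platforms",
              limit=20):
    """Rebuild the query string used in the curl example (hypothetical helper)."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """GET the URL and decode the body as JSON (hypothetical helper).

    The response schema is not documented in this listing, so the
    parsed value is returned unchanged rather than unpacked.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_projects(build_url())` issues the same request as the curl command. Note that the example URL passes `limit=20` while the listing has 33 entries; whether the API accepts a larger limit is not stated here, so `build_url(limit=33)` is an untested assumption. With 100 keyless requests per day, it is worth caching the result locally rather than refetching.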

# Tool Score Tier
1 AmadeusITGroup/docs2vecs

CLI that helps with docs splitting, embedding and exposing them in a seamless manner

40
Emerging
2 in-c0/updAPI

Free, open-source collection of latest public API documentations - Update...

35
Emerging
3 AlexisBalayre/RagDocs

An AI-powered search engine to interact with documentation using RAG and...

27
Experimental
4 CodebyKumar/QueryWise

AI Document assistant

25
Experimental
5 dhruvkshah75/docstream

Turn static PDF archives into an interactive, searchable AI knowledge base

25
Experimental
6 LikithMeruvu/Framework-Docs-AI

Framework Docs AI is a powerful SaaS solution for managing framework...

23
Experimental
7 lh0x00/docsifer

Docsifer is a powerful tool for converting various data formats into...

22
Experimental
8 Surya-Hariharan/DocuQueryAI

Built for HackRx 6.0 – Bajaj Finserv’s Annual Hackathon, this backend system...

22
Experimental
9 humanhady/DocMine

📄 Transform documents into queryable knowledge with exact recall and entity...

22
Experimental
10 SOHAIL-IQB/DocQuerry

AI-powered document Q&A platform built with a Retrieval-Augmented Generation...

22
Experimental
11 AvishkaGihan/documind-ai

🧠 Secure AI Assistant to chat with your documents. Isolated vector data...

22
Experimental
12 Milesdexter/docschat-rag

📚 Enhance technical documentation queries with DocsChat RAG, a robust...

22
Experimental
13 existential-birds/pearl

Open-source DeepWiki alternative: AI-generated documentation and natural...

20
Experimental
14 jlorenzo681/documind

DocuMind is an intelligent document processing platform that uses 6...

19
Experimental
15 bcfeen/DocMine

Knowledge-centric document ingestion with stable IDs, provenance, entities,...

17
Experimental
16 imtiaj-007/docugenie-backend

Your AI genie for PDFs and documents - A FastAPI backend service for...

16
Experimental
17 heyshivamjaiswal/Folio

RAG-powered knowledge library - save articles, YouTube videos, PDFs & text,...

15
Experimental
18 AbdulRehman393/DocuMind-Nexus

🧠 DocuMind Nexus - A docs-first RAG assistant (FastAPI + Streamlit +...

15
Experimental
19 Janmesh23/sidequest

SideQuest is an AI assistant that helps query big documents/pdfs/files with...

14
Experimental
20 MITHILESHK11/IntelProject

Intel Nexus – An enterprise-grade document intelligence platform that...

12
Experimental
21 AIAfterDark/AI-URL-Read

A Python-based documentation assistant that uses local LLMs to crawl...

12
Experimental
22 hishamzargar/DocumentAgent

AI agent API (Python/FastAPI) to upload documents (PDF/TXT) and answer...

12
Experimental
23 pritom169/documind-ai

AI-powered document analysis platform with multi-agent RAG, hybrid vector...

11
Experimental
24 oaht9412/DocSage

Process unstructured documents intelligently using DocSage, a serverless...

11
Experimental
25 Thamizh0206/DocuMind-AI

DocuMind AI is a RAG-powered application that lets users chat with multiple...

11
Experimental
26 dineshjsd/smart-doc-ai

A production-ready RAG system built with Next.js and Node.js. Uses MongoDB...

11
Experimental
27 y-randhal/smartdoc-analyst

RAG (Retrieval Augmented Generation) app for document analysis. Upload PDFs,...

11
Experimental
28 rishirochan/DocVaultAI

A privacy-centric document intelligence platform designed for secure, local...

11
Experimental
29 rajj28/DocPilot

🤖 AI-powered browser extension that summarizes documentation pages and...

11
Experimental
30 ChanikyaSaiL/AI-Document-Search

This project is an AI-powered Document Intelligence System that enables...

11
Experimental
31 franjofranjic27/knomi

knomi is a CLI tool that indexes your documents into a vector database and...

11
Experimental
32 emanuelrechsteiner/DocScraper

Documentation Scraper & Post-Processor

11
Experimental
33 noelmarior/arivagam-cloud-rag

A Full Stack, RAG application which acts as a workspace for students to...

11
Experimental