aiagentwithdhruv/multimodal-rag
Production-grade Multimodal RAG System — ingest text, PDFs, images, audio, and video into a unified vector space. Ask questions across any modality with streamed AI answers and source citations.
Stars
1
Forks
—
Language
Python
License
—
Category
Last pushed
Mar 14, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/aiagentwithdhruv/multimodal-rag"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
holisticon/multimodal-rag-demo
🧠🖼️📄 Multimodal RAG Demo based on Qwen3-VL Embedding and Reranker models
debanjan06/geospatial-rag
AI Framework for Remote Sensing Image Analysis using RAG - 88%+ accuracy, multi-modal queries,...
aimagelab/ReT
[CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval
berntpopp/phentrieve
AI-powered system for mapping clinical text to Human Phenotype Ontology (HPO) terms using...
hadil1999-creator/RAG_Hack_team
Our AI Financial Advisor is designed to revolutionize how users interact with financial and...