azhwinraj/multimodal-rag-engine
Production-ready RAG system with multi-modal search across text, images, audio, and video. Built with LangChain, CLIP, vector databases, and LLMs for intelligent knowledge retrieval and question answering.
Stars
—
Forks
—
Language
—
License
MIT
Category
Last pushed
Oct 15, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/azhwinraj/multimodal-rag-engine"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
holisticon/multimodal-rag-demo
🧠🖼️📄 Multimodal RAG Demo based on Qwen3-VL Embedding and Reranker models
debanjan06/geospatial-rag
AI Framework for Remote Sensing Image Analysis using RAG - 88%+ accuracy, multi-modal queries,...
aimagelab/ReT
[CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval
berntpopp/phentrieve
AI-powered system for mapping clinical text to Human Phenotype Ontology (HPO) terms using...
hadil1999-creator/RAG_Hack_team
Our AI Financial Advisor is designed to revolutionize how users interact with financial and...