sakshamVerma08/MultiModal-RAG-Practice-

Multi-Modal RAG: Retrieval-Augmented Generation over Text and Visual PDFs A multi-modal RAG system capable of understanding and reasoning over PDFs containing both text and images. Combines LangChain, CLIP, and FAISS to extract textual content, encode visual features, and enable unified semantic retrieval for context-aware responses.

15
/ 100
Experimental
No Package No Dependents
Maintenance 6 / 25
Adoption 0 / 25
Maturity 9 / 25
Community 0 / 25

How are scores calculated?

Stars

Forks

Language

License

MIT

Last pushed

Oct 27, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/sakshamVerma08/MultiModal-RAG-Practice-"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.