AnithaKarre/multimodel_RAG

Multimodal RAG pipeline that ingests PDFs, Word docs, CSVs, Excel files, and embedded images (via Tesseract OCR). Chunks and embeds content with Google Generative AI, indexes in FAISS, and generates grounded answers using Groq LLaMA 3 — with image path attribution.
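The chunk, embed, index, and retrieve flow described above can be sketched roughly as follows. This is not the repository's code: it uses only the standard library, a toy bag-of-words vector stands in for Google Generative AI embeddings, and a brute-force cosine scan stands in for FAISS. The names `Chunk`, `chunk_text`, `retrieve`, and `build_prompt`, and the sample file paths, are hypothetical.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass
from typing import Optional


@dataclass
class Chunk:
    text: str
    source: str                       # file the text came from
    image_path: Optional[str] = None  # set when the text came from OCR on an image


def chunk_text(text, source, size=40, overlap=10, image_path=None):
    """Split text into overlapping windows of `size` words."""
    if overlap >= size:
        raise ValueError("overlap must be smaller than size")
    words = text.split()
    chunks = []
    for start in range(0, max(len(words), 1), size - overlap):
        window = words[start:start + size]
        if window:
            chunks.append(Chunk(" ".join(window), source, image_path))
        if start + size >= len(words):
            break
    return chunks


def embed(text):
    """Toy bag-of-words vector (assumption: a real pipeline calls an embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query, index, k=2):
    """Brute-force top-k search over (vector, chunk) pairs; a vector index would replace this scan."""
    q = embed(query)
    ranked = sorted(index, key=lambda item: cosine(q, item[0]), reverse=True)
    return [chunk for _, chunk in ranked[:k]]


def build_prompt(query, chunks):
    """Assemble a grounded prompt that carries source and image path attribution."""
    lines = ["Answer using only the context below.", ""]
    for i, c in enumerate(chunks, 1):
        origin = c.source if c.image_path is None else f"{c.source}, image: {c.image_path}"
        lines.append(f"[{i}] ({origin}) {c.text}")
    lines += ["", f"Question: {query}"]
    return "\n".join(lines)


# Hypothetical sample data: one text passage and one OCR'd figure caption.
chunks = (
    chunk_text("Quarterly revenue grew twelve percent driven by cloud subscriptions.", "report.pdf")
    + chunk_text("Figure 3 bar chart revenue by region Europe leads", "report.pdf",
                 image_path="images/report_p4_img1.png")
)
index = [(embed(c.text), c) for c in chunks]
top = retrieve("revenue by region", index, k=1)
prompt = build_prompt("Which region leads revenue?", top)
```

In a pipeline like the one described, `prompt` would then be sent to the LLM, and the image path in the context line is what lets the answer point back to the originating image.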

/ 100

Experimental

No License · No Package · No Dependents

Maintenance 13 / 25

Adoption 0 / 25

Maturity 1 / 25

Community 0 / 25


Stars: —
Forks: —
Language: Python
License: —
Category: multimodal-rag-systems
Last pushed: Mar 15, 2026

Commits (30d)

GitHub

Multimodal RAG Systems · 98 tools

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/AnithaKarre/multimodel_RAG"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
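For scripted use, the same request can be made from Python with the standard library. This is a minimal sketch: the URL pattern is taken from the curl example above, while the assumption that the response body is JSON, and the helper names `quality_url` and `fetch_quality`, are not confirmed by this page.

```python
import json
import urllib.parse
import urllib.request

# Base path copied from the curl example; owner and repo are appended to it.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/rag"


def quality_url(owner, repo):
    """Build the endpoint URL for one repository."""
    return f"{BASE_URL}/{urllib.parse.quote(owner)}/{urllib.parse.quote(repo)}"


def fetch_quality(owner, repo, timeout=10.0):
    """GET the record and parse it (assumption: the body is JSON)."""
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_quality("AnithaKarre", "multimodel_RAG")` issues one request, which counts against the 100 requests/day keyless limit.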

Higher-rated alternatives

AnswerDotAI/byaldi

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

illuin-tech/colpali

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

jolibrain/colette

Multimodal RAG to search and interact locally with technical documents of any kind

nannib/nbmultirag

A framework in Italian and English that lets you chat with your own documents using RAG,...

OpenBMB/VisRAG

Parsing-free RAG supported by VLMs
