Rahilshah01/multimodal-vision-ai-chat

A high-performance Multimodal AI Chatbot using Gemini 2.0 Flash to perform visual reasoning, OCR, and object detection. Features robust exponential backoff logic to handle API rate limits and token optimization for efficient image processing.

/ 100

Experimental

No License No Package No Dependents

Maintenance 10 / 25

Adoption 1 / 25

Maturity 1 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Jupyter Notebook

License

—

Category

multimodal-streamlit-apps

Last pushed

Mar 04, 2026

Commits (30d)

GitHub

Multimodal Streamlit Apps · 44 tools

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/Rahilshah01/multimodal-vision-ai-chat"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

stavrostheocharis/auto-streamlit-studio

AutoStreamlit Studio is an intelligent assistant designed to streamline the creation of...

dhineshaps/fetquest-genai

The FET Quest Model Portfolio Project created to explore the Generative AI and Incorporate the...

sitammeur/streamlit-app-builder

A Streamlit-based AI assistant generates custom Streamlit app code from user-provided images or...

grandelli/dfcx-geminiprovision

A Dialogflow CX implementation of a purely determistic agent (intent-based) integrated with...

sitammeur/PicQ

PicQ: Demo for MiniCPM-o 2.6 to answer questions about images using natural language.

Explore Generative AI Tools

All categories Trending Generative AI directory Insights