elizabethsiegle/gemini-multimodal-chat
Multimodal Chat with Gemini API
Streamlit-based chat interface combining LangChain's message abstraction with Google's Generative AI SDK to enable image and text inputs in conversation history. Uses a custom multimodal chat input component to capture mixed-media user inputs, routing them through LangChain's Google GenAI integration for unified prompt handling.
No commits in the last 6 months.
Stars
47
Forks
4
Language
Python
License
—
Category
Last pushed
Dec 25, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/elizabethsiegle/gemini-multimodal-chat"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
itbanque/talk2dom
Locate web elements using natural language. Powered by LLM for reliable UI automation.
gabrielchua/repo-explainer
Chat with a repo by adding the entire repo to gemini 1.5 pro's 1M context window 🔥
Gen-XR/TheiaEngine
All in one API to serve all Vision AI task
EternityYW/Gemini-Commonsense-Evaluation
Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large...
Bramitha-gowda-M/LLM-projects-using-Gemini-Pro
End to End Large Language Model projects using Gemini pro API for test and Gemini pro vision for...