bilgeyucel/multimodal-agent-workshop
🖼️ Workshop: Build a multimodal AI agent with Haystack & GPT-4o — featuring image understanding, document retrieval, conversational memory, and human-in-the-loop safety controls
Implements a complete indexing and retrieval pipeline using CLIP embeddings for multimodal content, enabling the agent to search across both PDFs and images via RAG tools. Deploys as a production-ready API through Hayhooks integration, transforming the interactive notebook workflow into a callable service. Demonstrates practical safety patterns including human-in-the-loop approval gates for sensitive operations like expense reimbursement requests.
Stars
14
Forks
6
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Jan 30, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/bilgeyucel/multimodal-agent-workshop"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
arakoodev/EdgeChains
EdgeChains.js is Full-Stack GenAI library. Front-end, backend, apis, prompt management,...
xavierpuigf/virtualhome
API to run VirtualHome, a Multi-Agent Household Simulator
x-glacier/GenerativeAgentsCN
本项目为Generative Agents项目的重构+深度汉化版本,旨在为中文用户提供一个利于维护的基础版本,以便后续实验或功能拓展。
tmgthb/Autonomous-Agents
Autonomous Agents (LLMs) research papers. Updated Daily.
shadi-fsai/a1facts
a1facts - the precision layer for AI agents