GetStream/Vision-Agents
Open Vision Agents by Stream. Build Vision Agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.
Features a pluggable processor pipeline for computer vision models (YOLO, Roboflow, PyTorch/ONNX) that run before LLM calls, plus native integrations with OpenAI Realtime, Gemini Live, and Claude for streaming AI responses. Includes voice integration via Deepgram/AssemblyAI for speech-to-text (STT) and ElevenLabs/Cartesia for text-to-speech (TTS), turn detection with voice activity detection (VAD), tool calling via MCP, and a production-ready HTTP server with Prometheus metrics for horizontal scaling and Kubernetes deployment.
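The idea of processors that run before the LLM call can be illustrated with a minimal, generic sketch. This is not the Vision Agents API: `ProcessorPipeline`, `detect_objects`, `estimate_brightness`, and `build_prompt` are hypothetical names, and the "models" are stand-in functions that read precomputed values from a dictionary rather than running YOLO or ONNX.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List

# Hypothetical types for illustration only; not taken from Vision Agents.
Frame = Dict[str, Any]
Processor = Callable[[Frame], Dict[str, Any]]


@dataclass
class ProcessorPipeline:
    """Runs each processor on a frame and merges their outputs into one context."""

    processors: List[Processor] = field(default_factory=list)

    def add(self, processor: Processor) -> "ProcessorPipeline":
        self.processors.append(processor)
        return self  # returning self allows chained .add() calls

    def run(self, frame: Frame) -> Dict[str, Any]:
        context: Dict[str, Any] = {}
        for processor in self.processors:
            context.update(processor(frame))
        return context


def detect_objects(frame: Frame) -> Dict[str, Any]:
    # Stand-in for an object detector: echoes labels already stored in the frame.
    return {"objects": list(frame.get("labels", []))}


def estimate_brightness(frame: Frame) -> Dict[str, Any]:
    # Stand-in for a second vision step: mean of the pixel values, 0.0 if empty.
    pixels = frame.get("pixels", [])
    return {"brightness": sum(pixels) / len(pixels) if pixels else 0.0}


def build_prompt(question: str, context: Dict[str, Any]) -> str:
    # The processor output is turned into text that accompanies the LLM request.
    objects = ", ".join(context.get("objects", [])) or "none"
    return f"Detected objects: {objects}. Question: {question}"


if __name__ == "__main__":
    pipeline = ProcessorPipeline().add(detect_objects).add(estimate_brightness)
    ctx = pipeline.run({"labels": ["person", "dog"], "pixels": [0, 128, 255]})
    print(build_prompt("What is in the frame?", ctx))
```

Keeping each processor a plain callable that returns a dictionary is one simple way to make the steps swappable; the real project may structure this differently.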
7,366 stars and 19,360 monthly downloads. Actively maintained with 55 commits in the last 30 days. Available on PyPI.
Stars: 7,366
Forks: 574
Language: Python
License: Apache-2.0
Category:
Last pushed: Mar 13, 2026
Monthly downloads: 19,360
Commits (30d): 55
Dependencies: 10
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/GetStream/Vision-Agents"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
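The same request can be made from Python. The sketch below uses only the standard library and the URL pattern from the curl command above. It assumes the endpoint returns JSON, which this page does not confirm, and it makes no assumption about the response fields. How an API key is passed is not stated here, so the sketch covers only the keyless case. `quality_url` and `fetch_quality` are illustrative names.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/agents"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for a repository, percent-encoding each path segment."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the data for one repository.

    Assumption: the response body is JSON. If it is not, json.loads raises
    a ValueError and the caller should inspect the raw body instead.
    """
    request = urllib.request.Request(
        quality_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(quality_url("GetStream", "Vision-Agents"))
```

With a limit of 100 keyless requests per day, it is worth caching the result locally rather than calling `fetch_quality` on every run.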
Related agents
video-db/videodb-capture-quickstart
Give your agents real time desktop perception. Stream screen, microphone, and system audio for...
grctest/g3n-fastapi-webcam-docker
Utilizing multiple Gemma 3n agents to analyze webcam footage
Karmacoke/chargen
AI-powered character generator built with React. Create detailed TRPG/Novel characters, NPC...
leukaemiamedtech/hias-tassai-facial-recognition
HIAS TassAI Facial Recognition Agent processes streams from local or remote cameras to identify...
TheSethRose/AI-File-Organizer-Agent
Uses an AI agent (powered by Google Gemini via the Agno framework) to intelligently propose and...