GetStream/Vision-Agents

Open Vision Agents by Stream. Build Vision Agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

88
/ 100
Verified

Features a pluggable processor pipeline for computer vision models (YOLO, Roboflow, PyTorch/ONNX) that run before LLM calls, plus native integrations with OpenAI Realtime, Gemini Live, and Claude for streaming AI responses. Includes voice integration via Deepgram/AssemblyAI for STT and ElevenLabs/Cartesia for TTS, turn detection with VAD, tool calling via MCP, and production-ready HTTP server with Prometheus metrics for horizontal scaling and Kubernetes deployment.

7,366 stars and 19,360 monthly downloads. Actively maintained with 55 commits in the last 30 days. Available on PyPI.

Maintenance 25 / 25
Adoption 20 / 25
Maturity 24 / 25
Community 19 / 25

How are scores calculated?

Stars

7,366

Forks

574

Language

Python

License

Apache-2.0

Last pushed

Mar 13, 2026

Monthly downloads

19,360

Commits (30d)

55

Dependencies

10

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/GetStream/Vision-Agents"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.