Vision-Agents and visionagent

These are competitors offering similar multi-provider vision agent frameworks, though GetStream's production-ready platform with Stream's edge infrastructure and established adoption significantly outpaces the nascent, type-safe alternative.

Vision-Agents
88
Verified
visionagent
20
Experimental
Maintenance 25/25
Adoption 20/25
Maturity 24/25
Community 19/25
Maintenance 10/25
Adoption 1/25
Maturity 9/25
Community 0/25
Stars: 7,366
Forks: 574
Downloads: 19,360
Commits (30d): 55
Language: Python
License: Apache-2.0
Stars: 1
Forks:
Downloads:
Commits (30d): 0
Language: TypeScript
License: MIT
No risk flags
No Package No Dependents

About Vision-Agents

GetStream/Vision-Agents

Open Vision Agents by Stream. Build Vision Agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

Features a pluggable processor pipeline for computer vision models (YOLO, Roboflow, PyTorch/ONNX) that run before LLM calls, plus native integrations with OpenAI Realtime, Gemini Live, and Claude for streaming AI responses. Includes voice integration via Deepgram/AssemblyAI for STT and ElevenLabs/Cartesia for TTS, turn detection with VAD, tool calling via MCP, and production-ready HTTP server with Prometheus metrics for horizontal scaling and Kubernetes deployment.

About visionagent

sijeeshmiziha/visionagent

Multi-provider AI agent framework with vision capabilities and tool calling. Supports OpenAI, Anthropic, Google. Built-in Figma tools and Google Stitch integration. Type-safe with Zod validation.

Related comparisons

Scores updated daily from GitHub, PyPI, and npm data. How scores work