IDEA-Research/DINO-X-MCP
Official DINO-X Model Context Protocol (MCP) server that empowers LLMs with real-world visual perception through image object detection, localization, and captioning APIs.
The server uses the DINO-X and Grounding DINO models for fine-grained object detection and returns structured outputs (bounding boxes, counts, attributes) that support visual reasoning tasks. It supports two transport modes: STDIO for local deployment, with visualization support, and HTTP streaming for cloud deployment. It integrates with MCP-compatible clients (Cursor, Windsurf, etc.) and authenticates with an API key.
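To make the idea of structured outputs concrete, here is a minimal TypeScript sketch of how a client might represent and summarize detection results. The `Detection` shape, its field names, the bounding-box convention, and the `countByCategory` helper are all illustrative assumptions, not the actual DINO-X-MCP response schema.

```typescript
// Hypothetical shape of one detected object. Field names are assumptions
// made for illustration, not the server's documented schema.
interface Detection {
  category: string;
  // Bounding box as [xMin, yMin, xMax, yMax] in pixels (assumed convention).
  bbox: [number, number, number, number];
  attributes?: Record<string, string>;
}

// Count detections per category, e.g. to answer "how many dogs are there?".
function countByCategory(detections: Detection[]): Map<string, number> {
  const counts = new Map<string, number>();
  for (const d of detections) {
    counts.set(d.category, (counts.get(d.category) ?? 0) + 1);
  }
  return counts;
}

// Made-up sample data for demonstration only.
const sample: Detection[] = [
  { category: "dog", bbox: [10, 20, 110, 220], attributes: { color: "brown" } },
  { category: "dog", bbox: [150, 30, 260, 210] },
  { category: "ball", bbox: [300, 180, 340, 220] },
];

const counts = countByCategory(sample);
console.log(counts.get("dog"), counts.get("ball")); // prints: 2 1
```

Keeping counts derived from the detection list, rather than stored separately, means the two can never disagree.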
Stars: 113
Forks: 11
Language: TypeScript
License: Apache-2.0
Last pushed: Oct 28, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/mcp/IDEA-Research/DINO-X-MCP"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
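The same request can be made from code. The following is a minimal TypeScript sketch, assuming Node.js 18+ (for the global `fetch`) and keyless access. The base URL and path come from the curl command above; the helper names `buildQualityUrl` and `fetchQuality` are hypothetical, and the response is treated as opaque JSON because its schema is not described here.

```typescript
// Base URL and path are taken from the curl command above.
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality/mcp";

// Build the quality-data URL for a repository given its owner and name.
function buildQualityUrl(owner: string, repo: string): string {
  return `${API_BASE}/${encodeURIComponent(owner)}/${encodeURIComponent(repo)}`;
}

// Fetch the data without an API key. The result is typed as unknown
// because the response schema is not documented in this listing.
async function fetchQuality(owner: string, repo: string): Promise<unknown> {
  const res = await fetch(buildQualityUrl(owner, repo));
  if (!res.ok) {
    throw new Error(`Request failed with status ${res.status}`);
  }
  return res.json();
}
```

A call such as `fetchQuality("IDEA-Research", "DINO-X-MCP").then(console.log)` would print whatever JSON the endpoint returns. Checking `res.ok` matters here because a request over the daily limit is likely to come back as an error status rather than data.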
Higher-rated alternatives
raveenb/fal-mcp-server
MCP server for Fal.ai - Generate images, videos, music and audio with Claude
sunriseapps/imagesorcery-mcp
An MCP server providing tools for image processing operations
shinpr/mcp-image
MCP server for AI image generation and editing with automatic prompt optimization and quality...
glifxyz/glif-mcp-server
Easily run glif.app AI workflows inside your LLM: image generators, memes, selfies, and more....
joenorton/comfyui-mcp-server
lightweight Python-based MCP (Model Context Protocol) server for local ComfyUI