JochenYang/luma-mcp

A multi-model visual understanding MCP server supporting GLM-4.6V, DeepSeek-OCR (free), Qwen3-VL-Flash, and others. It provides visual processing capabilities for AI coding models that do not support image understanding.

Score: 50 / 100 (Established)

Implements an MCP server with pluggable vision model backends (Zhipu, SiliconFlow, Qwen, Volcengine, Hunyuan), automatically handling image preprocessing including compression, multi-crop tiling for dense text scenes, and format normalization across local files, URLs, and Data URIs. Exposes a single `image_understand` tool that integrates with Claude Desktop, Cline, and Claude Code, with configurable thinking mode and adaptive cropping strategies optimized for code/UI/OCR screenshots.
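To make the "multi-crop tiling for dense text scenes" idea concrete, here is an illustrative TypeScript sketch (not the server's actual code) that splits a large screenshot into overlapping tiles no larger than a maximum size, so a vision model can read dense text at full resolution. The function name, tile size, and overlap are assumptions chosen for the example:

```typescript
// Axis-aligned crop rectangle within the source image.
interface Tile {
  x: number;
  y: number;
  width: number;
  height: number;
}

// Illustrative multi-crop tiling: cover a width x height image with tiles
// of at most tileSize pixels per side, overlapping by `overlap` pixels so
// text on tile boundaries is not cut in half.
function tileImage(
  width: number,
  height: number,
  tileSize = 1024,
  overlap = 64,
): Tile[] {
  const tiles: Tile[] = [];
  const step = tileSize - overlap;
  for (let y = 0; y < height; y += step) {
    for (let x = 0; x < width; x += step) {
      tiles.push({
        x,
        y,
        width: Math.min(tileSize, width - x),
        height: Math.min(tileSize, height - y),
      });
      if (x + tileSize >= width) break; // last column reached
    }
    if (y + tileSize >= height) break; // last row reached
  }
  return tiles;
}
```

A 1920x1080 screenshot yields a small grid of overlapping crops, each of which can be sent to the vision backend separately and the OCR results merged.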

Available on npm.
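A minimal Claude Desktop configuration sketch for running the server from npm. The package name `luma-mcp` and the environment-variable name are assumptions for illustration; check the repository README for the actual values:

```json
{
  "mcpServers": {
    "luma-mcp": {
      "command": "npx",
      "args": ["-y", "luma-mcp"],
      "env": {
        "ZHIPU_API_KEY": "your-api-key"
      }
    }
  }
}
```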

Maintenance: 10 / 25
Adoption: 8 / 25
Maturity: 22 / 25
Community: 10 / 25


Stars: 48
Forks: 5
Language: TypeScript
License: MIT
Last pushed: Mar 06, 2026
Commits (30d): 0
Dependencies: 4

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/mcp/JochenYang/luma-mcp"

Open to everyone: 100 requests/day with no key needed, or get a free key for 1,000 requests/day.