OpenAdaptAI/OmniMCP
OmniMCP uses Microsoft OmniParser and Model Context Protocol (MCP) to provide AI models with rich UI context and powerful interaction capabilities.
Implements a perceive-plan-act loop that captures screenshots, parses UI elements with OmniParser, generates action plans via Claude/LLM, and executes mouse/keyboard interactions through `pynput`. Supports optional auto-deployment of OmniParser to AWS EC2 with cost management, and generates timestamped visual debugging artifacts for each agent step. Targets autonomous UI automation and agent-based task execution across arbitrary desktop applications.
No commits in the last 6 months.
Stars
71
Forks
16
Language
Python
License
—
Category
Last pushed
Apr 08, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/mcp/OpenAdaptAI/OmniMCP"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jonigl/mcp-client-for-ollama
A text-based user interface (TUI) client for interacting with MCP servers using Ollama. Features...
ArcadeAI/arcade-mcp
The best way to create, deploy, and share MCP Servers
hmldns/nautex
MCP server for guiding Coding Agents via end-to-end requirements to implementation plan pipeline
Dicklesworthstone/ultimate_mcp_server
Comprehensive MCP server exposing dozens of capabilities to AI agents: multi-provider LLM...
SecretiveShell/MCP-Bridge
A middleware to provide an openAI compatible endpoint that can call MCP tools