manushi4/Screenhand
Give AI eyes and hands on your desktop. Open-source MCP server for desktop automation — screenshots, UI control, browser automation, OCR. Works with Claude, Cursor, and any MCP client. macOS + Windows.
Uses native OS Accessibility APIs and the Chrome DevTools Protocol (CDP) for ~50ms action latency, with no LLM call required per interaction. A persistent "App Mastery Map" learns spatial UI blueprints and navigation patterns through normal tool usage. Includes 111 tools spanning desktop control, browser automation, smart fallbacks (Accessibility → CDP → OCR → coordinates), memory/learning systems, and multi-agent job orchestration. Prebuilt knowledge for 36+ applications and 49 automation playbooks is loaded automatically when an app is detected.
Available on npm.
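The fallback order named in the description (Accessibility → CDP → OCR → coordinates) can be pictured as an ordered chain in which each strategy is tried until one succeeds. The following is a minimal sketch of that idea only, not Screenhand's actual code: the `Strategy` interface, `runWithFallback`, and the `stub` helper are hypothetical names introduced for illustration, and the sketch is synchronous for brevity where real drivers would likely be asynchronous.

```typescript
// Hypothetical strategy names, taken from the fallback order in the description.
type StrategyName = "accessibility" | "cdp" | "ocr" | "coordinates";

// Hypothetical interface: one way of performing an action on a UI target.
interface Strategy {
  name: StrategyName;
  // Returns true if the action was performed, false if this strategy cannot handle it.
  tryAction(target: string): boolean;
}

interface FallbackResult {
  succeededWith: StrategyName | null; // null when every strategy failed
  attempted: StrategyName[];          // strategies tried, in order
}

// Try each strategy in order and stop at the first one that succeeds.
function runWithFallback(strategies: Strategy[], target: string): FallbackResult {
  const attempted: StrategyName[] = [];
  for (const strategy of strategies) {
    attempted.push(strategy.name);
    try {
      if (strategy.tryAction(target)) {
        return { succeededWith: strategy.name, attempted };
      }
    } catch {
      // A thrown error is treated like a miss; continue to the next strategy.
    }
  }
  return { succeededWith: null, attempted };
}

// Stub strategy for illustration only; it does not touch any real OS or browser API.
function stub(name: StrategyName, works: boolean): Strategy {
  return { name, tryAction: () => works };
}

// Example: the first two strategies miss, so OCR handles the action
// and the coordinate strategy is never attempted.
const chain: Strategy[] = [
  stub("accessibility", false),
  stub("cdp", false),
  stub("ocr", true),
  stub("coordinates", true),
];
const result = runWithFallback(chain, "Save button");
console.log(result.succeededWith, result.attempted);
```

Ordering the chain from the most structured source (the accessibility tree) to the least structured (raw coordinates) means the cheaper, more reliable lookups are attempted first, and the brittle ones are used only as a last resort.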
Stars: 16
Forks: 2
Language: TypeScript
License: AGPL-3.0
Category:
Last pushed: Mar 11, 2026
Commits (30d): 0
Dependencies: 4
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/mcp/manushi4/Screenhand"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
getsentry/XcodeBuildMCP
A Model Context Protocol (MCP) server and CLI that provides tools for agent use when working on...
carterlasalle/mac_messages_mcp
An MCP server that securely interfaces with your iMessage database via the Model Context...
kimsungwhee/apple-docs-mcp
MCP server for Apple Developer Documentation - Search iOS/macOS/SwiftUI/UIKit docs, WWDC videos,...
domdomegg/computer-use-mcp
💻 Give AI models complete control of your computer (probably a bad idea)
peakmojo/applescript-mcp
MCP server that executes AppleScript, giving you full control of your Mac