X-PLUG/MobileAgent
Mobile-Agent: The Powerful GUI Agent Family
Implements multimodal vision-language models (GUI-Owl series: 2B-235B parameters) optimized for GUI perception and grounding across desktop, mobile, and browser environments using Qwen3-VL backbone. The agentic framework layers planning, reflection, memory management, and tool/MCP calling on top of vision capabilities, enabling end-to-end task automation across platforms. Achieves state-of-the-art on 20+ GUI benchmarks including OSWorld and AndroidWorld through semi-online RL fine-tuning and native multi-platform support.
8,242 stars. Actively maintained with 25 commits in the last 30 days.
Stars
8,242
Forks
825
Language
Python
License
MIT
Category
Last pushed
Mar 09, 2026
Commits (30d)
25
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/X-PLUG/MobileAgent"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related agents
modelscope/ms-agent
MS-Agent: a lightweight framework to empower agentic execution of complex tasks
github/gh-aw
GitHub Agentic Workflows
study8677/antigravity-workspace-template
🪐 The ultimate starter kit for AI IDEs, Claude code,codex, and other agentic coding environments.
EtienneLescot/n8n-as-code
Give your AI agent n8n superpowers. 537 nodes with full schemas, 7,700+ templates, Git-like...
jonathan-vella/azure-agentic-infraops
Agentic InfraOps transforms Azure deployments for IT Pros. Using GitHub Copilot and AI agents,...