X-PLUG/MobileAgent

Mobile-Agent: The Powerful GUI Agent Family

69
/ 100
Established

Implements multimodal vision-language models (GUI-Owl series: 2B-235B parameters) optimized for GUI perception and grounding across desktop, mobile, and browser environments using Qwen3-VL backbone. The agentic framework layers planning, reflection, memory management, and tool/MCP calling on top of vision capabilities, enabling end-to-end task automation across platforms. Achieves state-of-the-art on 20+ GUI benchmarks including OSWorld and AndroidWorld through semi-online RL fine-tuning and native multi-platform support.

8,242 stars. Actively maintained with 25 commits in the last 30 days.

No Package No Dependents
Maintenance 23 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 20 / 25

How are scores calculated?

Stars

8,242

Forks

825

Language

Python

License

MIT

Last pushed

Mar 09, 2026

Commits (30d)

25

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/X-PLUG/MobileAgent"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.