alibaba/page-agent

JavaScript in-page GUI agent. Control web interfaces with natural language.

Score: 88 / 100 (Verified)

Operates entirely client-side using text-based DOM analysis rather than screenshots, eliminating the need for multi-modal LLMs or external infrastructure like browser extensions or headless browsers. Integrates with any LLM via a standard API interface, and optionally extends to multi-page automation through a Chrome extension and MCP Server for external agent control.
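To make the idea of text-based DOM analysis concrete, here is a minimal sketch of reducing a page to a short, numbered list of interactive elements that a text-only LLM could read and refer to by index. It is written in Python against an HTML string purely for illustration. The tag list, the line format, and the names `InteractiveElementIndexer` and `dom_to_text` are assumptions made for this example and are not page-agent's actual implementation or API.

```python
from html.parser import HTMLParser

# Tags treated as interactive in this sketch (an assumption, not page-agent's list).
INTERACTIVE_TAGS = {"a", "button", "input", "select", "textarea"}
# Interactive tags that never have a closing tag or text content.
VOID_TAGS = {"input"}


class InteractiveElementIndexer(HTMLParser):
    """Collects interactive elements as short, numbered text lines."""

    def __init__(self):
        super().__init__()
        self.lines = []
        self._open = None  # (tag, attribute dict, text chunks) of the element being read

    def handle_starttag(self, tag, attrs):
        if tag not in INTERACTIVE_TAGS:
            return
        attr_map = dict(attrs)
        if tag in VOID_TAGS:
            self._emit(tag, attr_map, "")
        else:
            self._open = (tag, attr_map, [])

    def handle_data(self, data):
        # Gather visible text, including text nested in non-interactive children.
        if self._open is not None:
            self._open[2].append(data.strip())

    def handle_endtag(self, tag):
        if self._open is not None and self._open[0] == tag:
            tag_name, attr_map, chunks = self._open
            self._open = None
            self._emit(tag_name, attr_map, " ".join(c for c in chunks if c))

    def _emit(self, tag, attr_map, text):
        # Prefer visible text, then fall back to descriptive attributes.
        label = text or attr_map.get("placeholder") or attr_map.get("name") or ""
        self.lines.append(f"[{len(self.lines)}] <{tag}> {label}".rstrip())


def dom_to_text(html):
    """Return one numbered line per interactive element found in the HTML."""
    parser = InteractiveElementIndexer()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.lines)
```

An agent built along these lines would send the resulting text to the model together with the user's instruction, then map the index the model picks back to the real element. Because the model only ever sees text, no screenshot or multi-modal model is involved.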

6,693 stars and 27,541 monthly downloads. Actively maintained with 217 commits in the last 30 days. Available on npm.

Maintenance 25 / 25
Adoption 20 / 25
Maturity 24 / 25
Community 19 / 25


Stars 6,693
Forks 516
Language TypeScript
License MIT
Last pushed Mar 12, 2026
Monthly downloads 27,541
Commits (30d) 217
Dependencies 5

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/alibaba/page-agent"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
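The same request can be made from a script. The sketch below uses only the Python standard library and the endpoint path shown in the curl command. It assumes the response body is JSON, which the page does not state, and the helper names `build_quality_url` and `fetch_quality` are made up for this example. No response fields are assumed.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/agents"


def build_quality_url(owner, repo):
    """Build the quality endpoint URL for an owner/repo pair."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner, repo, timeout=10.0):
    """Fetch the quality record; assumes the endpoint returns JSON."""
    request = urllib.request.Request(
        build_quality_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("alibaba", "page-agent")` issues one request, so with the keyless limit of 100 requests/day it is worth storing the result locally rather than fetching it on every run.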