alibaba/page-agent
JavaScript in-page GUI agent. Control web interfaces with natural language.
Operates entirely client-side using text-based DOM analysis rather than screenshots, eliminating the need for multi-modal LLMs or external infrastructure like browser extensions or headless browsers. Integrates with any LLM via a standard API interface, and optionally extends to multi-page automation through a Chrome extension and MCP Server for external agent control.
6,693 stars and 27,541 monthly downloads. Actively maintained with 217 commits in the last 30 days. Available on npm.
Stars
6,693
Forks
516
Language
TypeScript
License
MIT
Category
Last pushed
Mar 12, 2026
Monthly downloads
27,541
Commits (30d)
217
Dependencies
5
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/alibaba/page-agent"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related agents
hanzili/hanzi-browse
let any ai agent use the local browser
CloakHQ/CloakBrowser
Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with...
4ier/neo
Turn any web app into an API. Chrome extension captures browser traffic, auto-generates schemas,...
nicobailon/surf-cli
The CLI for AI agents to control Chrome. Zero config, agent-agnostic, battle-tested.
actionbook/actionbook
Browser action engine for AI agents. 10× faster, resilient by design.