page-agent and browserable

These are competitors: both enable AI agents to control web browsers through natural language, but page-agent executes JavaScript in-page for direct DOM manipulation while browserable provides a separate browser automation library, offering different architectural approaches to the same problem.

page-agent
88
Verified
browserable
57
Established
Maintenance 25/25
Adoption 20/25
Maturity 24/25
Community 19/25
Maintenance 2/25
Adoption 13/25
Maturity 24/25
Community 18/25
Stars: 6,693
Forks: 516
Downloads: 27,541
Commits (30d): 217
Language: TypeScript
License: MIT
Stars: 1,169
Forks: 98
Downloads: 30
Commits (30d): 0
Language: JavaScript
License: MIT
No risk flags
Stale 6m

About page-agent

alibaba/page-agent

JavaScript in-page GUI agent. Control web interfaces with natural language.

Operates entirely client-side using text-based DOM analysis rather than screenshots, eliminating the need for multi-modal LLMs or external infrastructure like browser extensions or headless browsers. Integrates with any LLM via a standard API interface, and optionally extends to multi-page automation through a Chrome extension and MCP Server for external agent control.

About browserable

browserable/browserable

Open source and self-hostable browser automation library for AI agents

Provides vision-based task execution with LLM-driven navigation, form filling, and data extraction, achieving 90.4% on Web Voyager benchmarks. Integrates pluggable LLM providers (OpenAI, Claude, Gemini) and remote browser services (Hyperbrowser, Steel) via a Docker-based self-hosted architecture with MongoDB, Redis, and MinIO backend. Exposes functionality through a REST API and JavaScript SDK for programmatic agent control.

Scores updated daily from GitHub, PyPI, and npm data. How scores work