ai-naymul/BrowserPilot
Open‑source alternative to Perplexity Comet, director.ai, and Firecrawl combined
Uses Google Gemini's vision AI to interpret web pages from their rendered appearance rather than by parsing HTML, which makes it more resilient to layout changes and to anti-bot detection. Built on Playwright for browser automation, with integrated proxy rotation, CAPTCHA solving, and Cloudflare bypass. Provides a FastAPI backend with a real-time streaming UI and, driven by natural-language commands, exports results as PDF, CSV, or JSON.
Stars: 109
Forks: 21
Language: Python
License: MIT
Category:
Last pushed: Mar 12, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/ai-naymul/BrowserPilot"
Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
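For scripted access, the same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns a JSON body, which the listing does not confirm, and the helper names `build_url` and `fetch_quality` are hypothetical, introduced only for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/agents"


def build_url(owner: str, repo: str) -> str:
    """Build the per-repository URL, percent-encoding each path segment."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the listing data for one repository.

    Assumption: the response body is JSON. If it is not,
    json.loads raises a ValueError and the caller should handle it.
    """
    request = urllib.request.Request(
        build_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("ai-naymul", "BrowserPilot")` issues the same GET request as the curl command. No API key is sent, so the call counts against the keyless daily limit.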
Related agents
alibaba/page-agent
JavaScript in-page GUI agent. Control web interfaces with natural language.
CloakHQ/CloakBrowser
Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with...
hanzili/hanzi-browse
Let any AI agent use the local browser.
4ier/neo
Turn any web app into an API. Chrome extension captures browser traffic, auto-generates schemas,...
nicobailon/surf-cli
The CLI for AI agents to control Chrome. Zero config, agent-agnostic, battle-tested.