pmbstyle/gemini-browser-agent
A browser agent, packaged as a Google Chrome extension, that works inside your own browser. It is based on Google's Gemini 2.5 Computer Use model.
Bridges a Chrome extension with Google's Gemini Computer Use API to observe and interact with the active tab in real time, exchanging screenshots and DOM events without requiring sandboxing. A Python WebSocket server communicates bidirectionally with the extension, so the model can carry out browser automation tasks directly within your own browser context. Supports agentic workflows in which Gemini plans multi-step interactions and streams execution logs back to the UI.
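The observe/act exchange described above can be sketched as a single step of a message loop. This is a minimal illustration and not the project's actual protocol: the message fields (`type`, `url`, `screenshot_b64`), the `Action` kinds, and the `plan_next_action` callback (a stand-in for the call to the Gemini model) are all assumptions made for the example. The WebSocket transport is left out so the sketch runs with the standard library alone.

```python
import base64
import json
from dataclasses import asdict, dataclass
from typing import Callable


@dataclass
class Action:
    """A browser action sent to the extension (hypothetical schema)."""
    kind: str      # e.g. "click" or "type"; names are illustrative only
    payload: dict  # action arguments, such as coordinates or text


def decode_observation(raw: str) -> dict:
    """Parse an observation message from the extension (assumed JSON shape)."""
    msg = json.loads(raw)
    if msg.get("type") != "observation":
        raise ValueError(f"unexpected message type: {msg.get('type')!r}")
    return {
        "url": msg["url"],
        # Screenshots are assumed to arrive base64-encoded inside the JSON.
        "screenshot": base64.b64decode(msg["screenshot_b64"]),
    }


def encode_action(action: Action) -> str:
    """Serialize an action for the extension to execute."""
    return json.dumps({"type": "action", **asdict(action)})


def run_step(raw_observation: str,
             plan_next_action: Callable[[dict], Action]) -> str:
    """One observe -> plan -> act step; the planner stands in for the model."""
    observation = decode_observation(raw_observation)
    return encode_action(plan_next_action(observation))
```

In a real server, `run_step` would be called once per incoming WebSocket message, with its return value sent back to the extension over the same connection.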
Stars: 63
Forks: 16
Language: JavaScript
License: —
Category:
Last pushed: Oct 23, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/pmbstyle/gemini-browser-agent"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
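The same request can be made from Python with the standard library alone. This is a rough sketch: the helper names `quality_url` and `fetch_quality` are hypothetical, and the assumption that the endpoint returns a JSON body is not confirmed by this page.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/agents"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response, assuming the body is JSON."""
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_quality("pmbstyle", "gemini-browser-agent")` issues the same GET request as the curl command; unauthenticated calls count against the 100 requests/day limit.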
Higher-rated alternatives
nth5693/gemini-kit
🚀 19 AI Agents + 44 Commands for Gemini CLI - Code 10x faster with auto planning, testing,...
josstei/maestro-gemini
Turn Gemini CLI into a multi-agent platform — 12 specialized subagents, parallel dispatch,...
lopushok9/gemini_quant
Free, easy-to-use, AI-driven market research tool for the Gemini CLI
Joonghyun-Lee-Frieren/oh-my-gemini-cli
Context-engineering-powered multi-agent team workflow pack for Gemini CLI.
jduncan-rva/gemini-agent-creator
AI-powered extension for Gemini CLI that creates custom agents through natural conversation