pmbstyle/gemini-computer-use

A minimal browser automation agent using Google's Gemini 2.5 Computer Use Preview model and Playwright for web browser control.

29 / 100 (Experimental)

Implements vision-based browser automation: screenshots are fed to the Gemini 2.5 Computer Use model, which locates and interacts with page elements visually rather than by parsing the DOM. Includes built-in safety guardrails with human-in-the-loop confirmation for sensitive operations. The action API covers clicks, typing, scrolling, drag-and-drop, and navigation, all executed through Playwright's browser control layer.
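The action-execution step described above can be sketched roughly as follows. This is an illustrative sketch, not the repository's code: the action dictionary shape (`name`, `args`, `needs_confirmation`), the `execute_action` and `denormalize` helpers, and the 0-999 normalized coordinate grid are all assumptions made for the example. Only the Playwright calls (`page.mouse.click`, `page.keyboard.type`, `page.mouse.wheel`, `page.goto`) are real API methods.

```python
from typing import Any, Callable, Dict


def denormalize(value: int, size: int, grid: int = 1000) -> int:
    """Map a coordinate on an assumed 0..grid-1 scale to a pixel position."""
    return int(value / grid * size)


def execute_action(
    page: Any,
    action: Dict[str, Any],
    width: int,
    height: int,
    confirm: Callable[[Dict[str, Any]], bool] = lambda action: False,
) -> str:
    """Run one model-proposed action against a Playwright-style page.

    `action` uses a hypothetical shape:
    {"name": str, "args": dict, "needs_confirmation": bool}.
    Actions flagged as sensitive run only if `confirm` returns True.
    """
    # Human-in-the-loop gate: the default callback denies everything.
    if action.get("needs_confirmation") and not confirm(action):
        return "denied"

    name = action["name"]
    args = action.get("args", {})
    if name == "click":
        page.mouse.click(
            denormalize(args["x"], width), denormalize(args["y"], height)
        )
    elif name == "type":
        page.keyboard.type(args["text"])
    elif name == "scroll":
        page.mouse.wheel(0, args["delta_y"])
    elif name == "navigate":
        page.goto(args["url"])
    else:
        return "unsupported"
    return "executed"
```

With a real Playwright `page`, a caller would pass the viewport width and height plus a `confirm` callback that prompts the user, for example one that wraps `input()`. After each executed action, a fresh screenshot would be sent back to the model for the next step.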

No License, No Package, No Dependents
Maintenance 6 / 25
Adoption 6 / 25
Maturity 1 / 25
Community 16 / 25


Stars 23
Forks 6
Language Python
License none listed
Last pushed Oct 23, 2025
Commits (30d) 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/pmbstyle/gemini-computer-use"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
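The same request can be made from Python with the standard library alone. This is a minimal sketch: the helper names `quality_url` and `fetch_quality` are hypothetical, and it assumes the endpoint returns a JSON body, which this page does not state.

```python
import json
from urllib.request import urlopen

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/agents"


def quality_url(owner: str, repo: str) -> str:
    """Build the quality-data URL for a given repository."""
    return f"{BASE_URL}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the quality data (assumes a JSON response)."""
    with urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("pmbstyle", "gemini-computer-use")` issues the same GET request as the curl command. The field names in the response are not documented here, so inspect the returned object before relying on specific keys.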