Codeeaner/Computer-Use-Agent
An AI Agent that is able to control your screen to complste any task
Implements a local, closed-loop automation system using Qwen3-VL vision-language model via Ollama to iteratively capture screenshots, analyze UI state, and execute mouse/keyboard actions on Windows 11. Built with `mss` for efficient screen capture and `pyautogui` for input control, it includes function calling for structured action planning and screenshot history for debugging automation workflows.
Stars
16
Forks
—
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Oct 23, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/Codeeaner/Computer-Use-Agent"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
droidrun/droidrun
Automate your mobile devices with natural language commands - an LLM agnostic mobile Agent 🤖
trycua/cua
Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and...
TurixAI/TuriX-CUA
This is the official website for TuriX Computer-use-Agent
Haervwe/open-webui-tools
Open‑WebUI Tools is a modular toolkit designed to extend and enrich your Open WebUI instance,...
erickjtorres/app-use
📱 Make apps accessible for AI agents