aidriventesting/Agent

Open-source AI agent for UI automation, combining structural and visual understanding of mobile & web interfaces. Toward the next generation of open-source, AI-driven testing.

38
/ 100
Emerging

Integrates with Robot Framework as a natural-language library that converts plain-English instructions into UI actions via multi-provider LLMs (OpenAI, Claude, Gemini). Uses vision-based UI parsing with OmniParser and Set-of-Mark techniques to ground visual elements, supporting both mobile (via Appium) and web platforms through a unified LLM→context→tool selection pipeline. Keywords like `Agent.Do`, `Agent.Check`, and `Agent.Ask` enable semantic test steps without explicit selectors or coordinates.

No Package No Dependents
Maintenance 10 / 25
Adoption 5 / 25
Maturity 9 / 25
Community 14 / 25

How are scores calculated?

Stars

11

Forks

3

Language

Python

License

MIT

Last pushed

Jan 19, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/aidriventesting/Agent"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.