web-arena-x/webarena
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
Provides a self-hostable Gym-like environment with simulated e-commerce, social media, and productivity websites for evaluating web navigation agents via accessibility tree observations and ID-based actions. Features 812 task configurations with reproducible evaluation infrastructure, auto-login mechanisms, and support for prompt-based agents (GPT-3.5, GPT-4). Integrates with Playwright for browser automation and OpenAI APIs, with Docker containerization for isolated website hosting.
1,398 stars.
Stars
1,398
Forks
226
Language
Python
License
Apache-2.0
Category
Last pushed
Nov 26, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/web-arena-x/webarena"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
nabeelxy/syara
SYARA: Super YARA Rules for GenAI Era
princeton-nlp/WebShop
[NeurIPS 2022] đź›’WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
X-LANCE/Mobile-Env
A Universal Platform for Training and Evaluation of Mobile Interaction
shbernal/pdfanki
Create Anki decks from PDF/EPUB files using NLP with LLMs.
dinhanhx/cpu-ish-rag
A very CPU-friendly RAG implementation