carlosplanchon/spidercreator

Automated web scraping spider generation using Browser Use and LLMs. Streamline the creation of Playwright-based spiders with minimal manual coding. Ideal for large enterprises with recurring data extraction needs.

53
/ 100
Established

Uses Browser Use to record interactive scraping sessions, then applies a multi-stage LLM pipeline to generate optimized XPath-based Playwright spiders that execute cheaply without further LLM calls. Integrates with Parsel for HTML parsing and includes a virtual execution environment (ctxexec) to validate candidate spider implementations before selecting the best performer for each navigation stage.

217 stars and 10 monthly downloads. No commits in the last 6 months. Available on PyPI.

Stale 6m No Dependents
Maintenance 2 / 25
Adoption 12 / 25
Maturity 25 / 25
Community 14 / 25

How are scores calculated?

Stars

217

Forks

22

Language

Python

License

AGPL-3.0

Category

llm-web-scraping

Last pushed

Aug 25, 2025

Monthly downloads

10

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/carlosplanchon/spidercreator"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.