raznem/parsera

Lightweight library for scraping web-sites with LLMs

/ 100

Emerging

Extracts structured data from websites by describing desired fields in natural language, with Playwright integration for dynamic content and JavaScript-heavy sites. Supports both synchronous and asynchronous execution, custom LLM models, and includes CLI/Docker deployment options. The library delegates parsing logic to LLMs via API, eliminating need for CSS selectors or XPath expressions.

1,272 stars.

No Package No Dependents

Maintenance 6 / 25

Adoption 10 / 25

Maturity 9 / 25

Community 16 / 25

How are scores calculated?

Stars

1,272

Forks

Language

Python

License

GPL-2.0

Higher-rated alternatives

carlosplanchon/spidercreator

Automated web scraping spider generation using Browser Use and LLMs. Streamline the creation of...

Riddhish1/CogniScrape

Intelligent Web Scraping Library with LLMs

poodle64/supacrawl

Zero-infrastructure web scraping for the terminal

yeahhe365/JustSearch

基于 Playwright 的自主 AI 搜索智能体。支持迭代式任务规划、深度网页爬取，以及带引用来源的多源知识整合。

rednafi/html-to-text

Extract pure text from any webpage

Explore LLM Tools

All categories Trending LLM Tool directory Insights