reader and scraping-agent-ai

These are complements: vakra-dev/reader provides the core web scraping and markdown cleaning infrastructure that hmshb/scraping-agent-ai wraps with agentic orchestration (LangGraph, Anthropic) to automate intelligent extraction workflows.

reader

Established

scraping-agent-ai

Emerging

Maintenance 10/25

Adoption 15/25

Maturity 20/25

Community 13/25

Maintenance 0/25

Adoption 6/25

Maturity 9/25

Community 15/25

Stars: 474

Forks: 32

Downloads: 196

Commits (30d): 0

Language: TypeScript

License: Apache-2.0

Stars: 16

Forks: 5

Downloads: —

Commits (30d): 0

Language: Python

License: MIT

No risk flags

Stale 6m No Package No Dependents

About reader

vakra-dev/reader

Open-source, production-grade web scraping engine built for LLMs. Scrape and crawl the entire web, clean markdown, ready for your agents.

Leverages [Ulixee Hero](https://ulixee.org/), a headless browser with built-in anti-bot defenses (TLS fingerprinting, Cloudflare bypass, DNS-over-TLS), managed through a pooled architecture with automatic recycling and health monitoring. Provides two core primitives—`scrape()` for converting URLs to cleaned markdown/HTML, and `crawl()` for breadth-first site discovery—with configurable browser pooling, proxy rotation strategies, batch concurrency, and graceful degradation handling all abstracted away.

About scraping-agent-ai

hmshb/scraping-agent-ai

AI-powered web scraping agent built with LangGraph, LangSmith, Firecrawl, and Anthropic AI. Automates intelligent crawling, structured data extraction, and LLM-powered content formatting. Efficiently handles anti-bot mechanisms, error recovery, and batch processing. 🚀

Related comparisons

reader and teracrawl reader and just-scrape

Scores updated daily from GitHub, PyPI, and npm data. How scores work