reader and teracrawl
These are direct competitors—both are production web scraping engines that convert web content to clean markdown for LLM consumption, with reader offering more mature adoption (474 stars, 196 monthly downloads vs. 236 stars, 0 monthly downloads) and teracrawl explicitly positioning itself as a Firecrawl alternative.
About reader
vakra-dev/reader
Open-source, production-grade web scraping engine built for LLMs. Scrape and crawl the entire web, clean markdown, ready for your agents.
Leverages [Ulixee Hero](https://ulixee.org/), a headless browser with built-in anti-bot defenses (TLS fingerprinting, Cloudflare bypass, DNS-over-TLS), managed through a pooled architecture with automatic recycling and health monitoring. Provides two core primitives—`scrape()` for converting URLs to cleaned markdown/HTML, and `crawl()` for breadth-first site discovery—with configurable browser pooling, proxy rotation strategies, batch concurrency, and graceful degradation handling all abstracted away.
About teracrawl
BrowserCash/teracrawl
High-performance web crawler API optimized for LLMs. Turn any search or website into clean Markdown using remote browsers. Firecrawl alternative
Leverages Browser.cash managed Chrome instances with a two-phase crawling strategy—fast mode for static content that reuses browser contexts and blocks heavy assets, falling back to dynamic mode for SPAs requiring hydration detection. Integrates with browser-serp for Google search functionality and optionally with Datalab for PDF-to-Markdown conversion, while maintaining a configurable session pool for parallel scraping with automatic timeout handling and anti-bot protection.
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work