memvid/maw

Crawl any website into a single searchable file. Query it forever, offline.

/ 100

Emerging

Automatically detects crawling strategy—starting with fast HTTP fetches, falling back to Playwright browser rendering, then stealth mode for protected sites—without manual configuration. Stores crawled content in `.mv2` files (memvid's document database format) supporting both BM25 keyword search and optional semantic embeddings via OpenAI. Includes CLI commands for searching, AI-powered Q&A, exporting to markdown/JSON, and supports crawling documentation sites, blogs, local codebases, and git repositories with configurable depth and rate-limiting.

Available on npm.

No License

Maintenance 10 / 25

Adoption 10 / 25

Maturity 10 / 25

Community 12 / 25

How are scores calculated?

Stars

Forks

Language

TypeScript

License

—

Higher-rated alternatives

vakra-dev/reader

Open-source, production-grade web scraping engine built for LLMs. Scrape and crawl the entire...

joaobenedetmachado/scrapit

A (really) easy way to web scrape

firecrawl/open-scouts

🔥 AI-powered web monitoring platform. Create automated scouts that search the web and send email...

BrowserCash/teracrawl

High-performance web crawler API optimized for LLMs. Turn any search or website into clean...

poneoneo/Alibaba-CLI-Scraper

Create your own Alibaba dataset and interact with it in plain English.

Explore AI Agents

All categories Trending AI Agent directory Insights