pc8544/Website-Crawler

Extract data from websites in LLM ready JSON or CSV format. Crawl or Scrape entire website with Website Crawler

/ 100

Emerging

No License No Package No Dependents

Maintenance 10 / 25

Adoption 9 / 25

Maturity 7 / 25

Community 12 / 25

Stars

Forks

Language

Java

License

—

Category

Last pushed

Feb 19, 2026

Commits (30d)

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/pc8544/Website-Crawler"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

any4ai/AnyCrawl

AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts...

kreuzberg-dev/html-to-markdown

High performance and CommonMark compliant HTML to Markdown converter. Maintained by the...

lightfeed/extractor

Using LLMs and AI browser automation to robustly extract web data

ScrapeGraphAI/Scrapegraph-ai

Python scraper based on AI

paulpierre/markdown-crawler

A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file...