arunjeyaprasad/mcp-rag-web-scraper

Customizable web scraper that can be used to build a knowledge base which can be integrated with a RAG system for Search. Supports MCP integration as well for querying

19
/ 100
Experimental

This tool helps businesses, researchers, or anyone needing to create a private, searchable knowledge base from public websites. You provide a list of website URLs, and it automatically scrapes their content to build an offline database. This database can then be queried using natural language, acting like a private search engine for the information you've gathered.

No commits in the last 6 months.

Use this if you need to gather specific information from websites regularly and want to create a private, AI-searchable reference system for your team or personal use, especially for integrating with AI assistants like Claude Desktop or local LLMs like Ollama.

Not ideal if you need a general-purpose web crawler for broad data collection across the entire internet, or if you don't require the natural language search and AI integration capabilities.

knowledge-management competitive-intelligence content-curation research-automation private-search
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 2 / 25
Maturity 15 / 25
Community 0 / 25

How are scores calculated?

Stars

2

Forks

Language

Python

License

MIT

Last pushed

Jul 03, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/mcp/arunjeyaprasad/mcp-rag-web-scraper"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.