ScrapeGraphAI/Scrapegraph-ai

Python scraper based on AI

62 / 100 (Established)

Uses LLM-driven graph logic to extract data from websites and local documents (HTML, XML, JSON, Markdown) without hand-written selectors: users state the extraction goal in natural language. Integrates with major LLM providers (OpenAI, Ollama) and orchestration frameworks (LangChain, LlamaIndex, CrewAI), plus low-code platforms (Zapier, n8n, Pipedream) through SDKs and API endpoints. Includes specialized pipelines such as SearchGraph for multi-page extraction and SpeechGraph for audio generation from scraped content.
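
As a rough illustration of the selector-free workflow described above, the sketch below follows the `SmartScraperGraph` pattern from the project's README. The model name and config values are placeholder assumptions, `build_graph_config` and `extract` are hypothetical helper names, and running `extract` requires the `scrapegraphai` package plus a reachable LLM backend (here, a local Ollama server).

```python
def build_graph_config(model: str = "ollama/llama3.2") -> dict:
    """Return a minimal graph config; the model name is an assumed placeholder."""
    return {
        "llm": {"model": model, "model_tokens": 8192},
        "verbose": False,
        "headless": True,
    }


def extract(prompt: str, source: str):
    """Run one natural-language extraction against a URL or local document."""
    # Imported lazily so the config helper works without the package installed.
    from scrapegraphai.graphs import SmartScraperGraph

    graph = SmartScraperGraph(
        prompt=prompt,            # the extraction goal, in plain language
        source=source,            # a URL or the contents of a local document
        config=build_graph_config(),
    )
    return graph.run()
```

A call such as `extract("List the article titles on this page", "https://example.com")` would hand the page to the configured model, with no CSS or XPath selectors involved.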

22,929 stars. Actively maintained with 12 commits in the last 30 days.

No Package, No Dependents
Maintenance 17 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25
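
The four category scores above add up to the 62 / 100 headline figure. A quick check, assuming (the page does not state this) that the overall score is a plain sum of the categories:

```python
# Category scores copied from the listing; each is out of 25.
scores = {"Maintenance": 17, "Adoption": 10, "Maturity": 16, "Community": 19}

total = sum(scores.values())   # 17 + 10 + 16 + 19
maximum = 25 * len(scores)     # four categories, 25 points each

print(f"{total} / {maximum}")  # prints "62 / 100"
```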

Stars: 22,929
Forks: 2,000
Language: Python
License: MIT
Last pushed: Feb 24, 2026
Commits (30d): 12

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/ScrapeGraphAI/Scrapegraph-ai"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
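
The same request can be made from Python with only the standard library. This is a sketch: the endpoint path is taken from the curl command above, but the response is assumed to be JSON (the page does not show its shape), and `quality_url` and `fetch_quality` are hypothetical helper names.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example in the listing.
BASE = "https://pt-edge.onrender.com/api/v1/quality/rag"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository, following the curl example."""
    return f"{BASE}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record; assumes the response body is JSON."""
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

`fetch_quality("ScrapeGraphAI", "Scrapegraph-ai")` issues the same GET as the curl command. Unauthenticated calls count against the keyless daily limit.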