alirezamika/autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

64
/ 100
Established

Learns extraction patterns from sample data (text, URLs, or HTML attributes) you provide, then applies those rules to extract similar content from new pages; supports both pattern-matching ("similar results") and exact value retrieval modes. Integrates with the `requests` library for customizable HTTP transport, including proxy and header support, and persists trained scrapers to disk for reuse across multiple pages.

7,122 stars and 1,197 monthly downloads. No commits in the last 6 months. Available on PyPI.

Stale 6m
Maintenance 2 / 25
Adoption 17 / 25
Maturity 25 / 25
Community 20 / 25

How are scores calculated?

Stars

7,122

Forks

720

Language

Python

License

MIT

Last pushed

Jun 09, 2025

Monthly downloads

1,197

Commits (30d)

0

Dependencies

3

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/alirezamika/autoscraper"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.