alirezamika/autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Learns extraction patterns from sample data (text, URLs, or HTML attributes) you provide, then applies those rules to extract similar content from new pages; supports both pattern-matching ("similar results") and exact value retrieval modes. Integrates with the `requests` library for customizable HTTP transport, including proxy and header support, and persists trained scrapers to disk for reuse across multiple pages.
7,122 stars and 1,197 monthly downloads. No commits in the last 6 months. Available on PyPI.
Stars
7,122
Forks
720
Language
Python
License
MIT
Category
Last pushed
Jun 09, 2025
Monthly downloads
1,197
Commits (30d)
0
Dependencies
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/alirezamika/autoscraper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
YoongiKim/AutoCrawler
Google, Naver multiprocess image web crawler (Selenium)
lorey/mlscraper
🤖 Scrape data from HTML websites automatically by just providing examples
machine-learning-apps/Issue-Label-Bot
Code For The Issue Label Bot, an App that automatically labels issues using machine learning,...
nuhmanpk/Webtrench
A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of...
Tuhin-thinks/instagram-unfollower-tracker-meerkit
Analyze Instagram followers, find unfollowers, automate follow/unfollow, and predict follow-backs.