All Perception Tools
10,528 tools ranked by quality score · Page 5 of 106
| # | Tool | Score | Tier |
|---|---|---|---|
| 401 | phase3dev/sitemap-extract: Processes XML sitemaps and extracts URLs. Includes features such as support... | | Emerging |
| 402 | asad-haider/spidey: Robust web spider for Node.js | | Emerging |
| 403 | umbrellaDocumentation/Web-Data-Scraper: Web Data Scraper - no-code internet scraping. Extract and export to CSV,... | | Emerging |
| 404 | mhwgoo/cambridge: Terminal version of Cambridge Dictionary by default. Also supports the... | | Emerging |
| 405 | valayDave/arxiv-miner: arxiv_miner is a toolkit for mining research papers on CS ArXiv. | | Emerging |
| 406 | Hecate2/Ignareo-ISML-auto-voter: Ignareo the Carillon, a web crawler/spider template of ultimate high... | | Emerging |
| 407 | crawlab-team/crawlab-lite: Lite version of Crawlab, a lightweight crawler management platform | | Emerging |
| 408 | NLPatVCU/PaperScraper: A web scraping tool to systematically extract the text of scientific papers... | | Emerging |
| 409 | rivermont/spidy: The simple, easy to use command line web crawler. | | Emerging |
| 410 | supadata-ai/py: Official Python SDK for the Supadata API. | | Emerging |
| 411 | VolkanSah/Auto-Proxy-Fetcher: Automatically fetch and update proxy lists from multiple sources every 6... | | Emerging |
| 412 | MarketingPipeline/Python-Selenium-Action: Run Selenium with Python via GitHub Actions using Headless or Non-Headless browsers! | | Emerging |
| 413 | buyukakyuz/email-sleuth: Discover and verify professional emails using names + domains | | Emerging |
| 414 | infinilabs/crawler: 🕷️ An easy-to-use spider written in Golang (previously named GOPA). | | Emerging |
| 415 | N4rr34n6/TikTok-User-Info-Scraper: TikTok User Info Scraper allows you to fetch detailed information about... | | Emerging |
| 416 | macloo/python-adv-web-apps: Updated python-beginners docs and examples | | Emerging |
| 417 | SiddharthSaxena/PyCurrency-Converter: A Python library to convert currency using Google Finance. | | Emerging |
| 418 | ivan-sincek/bot-safe-agents: A library for fetching a list of bot-safe user agents. | | Emerging |
| 419 | amoudgl/short-jokes-dataset: Python scripts for building 'Short Jokes' dataset, featured on Kaggle | | Emerging |
| 420 | batuhaniskr/twitter-intelligence: Twitter Intelligence OSINT project performs tracking and analysis of the Twitter | | Emerging |
| 421 | my8100/scrapyd-cluster-on-heroku: Set up free and scalable Scrapyd cluster for distributed web-crawling with... | | Emerging |
| 422 | ayakashi-io/ayakashi: ⚡ Ayakashi.io - The next generation web scraping framework | | Emerging |
| 423 | FaustoS88/PinescriptV6-docs-crawler: A Python tool for crawling and processing TradingView's PineScript V6... | | Emerging |
| 424 | elliotxx/zhihu-crawler-people: A simple distributed crawler for zhihu && data analysis | | Emerging |
| 425 | arhcoder/Club-de-Programacion-Creativa: 🍇 Club de Programación Creativa (Creative Programming Club): if something can be automated, we automate it;... | | Emerging |
| 426 | kameleo-io/kameleo: Anti-detect browser for web scraping and automation. Engine-level... | | Emerging |
| 427 | kyle-n/unofficial-amazon-search: A simple client for searching Amazon | | Emerging |
| 428 | awolverp/markupever: The fast, most optimal, and correct HTML & XML parsing library for Python... | | Emerging |
| 429 | shurco/goClone: 🌱 goClone - clone websites in seconds | | Emerging |
| 430 | SheikhAminul/browser-automator: Puppeteer alternative for Chrome extensions. | | Emerging |
| 431 | abo123456789/leek: Distributed task redisqueue (the simplest Python distributed function scheduling framework) | | Emerging |
| 432 | rflechner/ScrapySharp: Reborn of https://bitbucket.org/rflechner/scrapysharp | | Emerging |
| 433 | shenfe/puppeteer-service: 🎠 Run headless Chrome (aka Puppeteer) as a service. | | Emerging |
| 434 | AgriciDaniel/google-ai-studio-n8n-google-maps-scraper: A no-code Google Maps lead scraper built with Google AI Studio (Gemini) and... | | Emerging |
| 435 | MLArtist/WebScraper: Python-based web crawling script with randomized intervals, user-agent... | | Emerging |
| 436 | ZenRows/zenrows-python-sdk: SDK to access ZenRows API directly from Python. We handle proxies rotation,... | | Emerging |
| 437 | OSINT-TECHNOLOGIES/dpulse: DPULSE - Tool for complex approach to domain OSINT | | Emerging |
| 438 | ypspy/dart-scraping: Acquire and process DART disclosure filings | | Emerging |
| 439 | amerkurev/scrapper: Web scraper with a simple REST API living in Docker and using a Headless... | | Emerging |
| 440 | EdJoPaTo/website-stalker: Track changes on websites via git | | Emerging |
| 441 | vittoriotriassi/jobs_scraper: Simple job postings scraper for Indeed based on requests and BeautifulSoup | | Emerging |
| 442 | Aran404/SpotAPI: A Python wrapper for the public & private Spotify API | | Emerging |
| 443 | MontFerret/lab: Test runner for Ferret | | Emerging |
| 444 | ScriptSmith/reaper: Social media scraping / data collection tool for the Facebook, Twitter,... | | Emerging |
| 445 | israelbls/notebooklm-podcast-automator: REST API to automate Google NotebookLM - upload sources (URLs, YouTube,... | | Emerging |
| 446 | beucismis/limoon: Web scraper-based Pythonic API for Ekşi Sözlük | | Emerging |
| 447 | sewcio543/soupsavvy: Powerful and flexible web scraping Search Engine | | Emerging |
| 448 | zrashwani/arachnid: Crawl all unique internal links found on a given website, and extract SEO... | | Emerging |
| 449 | 18520339/facebook-data-extraction: Experience for effectively fetching Facebook data by Querying Graph API with... | | Emerging |
| 450 | tasooshi/pukpuk: HTTP discovery and change monitoring tool | | Emerging |
| 451 | oxylabs/amazon-scraper: Free Trial Amazon Scraper API for extracting search, product, offer listing,... | | Emerging |
| 452 | serp-spider/search-engine-google: 🕷️ Google client for SERPS | | Emerging |
| 453 | sixem/imageboard-dl: Image downloader for various imageboards and image albums written in Python. | | Emerging |
| 454 | scrapfly/typescript-scrapfly: SDK for Scrapfly.io web scraping API | | Emerging |
| 455 | A3h1nt/Grawler: Grawler is a tool written in PHP which comes with a web interface that... | | Emerging |
| 456 | fernandod1/Instagram-to-discord: Monitor an Instagram user account and automatically post new images to Discord... | | Emerging |
| 457 | Raccoon254/Aviator-Automated-Betika-Bot: Automated Aviator Betting Bot for Betika, Spribe & Other Aviator-style sites... | | Emerging |
| 458 | RomySaputraSihananda/tiktok-comment-scrapper: Get all comments from a TikTok video URL or ID | | Emerging |
| 459 | abhijeet-reddy/Competitive_Programming_Score_API: API to get user details for competitive coding platforms - Codeforces,... | | Emerging |
| 460 | elixir-crawly/crawly: Crawly, a high-level web crawling & scraping framework for Elixir. | | Emerging |
| 461 | aditeyaS/8bp-free-gift-collector: 8 ball pool free rewards collector | | Emerging |
| 462 | JonasCz/save-for-offline: Android app for saving webpages for offline reading. | | Emerging |
| 463 | AlexandreGazagnes/awdible: Awdible - Just the best free version of audible. Awdible is a free and... | | Emerging |
| 464 | flother/htmltab: Command-line utility to convert HTML tables into CSV files | | Emerging |
| 465 | sepandhaghighi/gitfollow: GitHub follower and following | | Emerging |
| 466 | HelloThereMatey/tedata: Scraper for Trading Economics | | Emerging |
| 467 | gildas-lormeau/simple-cdp: Lightweight JavaScript library to interact with Chromium-based browsers via... | | Emerging |
| 468 | yfe404/web-scraper: Intelligent web scraping Claude Code skill with automatic strategy selection... | | Emerging |
| 469 | gicornachini/bolsa: Library written in Python that aims to make it easier to access data from... | | Emerging |
| 470 | lablnet/stepwright: A powerful web scraping library built with Playwright that provides a... | | Emerging |
| 471 | 2captcha/puppeteer-recaptcha-solver-using-clicks: Here is an example of solving reCAPTCHA using the Grid method. In this... | | Emerging |
| 472 | crawlcore/qcrawl: qcrawl - fast async web crawling & scraping framework for Python. | | Emerging |
| 473 | Police-Data-Accessibility-Project/scrapers: Code relating to scraping public police data. | | Emerging |
| 474 | Krasjet/pdf.tocgen: A CLI toolset to generate table of contents for PDF files automatically. | | Emerging |
| 475 | kelp/webdown: Download websites and save or view them as markdown, great for feeding into an LLM | | Emerging |
| 476 | voliveirajr/seleniumcrawler: An example using Selenium webdrivers for python and Scrapy framework to... | | Emerging |
| 477 | GoogleChromeLabs/pptraas.com: Puppeteer as a service | | Emerging |
| 478 | zyachel/quetre: A libre front-end for Quora | | Emerging |
| 479 | abaykan/CrawlBox: Easy way to brute-force web directory. | | Emerging |
| 480 | medialab/SearchEnginesBookmarklet: Extract list of results from search engines pages as CSV with a bookmarklet... | | Emerging |
| 481 | bbbbbrie/pastebin-bisque: Download all of a given user's public Pastebin pastes | | Emerging |
| 482 | robalb/ebpf-web-fingerprint: A Golang library and webserver for fast TCP & TLS fingerprinting, powered by eBPF | | Emerging |
| 483 | peviitor-ro/JobsScrapers: Scraping of the jobs available and adding them all in one place at peviitor.ro | | Emerging |
| 484 | MarcusFelling/demo.playwright: This repo is used to demo various testing scenarios with Playwright 🎭, using... | | Emerging |
| 485 | lablnet/pakweather_scraper: A multi-threaded Pakistan Weather crawler written in JavaScript | | Emerging |
| 486 | openstates/billy: Legacy backend for Open States | | Emerging |
| 487 | scrapinghub/scrapy-training: Scrapy Training companion code | | Emerging |
| 488 | leogregianin/bancocentralbrasil: 💵 💰 🇧🇷 Information on official daily rates for inflation, Selic,... | | Emerging |
| 489 | aeleraqi/GoogleNewsScraper: Google News Scraper, a Python notebook designed to extract news articles... | | Emerging |
| 490 | floriandiud/facebook-group-members-scraper: Facebook Group Members Extractor. Download Facebook group members in CSV. | | Emerging |
| 491 | cpatrickalves/scraping-ebay: Scraping eBay products using the Scrapy web crawling framework | | Emerging |
| 492 | DavyJonesCodes/PyTweetToolkit: PyTweetToolkit: An intuitive Python library for managing Twitter... | | Emerging |
| 493 | lackeyjb/playwright-skill: Claude Code Skill for browser automation with Playwright. Model-invoked -... | | Emerging |
| 494 | Crinibus/scraper: Web scraper for scraping, tracking and visualizing prices of products on... | | Emerging |
| 495 | sloev/spotiflite: Scrapes Spotify and dumps data to a sqlite3 database without auth | | Emerging |
| 496 | ScrapeOps/scrapeops-scrapy-sdk: Scrapy extension that gives you all the scraping monitoring, alerting,... | | Emerging |
| 497 | blacknon/pydork: Scraping and listing text and image searches on Google, Bing, DuckDuckGo,... | | Emerging |
| 498 | sansan0/bilibili-comment-analyzer: 🎯 Data visualization and analysis software for Bilibili comment sections; uploaders can use it to guide their choice of topics and identify their fan base | | Emerging |
| 499 | shakirth-anisha/pesu-slide-download-automator: Automates the process of downloading all PESU Academy Slides + helps merge... | | Emerging |
| 500 | oxylabs/oxylabs-ai-studio-js: Structured data gathering from any website using AI-powered scraper,... | | Emerging |