PhantomInsights/subreddit-analyzer
A comprehensive Data and Text Mining workflow for submissions and comments from any given public subreddit.
Implements a three-stage pipeline (ETL, NLP, visualization). The ETL stage fetches Reddit data from the Pushshift API using recursive pagination; the NLP stage runs comments through spaCy for tokenization and named entity extraction; the visualization stage produces temporal distribution charts and word frequency analysis with matplotlib/seaborn. Multiple subreddits can be analyzed, data retrieval is configurable by either a fixed volume or a target date, and other languages are handled by swapping in a different spaCy model.
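The two retrieval modes described above (fixed volume or target date) can be illustrated with a small sketch. This is not the repository's code: `collect_comments`, `make_fake_fetcher`, and the fetcher signature are hypothetical names chosen for illustration, the pagination is written as a loop rather than recursion, and the only assumption about the data is that each item carries a `created_utc` timestamp and that pages arrive newest first. A real fetcher would wrap an HTTP call; here an in-memory fake stands in so the example runs offline.

```python
from typing import Any, Callable, Dict, Iterable, List, Optional

Comment = Dict[str, Any]
# Hypothetical fetcher signature: (before_timestamp or None, page_size)
# -> one page of comments, newest first, all strictly older than `before`.
FetchPage = Callable[[Optional[int], int], List[Comment]]


def collect_comments(
    fetch_page: FetchPage,
    max_items: Optional[int] = None,
    target_ts: Optional[int] = None,
    page_size: int = 500,
) -> List[Comment]:
    """Walk backwards in time until a volume limit or a target date is hit."""
    collected: List[Comment] = []
    before: Optional[int] = None  # None means "start from the newest item"
    while True:
        page = fetch_page(before, page_size)
        if not page:
            break  # the source has no older data
        for item in page:
            # Target-date mode: stop at the first item older than the cutoff.
            if target_ts is not None and item["created_utc"] < target_ts:
                return collected
            collected.append(item)
            # Fixed-volume mode: stop once enough items were gathered.
            if max_items is not None and len(collected) >= max_items:
                return collected
        # The oldest timestamp on this page becomes the cursor for the next one.
        before = page[-1]["created_utc"]
    return collected


def make_fake_fetcher(timestamps: Iterable[int]) -> FetchPage:
    """Build an in-memory stand-in for a remote API (illustration only)."""
    ordered = sorted(timestamps, reverse=True)

    def fetch_page(before: Optional[int], size: int) -> List[Comment]:
        older = [t for t in ordered if before is None or t < before]
        return [{"created_utc": t} for t in older[:size]]

    return fetch_page


fake = make_fake_fetcher(range(100, 110))
by_volume = collect_comments(fake, max_items=4, page_size=3)
by_date = collect_comments(fake, target_ts=105, page_size=3)
```

Injecting the fetcher keeps the stopping logic separate from the network layer, so the same loop works for submissions or comments. Using the oldest timestamp as a strict cursor can skip items that share that exact timestamp across a page boundary, which a production version would need to handle.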
499 stars. No commits in the last 6 months.
Stars: 499
Forks: 38
Language: Python
License: MIT
Category:
Last pushed: Apr 02, 2020
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/PhantomInsights/subreddit-analyzer"
Open to everyone: 100 requests/day with no key needed, or 1,000 requests/day with a free key.
Higher-rated alternatives
cannlytics/cannlytics
🔥 Cannlytics = cannabis + analytics. Data pipelines, user interfaces, and the best statistics in...
kariemoorman/tiktok-analyzer
TikTok video scraping and multimodal content analysis tool.
rahulkumaran/merkalysis
A marketing tool that helps you to market your products using organic marketing. This tool can...
eaglewarrior/scrape_do_nlp
I have made a package which will extract google news and twitter tweets and do sentiment...
oluobiri/nba-hate-tracker
Which NBA player does r/NBA hate the most? Sentiment analysis of 1.57M Reddit comments from the...