PhantomInsights/subreddit-analyzer

A comprehensive Data and Text Mining workflow for submissions and comments from any given public subreddit.

41
/ 100
Emerging

Implements a three-stage pipeline (ETL, NLP, visualization) using the Pushshift API to fetch Reddit data with recursive pagination, processes comments through spaCy for tokenization and named entity extraction, then generates temporal distribution charts and word frequency analysis via matplotlib/seaborn. Supports multi-subreddit analysis with configurable data retrieval by either fixed volume or target date, and handles language-agnostic NLP through swappable spaCy models.

499 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 15 / 25

How are scores calculated?

Stars

499

Forks

38

Language

Python

License

MIT

Last pushed

Apr 02, 2020

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/PhantomInsights/subreddit-analyzer"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.