WLXie-Tony/Movie_Review_Analysis
Official replication package for IJFE (2026). Asynchronous ETL pipeline using GPT-4o to quantify investor distraction shocks from unstructured movie reviews.
The pipeline combines strict Pydantic schema validation with asynchronous batch processing, using semaphores to bound concurrency, and reports a 20x speedup over sequential LLM calls. State-aware scraping with exponential-backoff retry logic processes 247K+ IMDb reviews idempotently. It calls OpenAI's GPT-4o API for deterministic sentiment scoring and keeps parameters in YAML configuration, decoupled from code. The package targets financial economics research that requires production-grade reproducibility and auditability, supported by centralized logging and defensive type hints throughout.
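The semaphore-bounded batching and exponential-backoff retry described above can be sketched roughly as follows. This is a minimal illustration, not the repository's code: `fake_score`, `TransientError`, and the constants are hypothetical stand-ins, and no real GPT-4o call is made.

```python
import asyncio
import random

MAX_CONCURRENCY = 5   # hypothetical cap on in-flight requests
MAX_RETRIES = 4       # hypothetical retry budget per review
BASE_DELAY = 0.01     # seconds; kept tiny so the sketch runs quickly


class TransientError(Exception):
    """Stand-in for a rate-limit or network error from a remote API."""


async def fake_score(review: str, attempts: dict) -> float:
    """Hypothetical scorer: fails on the first call per review, then succeeds."""
    attempts[review] = attempts.get(review, 0) + 1
    await asyncio.sleep(0)  # yield control, as a real network call would
    if attempts[review] == 1:
        raise TransientError(review)
    return 1.0 if "good" in review else -1.0


async def score_with_retry(review: str, sem: asyncio.Semaphore, attempts: dict) -> float:
    for attempt in range(MAX_RETRIES):
        try:
            # The semaphore is held only for the call itself, so a task that
            # is backing off does not occupy a concurrency slot.
            async with sem:
                return await fake_score(review, attempts)
        except TransientError:
            if attempt == MAX_RETRIES - 1:
                raise  # retry budget exhausted
            # Exponential backoff with jitter: base * 2^attempt, scaled by noise.
            delay = BASE_DELAY * (2 ** attempt) * (1 + random.random())
            await asyncio.sleep(delay)


async def score_all(reviews):
    sem = asyncio.Semaphore(MAX_CONCURRENCY)
    attempts = {}
    # gather preserves input order, so scores line up with reviews.
    scores = await asyncio.gather(
        *(score_with_retry(r, sem, attempts) for r in reviews)
    )
    return scores, attempts
```

Calling `asyncio.run(score_all(reviews))` returns the scores in input order together with the per-review attempt counts. Releasing the semaphore before sleeping is one design choice among several; holding it during backoff would throttle harder at the cost of throughput.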
Stars: 154
Forks: —
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Feb 16, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/WLXie-Tony/Movie_Review_Analysis"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
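For scripted use, the same request can be issued from Python's standard library. This is a sketch under stated assumptions: the endpoint is taken verbatim from the curl command above, the helper names `build_request` and `fetch_quality` are hypothetical, and the response is assumed to be JSON, which the page does not confirm.

```python
import json
import urllib.request

# URL copied from the curl example; no API key is sent (keyless tier).
API_URL = "https://pt-edge.onrender.com/api/v1/quality/nlp/WLXie-Tony/Movie_Review_Analysis"


def build_request(url: str = API_URL) -> urllib.request.Request:
    """Build a plain GET request; the Accept header is an assumption."""
    return urllib.request.Request(
        url, headers={"Accept": "application/json"}, method="GET"
    )


def fetch_quality(url: str = API_URL, timeout: float = 10.0):
    """Send the request and decode the body, assuming it is JSON."""
    with urllib.request.urlopen(build_request(url), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality()` performs the network request. With the keyless limit of 100 requests/day, caching the result locally is advisable.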
Related tools
BeiCunNan/Sentiment_Analysis_Imdb
Uses BERT/RoBERTa + LSTM/GRU/BiLSTM/TextCNN for sentiment analysis on the IMDb datasets.
prakharchoudhary/SentimentAnalysis
A sentiment analysis model trained on the IMDb dataset, using a bag-of-words model (tokenisation)
SkyThonk/Movie-Reviews-Sentiment-Analysis
Sentiment Analysis of Movie Reviews is either positive or negative review, the dataset which is...
farisology/SentimentAnalysis
Sentiment analysis model using a linear SVM and a collection of tweets about the Star Wars movie Rogue One
ThinamXx/NeuralNetworks__SentimentAnalysis
In this repository, I have worked out on the Large Movie Review Dataset for the Sentiment...