halpert3/complaint-content-classification-nlp
Natural Language Processing classification project with machine learning models developed to classify consumer complaints.
Leverages lemmatization and TF-IDF vectorization to process 162,400 CFPB consumer complaint narratives, consolidating nine financial product classes into five balanced categories. Implements Multinomial Naive Bayes and Gradient Boosting models achieving 86% macro recall, with an API integration layer enabling real-time classification of up-to-date complaint data. Addresses class imbalance through parameter tuning rather than SMOTE, prioritizing recall to minimize false negatives in complaint routing workflows.
No commits in the last 6 months.
Stars
18
Forks
8
Language
Jupyter Notebook
License
—
Category
Last pushed
May 18, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/halpert3/complaint-content-classification-nlp"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.