davidetaraborrelli/textkd-p1-clean-prep
A very simple baseline for text preprocessing + linear classification with Python and scikit-learn. It compares raw vs cleaned (lemmatized) text using TF-IDF + Logistic Regression, then saves metrics and a confusion matrix for quick inspection.
No commits in the last 6 months.
Stars
—
Forks
—
Language
Python
License
—
Category
Last pushed
Aug 27, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/davidetaraborrelli/textkd-p1-clean-prep"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
giacbrd/ShallowLearn
An experiment about re-implementing supervised learning models based on shallow neural network...
Wluper/edm
Python package for understanding the difficulty of text classification datasets. (in CoNNL 2018)
javedsha/text-classification
Machine Learning and NLP: Text Classification using python, scikit-learn and NLTK
fendouai/Awesome-Text-Classification
Awesome-Text-Classification Projects,Papers,Tutorial .
chicago-justice-project/article-tagging
Natural Language Processing of Chicago news articles