Traditional ML Text Classification NLP Tools

Tools implementing classical machine learning approaches (scikit-learn, naive bayes, SVM, TF-IDF vectorization) for text classification tasks. Does NOT include deep learning methods, neural networks, or transformer-based models like BERT.

There are 44 traditional ml text classification tools tracked. The highest-rated is giacbrd/ShallowLearn at 48/100 with 198 stars and 12 monthly downloads.

Get all 44 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=traditional-ml-text-classification&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 giacbrd/ShallowLearn

An experiment about re-implementing supervised learning models based on...

48
Emerging
2 Wluper/edm

Python package for understanding the difficulty of text classification...

46
Emerging
3 javedsha/text-classification

Machine Learning and NLP: Text Classification using python, scikit-learn and NLTK

44
Emerging
4 fendouai/Awesome-Text-Classification

Awesome-Text-Classification Projects,Papers,Tutorial .

39
Emerging
5 chicago-justice-project/article-tagging

Natural Language Processing of Chicago news articles

39
Emerging
6 Pritishsingla/nbsvm_sklearn

Scikit-learn style wrapper for the NBSVM algorithm for text classification

35
Emerging
7 opennlp/Large-Scale-Text-Classification

Large Scale benchmarking of state of the art text vectorizers

35
Emerging
8 eannefawcett/lexile-determination-v2

NLP to classify a text's lexile level

34
Emerging
9 FelixHo/Text-Classification-Benchmark

文本分类基准测试

33
Emerging
10 scotthlee/document-classification

Simple command-line scripts for document classification

31
Emerging
11 amankedia/Question-Classification

Classifying questions from UIUC's CogComp QC Dataset

30
Emerging
12 akshaybhatia10/Book-Genre-Classification

Classification of books based on titles without prior knowledge of context or author

29
Experimental
13 uosdmlab/playdata-zeppelin-notebook

Zeppelin 화재 뉴스 기사 분류 예제

28
Experimental
14 lpq29743/text_classification

Text classification models (including traditional machine learning methods...

28
Experimental
15 cahya-wirawan/text-classification

Text Classification engine using several algorithms in machine learning

28
Experimental
16 cair/TextUnderstandingTsetlinMachine

Using the Tsetlin Machine to learn human-interpretable rules for...

27
Experimental
17 p-karisani/self_pretraining

A classification model

26
Experimental
18 Authentic10/daxmod

A Python toolbox for text classification

23
Experimental
19 dwulff/text2sdg

Detect UN Sustainable Development Goals in Text

23
Experimental
20 UNDP-Data/dsc-sdgi-corpus

Model benchmarks on SDGi Corpus, a multilingual dataset for text...

22
Experimental
21 muggin/string-kernels

Implementation of the "Text Classification using String Kernels" publication...

21
Experimental
22 kushagra2103/Auto-Tagging-System

The project is based on a multi-label classification problem in NLP.

21
Experimental
23 madhurimamandal/Text-classification-into-difficulty-levels

Text classification into various reading difficulty levels using various...

21
Experimental
24 hlbao/classification_in_CSS

The tutorial on scraping, processing, and classification of text-based...

20
Experimental
25 swapkh91/Question-Classification

Classify questions as - what, when, who, affirmation, unknown

18
Experimental
26 andreaschandra/feature-selection-papers

A list of paper feature selection for text classification

17
Experimental
27 ARUNAGIRINATHAN-K/Text-Classifier-using-NLP_Techniques

A Text Classification using NLP Techniques.

16
Experimental
28 maty-bohacek/xgboost-vs-gpt4

Official Implementation of the 'When XGBoost Outperforms GPT-4 on Text...

16
Experimental
29 tim5go/awesome-one-class-classification

A curated list of awesome resources dedicated to One Class Classification.

16
Experimental
30 RenzeLou/Text-to-Text-Classification

Investigating the NLP classification tasks, text-to-text vs. text-to-indices.

16
Experimental
31 MJahangeerQureshi/Text-Classification

A collection text classification methodologies

16
Experimental
32 ZhengZixiang/TCPapers

Worth-reading papers and related resources on text classification....

15
Experimental
33 Qinzhizhou/NLP-Classfication

Two NLP assignments for sentiment and topic classfication

14
Experimental
34 nadeeshafdo/text-classifier

A simple text classifier for classifying ideas into categories.

14
Experimental
35 RussianNLP/TAPE

TAPE benchmark

13
Experimental
36 atom-zh/SA_Classification

Multi Label Classification of short Texts of Science Association 机器学习 神经网络...

12
Experimental
37 YipingNUS/contextual-eval-dataset

Evaluation Dataset for "Bootstrapping Large-Scale Fine-Grained Contextual...

12
Experimental
38 Mariam-Badr-MB/Book-s-genre

Discover hidden micro-genres in books using unsupervised machine learning....

12
Experimental
39 davidetaraborrelli/textkd-p1-clean-prep

A very simple baseline for text preprocessing + linear classification with...

11
Experimental
40 TextXD/introduction-to-text-classification

Introduction to text classification

11
Experimental
41 stepthom/autotext

Run AutoML tools on a dataframe with text

11
Experimental
42 SupreethRao99/eXtreme-Classification

eXtreme Classification, classifying gigabytes of natural language text in minutes

10
Experimental
43 unlbslk/python-easy-text-classification

An easy machine learning Python text classification script with scikit-learn

10
Experimental
44 mohammad-oghli/Text-Classification

Classify text to different categories using trained ML model

10
Experimental