News Category Classification NLP Tools
Projects for classifying news articles into predefined categories (business, sports, politics, entertainment, tech, etc.) using NLP and machine learning. Does NOT include sentiment analysis, fake news detection, topic modeling, or document classification for non-news domains.
There are 90 news category classification tools tracked. One scores above 50 (Established tier). The highest-rated is angelosalatino/cso-classifier at 64/100, with 95 stars and 243 monthly downloads.
Get the 90 projects as JSON (the example request below passes limit=20):
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=news-category-classification&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
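The same request can be sketched in Python using only the standard library. This is a minimal sketch, not a documented client: the helper names `build_url` and `fetch_projects` are hypothetical, only the three query parameters from the curl command are used, and the shape of the JSON response is not described on this page, so it is returned undecoded beyond `json.load`.

```python
import json
import urllib.parse
import urllib.request

# Endpoint copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="nlp", subcategory="news-category-classification", limit=20):
    """Build the query URL from the three parameters shown in the curl command."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Fetch the listing and parse it as JSON (hypothetical helper).

    The response schema is an unknown here, so the parsed object is
    returned as-is rather than being indexed by assumed keys.
    """
    with urllib.request.urlopen(build_url(limit=limit), timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Printing the URL needs no network access; call fetch_projects() to hit the API.
    print(build_url())
```

Calling `fetch_projects()` counts against the daily request allowance mentioned above. Whether the API accepts a `limit` large enough to return all 90 projects in one call is not stated on this page.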
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 | angelosalatino/cso-classifier: Python library that classifies content from scientific papers with the... | | Established |
| 2 | newsgac/platform: Platform for machine learning experiments developed in the project NEWSGAC | | Emerging |
| 3 | giuseppebonaccorso/Reuters-21578-Classification: Text classification with Reuters-21578 datasets using Gensim Word2Vec and Keras LSTM | | Emerging |
| 4 | aqibsaeed/Research-Paper-Categorization: Research paper classification using machine learning and NLP | | Emerging |
| 5 | tblock/10kGNAD: Ten Thousand German News Articles Dataset for Topic Classification | | Emerging |
| 6 | timthedev07/news-articles-classification: Classification model for news articles with deep learning. | | Emerging |
| 7 | mdh266/TextClassificationApp: Building and Deploying A Serverless Text Classification Web App | | Emerging |
| 8 | NirantK/Hinglish: Hinglish Text Classification | | Emerging |
| 9 | yassersouri/classify-text: "20 Newsgroups" text classification with Python | | Emerging |
| 10 | vectorsss/news_classification: Convolutional neural network and web crawler for automatically scraping and classifying NetEase news | | Emerging |
| 11 | sndsabin/Nepali-News-Classifier: Text Classification of Nepali Language Document. This Mini Project was done... | | Emerging |
| 12 | jdvala/lazytext: LazyText is inspired by the idea of lazypredict, a library which helps build... | | Emerging |
| 13 | uhh-lt/BlurbGenreCollection-HMC: Hierarchical multi-label text classification of the BlurbGenreCollection... | | Emerging |
| 14 | FantacherJOY/Arabic-text-classification: Arabic text documents classified using SVM, k-NN and Naive Bayes classifiers. | | Emerging |
| 15 | elleros/text-classification-challenge: Supervised text classification | | Emerging |
| 16 | yutkin/News-Aggregator: Classification and aggregation of Russian news articles. University coursework. | | Emerging |
| 17 | saidziani/Arabic-News-Article-Classification: Automatic categorization of documents, consists in assigning a category to a... | | Emerging |
| 18 | gokriznastic/20-newsgroups_text-classification: "20 newsgroups" dataset - Text Classification using Multinomial Naive Bayes... | | Experimental |
| 19 | tomeliud/Natural-Language-Processings-NLP-on-Swahili-News-Classification-: Develop a multi-class classification model to classify news content... | | Experimental |
| 20 | aditya00kumar/document-classification: This project is an attempt to provide a generic pipeline for document... | | Experimental |
| 21 | PetropoulakisPanagiotis/text-classification: Text Classification using scikit-learn. Classify BBC articles. | | Experimental |
| 22 | urigoren/nlp_classification_workshop: NLP Classification Workshop | | Experimental |
| 23 | searchisko/project-classifier-poc: Searchisko: A semantic search service over categorised content. | | Experimental |
| 24 | Fresearch99/patent_classification_project: Python collection of programs to classify and identify patents that are... | | Experimental |
| 25 | sayakpaul/Generating-categories-from-arXiv-paper-titles: This project takes the arXiv dataset and builds an automatic tag classifier... | | Experimental |
| 26 | jarif87/movie-tv-genre-detector: Building a Multilabel Movie and TV Show Classifier with Blurr | | Experimental |
| 27 | msi1427/MultiLabel-Book-Genre-Classifier: A multi-label text classifier that can classify 141 different book genres... | | Experimental |
| 28 | cssturkiye/migration-tr: Migration-TR: Turkish migration discourse dataset with 6M tweets... | | Experimental |
| 29 | Davisy/Swahili-News-Classifier-Streamlit-App: A simple Streamlit App to classify Swahili news into different categories. | | Experimental |
| 30 | sudharsan13296/Document-Classification-using-LSA: Document classification using Latent semantic analysis in Python | | Experimental |
| 31 | rootally/News-Category-Classification-with-BERT: Identify the type of news based on headlines and short descriptions | | Experimental |
| 32 | deepklarity/poster2plot: Generate movie/TV show plot from a poster | | Experimental |
| 33 | yihong1120/Reuters-News-Classification-and-Analysis: Train a model to categorize news articles, scrape and translate articles,... | | Experimental |
| 34 | saurabh1907/document-classification-ml-nlp: Document classification into four defined categories (World, Sports,... | | Experimental |
| 35 | shashankvmaiya/Movie-Genre-Multi-Label-Text-Classification: Multi Label Text Classification of movie genres using its plot | | Experimental |
| 36 | cawfree/reuters-dataset: 🗞️ A tool for downloading and parsing Reuters-21578. These are a collection... | | Experimental |
| 37 | Davisy/Swahili-News-Classifier-API: An API developed by using the FastAPI library and uses an NLP model to... | | Experimental |
| 38 | yogeshnile/Movies-Genre-Classification: In this repo I have created a Movies Genre Classification project in machine... | | Experimental |
| 39 | diem-ai/text-classification: The project aims to apply Naive Bayes on TF-IDF and Word2Vec Models. Use... | | Experimental |
| 40 | ViralLab/TurkishBERTweet: TurkishBERTweet: Fast and Reliable Large Language Model for Social Media Analysis | | Experimental |
| 41 | shibuiwilliam/aozora_classification: This project aims to classify Japanese sentence to how well similar to some... | | Experimental |
| 42 | SpringerNLP/Chapter3: Chapter 3: Text and Speech Basics | | Experimental |
| 43 | rcmckee/Document-Classification: Patent Classification with Machine Learning | | Experimental |
| 44 | mohamedELBAHA/Text-classification-of-BBC-news-dataset: Application of NLP, word embedding, LSTM, PCA, TSNE. | | Experimental |
| 45 | tomkeith/Multi-label-classification-with-NLP: Genre Genie - Movie Genre Predictor (multi-label classification with NLP) | | Experimental |
| 46 | camoverride/language-classifier: Tutorial for building an end-to-end machine learning application! 😊 | | Experimental |
| 47 | dkhd/text-classification: Code examples and presentation slides from Machine Learning ID #2 Meetup - Yogyakarta | | Experimental |
| 48 | xiangyi-njust/FWS: The code for this paper: Automatic Recognition and Classification of Future... | | Experimental |
| 49 | Wonuabimbola/movie-genre-prediction: Multiclass Text Classification problem where I used movie plot summaries to... | | Experimental |
| 50 | savasy/TurkishTextClassification: Text classification for Turkish language | | Experimental |
| 51 | shivam5992/classification_pipeline: 📙 A complete document classification pipeline using Apache Spark in Scala | | Experimental |
| 52 | Balakishan77/News-Articles-Classification: This dataset contains around 125k news headlines from the year 2013 to 2018... | | Experimental |
| 53 | sarufi-io/Swahili-sentiment-Analysis-using-transformers: The special repository to demonstrate how you can use transformers for... | | Experimental |
| 54 | Reslan-Tinawi/20-newsgroups-Text-Classification: Implementing text classification algorithms using the 20 newsgroups... | | Experimental |
| 55 | 0AlphaZero0/Text-Classification-Data-Citation-EuropePMC: Text classification on Citations in Scientific Papers | | Experimental |
| 56 | sbera7/Dialogue-act-classification: Dialogue Act classification | | Experimental |
| 57 | J-Mourad/MNAD: The dataset contains +1 Million Moroccan news articles that are classified... | | Experimental |
| 58 | tknishh/Text-Classification-Ag-News: The objective of this task is to build a text classification model using the... | | Experimental |
| 59 | alfagama/arxiv_predicting_paper_category: Predicting categories of scientific papers with advanced machine learning... | | Experimental |
| 60 | G0rav/Automatic_News_Tagging: Automatic News Categories Segmentation using Machine Learning and NLP | | Experimental |
| 61 | TomMakesThings/Movie-Genre-Predictor: Natural language processing (NLP) group project to create a multi-label... | | Experimental |
| 62 | robolab-pavia/slrkit_NLP_Nocera: Classification of the NLP dataset - Nocera | | Experimental |
| 63 | Alvant/20-newsgroups-secrets: Some hidden knowledge found in the 20 Newsgroups dataset | | Experimental |
| 64 | hakancangunerli/turkish_binary_classifier: This is a logistic regression binary classification system for classifying... | | Experimental |
| 65 | parsa-abbasi/Persian-News-Classifier: Implementing a news classifier on Persian news dataset | | Experimental |
| 66 | fatihbarutcu1/turkish-text-classifier: Turkish tweet classifier with 90% accuracy (scikit-learn & Streamlit) | | Experimental |
| 67 | trinhdoduyhungss/classification_sentence_knn_simple: It's an example of a sentence classification project using KNN algorithm. | | Experimental |
| 68 | avestura/PersianNews: 📰 My final project for NLP course | | Experimental |
| 69 | X-XENDROME-X/News-Classification-Transformer: Advanced news topic classification system using DistilBERT transformer... | | Experimental |
| 70 | ulkudilemre/Turkish-text-author-classification: Author-text classification in Turkish texts with 80% accuracy in SVM linear kernel | | Experimental |
| 71 | ezosa/topic-labelling: Multilingual topic labelling of news topics | | Experimental |
| 72 | apdullahyayik/Tr-topicter: 🔍 A simple topic detector. | | Experimental |
| 73 | AdelAdool/News-Category-Classifier: News Category Classification using AG News dataset. Implements text... | | Experimental |
| 74 | SajjadHossain43/app-multi-label-game-genre-classification-imdb: This project focuses on classifying video games into multiple genres using... | | Experimental |
| 75 | Ryohskay/peshitta_cj: Attempt to classify OT Peshitta (a [Classical] Syriac Translation of the Old... | | Experimental |
| 76 | PaKicek/News-Classifier: A classifier which collects news via RSS and trains ML models to classify... | | Experimental |
| 77 | sujik18/EACL-2026-Abjad-NLP: Files for Shared Task 4: Medical Text Classification in Arabic... | | Experimental |
| 78 | Rahul-404/bbc-news-sorting: 📰 BBC News Article Classifier: A project that categorizes BBC News articles... | | Experimental |
| 79 | dilohn/movie-revenue-predictor: Predict movie revenue based on movie summary and poster only | | Experimental |
| 80 | BeaEsparcia/spanish-news-classification: News classification project using Natural Language Processing (NLP) and... | | Experimental |
| 81 | KsLimon/Multilabel-imdb-movie-genre-classifier: A multi-label movie description classifier that can classify 21 different... | | Experimental |
| 82 | manik000-zx/text_classifier_ai: AI-powered text classification system built with Python and Streamlit.... | | Experimental |
| 83 | youssefdlfk/movie-genre-classification: Multi-label text classification of movie genres based on their synopsis | | Experimental |
| 84 | Simran0192004/Movie_Genre_Classification: A Machine Learning model that can predict the genre of a movie based on its... | | Experimental |
| 85 | ahmedbendebba1/News-classification-webapp: Train and deploy a News Classifier using Logistic Regression - Serverless container | | Experimental |
| 86 | FabriDeCastelli/HLT-News-Category: 42 way classification of news | | Experimental |
| 87 | TanmoyGG/Dhaka_Tribune-Scraping-and-Classification-XGBoost: An end-to-end R pipeline for scraping, processing, and classifying Dhaka... | | Experimental |
| 88 | Arian-Rahman/Otaku-Tag: Multi-label category classifier for Anime and Manga | | Experimental |
| 89 | StefanHeng/Personalized-Productivity-Dataset: Personalized Text Classification dataset with transient labels inspired by... | | Experimental |
| 90 | mbfakourii/NLP-persian-category: NLP Persian category with digimag and persian_news Dataset | | Experimental |