News Category Classification NLP Tools

Projects for classifying news articles into predefined categories (business, sports, politics, entertainment, tech, etc.) using NLP and machine learning. Does NOT include sentiment analysis, fake news detection, topic modeling, or document classification for non-news domains.

There are 90 news category classification tools tracked. 1 score above 50 (established tier). The highest-rated is angelosalatino/cso-classifier at 64/100 with 95 stars and 243 monthly downloads.

Get all 90 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=news-category-classification&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 angelosalatino/cso-classifier

Python library that classifies content from scientific papers with the...

64
Established
2 newsgac/platform

Platform for machine learning experiments developed in the project NEWSGAC

39
Emerging
3 giuseppebonaccorso/Reuters-21578-Classification

Text classification with Reuters-21578 datasets using Gensim Word2Vec and Keras LSTM

38
Emerging
4 aqibsaeed/Research-Paper-Categorization

Research paper classification using machine learning and NLP

36
Emerging
5 tblock/10kGNAD

Ten Thousand German News Articles Dataset for Topic Classification

35
Emerging
6 timthedev07/news-articles-classification

Classification model for news articles with deep learning.

34
Emerging
7 mdh266/TextClassificationApp

Building and Deploying A Serverless Text Classification Web App

34
Emerging
8 NirantK/Hinglish

Hinglish Text Classification

33
Emerging
9 yassersouri/classify-text

"20 Newsgroups" text classification with python

33
Emerging
10 vectorsss/news_classification

卷积神经网络&&爬虫 实现网易新闻自动爬取并分类

32
Emerging
11 sndsabin/Nepali-News-Classifier

Text Classification of Nepali Language Document. This Mini Project was done...

32
Emerging
12 jdvala/lazytext

LazyText is inspired by the idea of lazypredict, a library which helps build...

32
Emerging
13 uhh-lt/BlurbGenreCollection-HMC

Hierarchical multi-label text classification of the BlurbGenreCollection...

31
Emerging
14 FantacherJOY/Arabic-text-classification

Arabic text documents classified using SVM, k-nn and Naive bayes classifers.

31
Emerging
15 elleros/text-classification-challenge

Supervised text classification

31
Emerging
16 yutkin/News-Aggregator

Classification and aggregation of russian news articles. University coursework.

30
Emerging
17 saidziani/Arabic-News-Article-Classification

Automatic categorization of documents, consists in assigning a category to a...

30
Emerging
18 gokriznastic/20-newsgroups_text-classification

"20 newsgroups" dataset - Text Classification using Multinomial Naive Bayes...

29
Experimental
19 tomeliud/Natural-Language-Processings-NLP-on-Swahili-News-Classification-

Develop a multi-class classification model to classify news content...

29
Experimental
20 aditya00kumar/document-classification

This project is an attempt to provide a generic pipeline for document...

27
Experimental
21 PetropoulakisPanagiotis/text-classification

Text Classification using scikit-learn. Classify BBC articles.

26
Experimental
22 urigoren/nlp_classification_workshop

NLP Classification Workshop

26
Experimental
23 searchisko/project-classifier-poc

Searchisko: A semantic search service over categorised content.

26
Experimental
24 Fresearch99/patent_classification_project

Python collection of programs to classify and identify patents that are...

25
Experimental
25 sayakpaul/Generating-categories-from-arXiv-paper-titles

This project takes the arXiv dataset and builds an automatic tag classifier...

25
Experimental
26 jarif87/movie-tv-genre-detector

Building a Multilabel Movie and TV Show Classifier with Blurr

25
Experimental
27 msi1427/MultiLabel-Book-Genre-Classifier

A multi-label text classifier that can classify 141 different book genres...

24
Experimental
28 cssturkiye/migration-tr

Migration-TR: Turkish migration discourse dataset with 6M tweets...

24
Experimental
29 Davisy/Swahili-News-Classifier-Streamlit-App

A simple Streamlit App to classify swahili news into different categories.

24
Experimental
30 sudharsan13296/Document-Classification-using-LSA

Document classification using Latent semantic analysis in python

24
Experimental
31 rootally/News-Category-Classification-with-BERT

Identify the type of news based on headlines and short descriptions

24
Experimental
32 deepklarity/poster2plot

Generate movie/t.v show plot from a poster

24
Experimental
33 yihong1120/Reuters-News-Classification-and-Analysis

Train a model to categorize news articles, scrape and translate articles,...

24
Experimental
34 saurabh1907/document-classification-ml-nlp

Document classification into four defined categories (World, Sports,...

24
Experimental
35 shashankvmaiya/Movie-Genre-Multi-Label-Text-Classification

Multi Label Text Classification of movie genres using its plot

24
Experimental
36 cawfree/reuters-dataset

🗞️ A tool for downloading and parsing Reuters-21578. These are a collection...

23
Experimental
37 Davisy/Swahili-News-Classifier-API

An API developed by using the FastAPI library and uses an NLP model to...

23
Experimental
38 yogeshnile/Movies-Genre-Classification

In this repo i have created a Movies Genre Classification project in machine...

23
Experimental
39 diem-ai/text-classification

The project aims to apply Naives Bayes on TF-IDF and Word2Vec Models .Use...

23
Experimental
40 ViralLab/TurkishBERTweet

TurkishBERTweet: Fast and Reliable Large Language Model for Social Media Analysis

22
Experimental
41 shibuiwilliam/aozora_classification

This project aims to classify Japanese sentence to how well similar to some...

22
Experimental
42 SpringerNLP/Chapter3

Chapter 3: Text and Speech Basics

22
Experimental
43 rcmckee/Document-Classification

Patent Classification with Machine Learning

22
Experimental
44 mohamedELBAHA/Text-classification-of-BBC-news-dataset

Application of NLP, word embedding, LSTM, PCA, TSNE.

21
Experimental
45 tomkeith/Multi-label-classification-with-NLP

Genre Genie - Movie Genre Predictor (multi-label classification with NLP)

21
Experimental
46 camoverride/language-classifier

Tutorial for building an end-to-end machine learning application! 😊

21
Experimental
47 dkhd/text-classification

Code examples and presentation slides from Machine Learning ID #2 Meetup - Yogyakarta

20
Experimental
48 xiangyi-njust/FWS

the code for this paper : Automatic Recognition and Classification of Future...

20
Experimental
49 Wonuabimbola/movie-genre-prediction

Multiclass Text Classification problem where I used movie plot summaries to...

20
Experimental
50 savasy/TurkishTextClassification

text classification for Turkish language

19
Experimental
51 shivam5992/classification_pipeline

:orange_book: A complete document classification pipeline using Apache Spark in scala

19
Experimental
52 Balakishan77/News-Articles-Classification

This dataset contains around 125k news headlines from the year 2013 to 2018...

19
Experimental
53 sarufi-io/Swahili-sentiment-Analysis-using-transformers

The special repository to demonstrate how you can use transformers for...

18
Experimental
54 Reslan-Tinawi/20-newsgroups-Text-Classification

Implementing text classification algorithms using the 20 newsgroups...

17
Experimental
55 0AlphaZero0/Text-Classification-Data-Citation-EuropePMC

Text classification on Citations in Scientific Papers

17
Experimental
56 sbera7/Dialogue-act-classification

Dialogue Act classification

16
Experimental
57 J-Mourad/MNAD

The dataset contains +1 Million Moroccan news articles that are classified...

16
Experimental
58 tknishh/Text-Classification-Ag-News

The objective of this task is to build a text classification model using the...

16
Experimental
59 alfagama/arxiv_predicting_paper_category

Predicting categories of scientific papers with advanced machine learning...

15
Experimental
60 G0rav/Automatic_News_Tagging

Automatic News Categories Segmentation using Machine Learning and NLP

15
Experimental
61 TomMakesThings/Movie-Genre-Predictor

Natural language processing (NLP) group project to create a multi-label...

15
Experimental
62 robolab-pavia/slrkit_NLP_Nocera

Classification of the NLP dataset - Nocera

15
Experimental
63 Alvant/20-newsgroups-secrets

Some hidden knowledge found in the 20 Newsgroups dataset

15
Experimental
64 hakancangunerli/turkish_binary_classifier

this is a logistic regression binary classification system for classifying...

14
Experimental
65 parsa-abbasi/Persian-News-Classifier

Implementing a news classifier on Persian news dataset

14
Experimental
66 fatihbarutcu1/turkish-text-classifier

Turkish tweet classifier with 90% accuracy — scikit-learn & Streamlit

14
Experimental
67 trinhdoduyhungss/classification_sentence_knn_simple

It's an example of a sentence classification project using KNN algorithm.

13
Experimental
68 avestura/PersianNews

📰 My final project for NLP course

13
Experimental
69 X-XENDROME-X/News-Classification-Transformer

Advanced news topic classification system using DistilBERT transformer...

12
Experimental
70 ulkudilemre/Turkish-text-author-classification

Author-text classification in Turkish texts with 80% accuracy in svm linear kernel

12
Experimental
71 ezosa/topic-labelling

Multilingual topic labelling of news topics

12
Experimental
72 apdullahyayik/Tr-topicter

🔍 A simple topic detector.

12
Experimental
73 AdelAdool/News-Category-Classifier

News Category Classification using AG News dataset. Implements text...

12
Experimental
74 SajjadHossain43/app-multi-label-game-genre-classification-imdb

This project focuses on classifying video games into multiple genres using...

11
Experimental
75 Ryohskay/peshitta_cj

Attempt to classify OT Peshitta (a [Classical] Syriac Translation of the Old...

11
Experimental
76 PaKicek/News-Classifier

A classifier which collects news via RSS and trains ML models to classify...

11
Experimental
77 sujik18/EACL-2026-Abjad-NLP

Files for Shared Task 4: Medical Text Classification in Arabic...

11
Experimental
78 Rahul-404/bbc-news-sorting

📰 BBC News Article Classifier: A project that categorizes BBC News articles...

11
Experimental
79 dilohn/movie-revenue-predictor

predict movie revenue based on movie summary and poster only

11
Experimental
80 BeaEsparcia/spanish-news-classification

News classification project using Natural Language Processing (NLP) and...

11
Experimental
81 KsLimon/Multilabel-imdb-movie-genre-classifier

A multi-label movie description classifier that can classify 21 different...

11
Experimental
82 manik000-zx/text_classifier_ai

AI-powered text classification system built with Python and Streamlit....

11
Experimental
83 youssefdlfk/movie-genre-classification

Multi-label text classification of movie genres based on their synopsis

11
Experimental
84 Simran0192004/Movie_Genre_Classification

A Machine Learning model that can predict the genre of a movie based on its...

11
Experimental
85 ahmedbendebba1/News-classification-webapp

Train and deploy a News Classifier using Logitic Regression - Serverless container

10
Experimental
86 FabriDeCastelli/HLT-News-Category

42 way classification of news

10
Experimental
87 TanmoyGG/Dhaka_Tribune-Scraping-and-Classification-XGBoost

An end-to-end R pipeline for scraping, processing, and classifying Dhaka...

10
Experimental
88 Arian-Rahman/Otaku-Tag

Multi-label category classifier for Anime and Mange

10
Experimental
89 StefanHeng/Personalized-Productivity-Dataset

Personalized Text Classification dataset with transient labels inspired by...

10
Experimental
90 mbfakourii/NLP-persian-category

NLP persian category with digimag and persian_news Dataset

10
Experimental