TanmoyGG/Dhaka_Tribune-Scraping-and-Classification-XGBoost
An end-to-end R pipeline for scraping, processing, and classifying Dhaka Tribune news articles. Achieves 94% accuracy using Tidymodels and XGBoost, featuring automated text summarization and smart feature selection.
Stars
3
Forks
—
Language
R
License
—
Category
Last pushed
Jan 04, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/TanmoyGG/Dhaka_Tribune-Scraping-and-Classification-XGBoost"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
angelosalatino/cso-classifier
Python library that classifies content from scientific papers with the topics of the Computer...
giuseppebonaccorso/Reuters-21578-Classification
Text classification with Reuters-21578 datasets using Gensim Word2Vec and Keras LSTM
tblock/10kGNAD
Ten Thousand German News Articles Dataset for Topic Classification
NirantK/Hinglish
Hinglish Text Classification
yassersouri/classify-text
"20 Newsgroups" text classification with python