hrwhisper/SpamMessage

Chinese spam SMS detection (hand-written classifiers)

40 / 100 (Emerging)

Implements multiple classification algorithms (Perceptron, Logistic Regression, Naive Bayes, SVM) with both custom implementations and scikit-learn wrappers, using jieba for Chinese tokenization and bag-of-words feature representation. The pipeline includes separate training (cross-validation in test.py) and inference phases, with trained models serialized for reuse. Accepts unlabeled SMS files via command-line interface and outputs binary spam/non-spam predictions.
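As a rough illustration of the bag-of-words approach described above, here is a minimal hand-written multinomial Naive Bayes classifier. It is a sketch under stated assumptions, not the repository's code: the class name `NaiveBayes`, the toy training data, and the add-one smoothing are all choices made for this example. Messages are passed in already tokenized; for Chinese text the tokens would typically come from a segmenter such as `jieba.lcut(text)`, which is left out here so the block runs with the standard library alone.

```python
import math
from collections import Counter


class NaiveBayes:
    """Multinomial Naive Bayes over bag-of-words counts (illustrative sketch)."""

    def fit(self, docs, labels):
        # docs: list of token lists; labels: list of class ids (1 = spam, 0 = not spam).
        self.classes = sorted(set(labels))
        self.vocab = {tok for doc in docs for tok in doc}
        # Per-class token counts form the bag-of-words statistics.
        self.counts = {c: Counter() for c in self.classes}
        for doc, y in zip(docs, labels):
            self.counts[y].update(doc)
        self.totals = {c: sum(self.counts[c].values()) for c in self.classes}
        self.log_prior = {
            c: math.log(labels.count(c) / len(labels)) for c in self.classes
        }
        return self

    def predict(self, doc):
        vocab_size = len(self.vocab)
        best_class, best_score = None, -math.inf
        for c in self.classes:
            score = self.log_prior[c]
            for tok in doc:
                if tok not in self.vocab:
                    continue  # ignore tokens never seen in training
                # Add-one (Laplace) smoothing avoids log(0) for unseen pairs.
                p = (self.counts[c][tok] + 1) / (self.totals[c] + vocab_size)
                score += math.log(p)
            if score > best_score:
                best_class, best_score = c, score
        return best_class


# Toy, already-tokenized messages (English placeholders for readability).
train_docs = [
    ["free", "prize", "click", "link"],
    ["win", "free", "cash", "now"],
    ["meeting", "at", "noon", "tomorrow"],
    ["lunch", "tomorrow", "with", "team"],
]
train_labels = [1, 1, 0, 0]

model = NaiveBayes().fit(train_docs, train_labels)
spam_guess = model.predict(["free", "cash", "prize"])
ham_guess = model.predict(["team", "meeting", "tomorrow"])
```

Working in log space keeps the product of many small probabilities from underflowing, which matters once messages and vocabularies grow beyond toy size.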

201 stars. No commits in the last 6 months.

No License · Stale 6m · No Package · No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 22 / 25


Stars: 201
Forks: 61
Language: Python
License: None
Last pushed: Dec 08, 2016
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/hrwhisper/SpamMessage"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
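The same request can be made from Python with the standard library. This is a sketch under assumptions: the helper names `quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path layout is inferred only from the single URL shown above, and the response is assumed to be JSON, which the page does not state.

```python
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    # Path layout inferred from the example URL above (assumption).
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10):
    # Assumes the endpoint returns a JSON body; adjust if it does not.
    with urllib.request.urlopen(quality_url(category, owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


url = quality_url("nlp", "hrwhisper", "SpamMessage")
```

Calling `fetch_quality("nlp", "hrwhisper", "SpamMessage")` performs the actual network request and counts against the daily limit.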