jingpeicomp/product-category-predict

商品类目预测,使用 Spring Boot 开发框架和 Spark MLlib 机器学习框架,通过 TF-IDF 和 Bayes 算法,训练出一个商品类目预测模型。该模型可以根据商品名称自动预测出商品类目。项目对外提供 RESTFul 接口。

34
/ 100
Emerging

Implements hierarchical three-level product categorization across 962 categories using jieba Chinese tokenization augmented with a 300K-term domain dictionary, then applies TF-IDF weighting and HashingTF dimensionality reduction before Naive Bayes classification. Integrates with Spark 2.2.1 for distributed model training on large datasets (12M+ products trained in ~10 minutes on 8-core/8GB resources) and provides batch prediction via RESTful endpoints, achieving 82% accuracy on held-out test data using only product titles as input features.

157 stars. No commits in the last 6 months.

No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 1 / 25
Community 23 / 25

How are scores calculated?

Stars

157

Forks

70

Language

Java

License

Last pushed

Jun 07, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/jingpeicomp/product-category-predict"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.