Embedding/Chinese-Word-Vectors

100+ Chinese Word Vectors (hundreds of pre-trained Chinese word vectors)

51 / 100
Established

Provides both dense (SGNS, skip-gram with negative sampling) and sparse (PPMI, positive pointwise mutual information) word representations trained on diverse corpora (encyclopedias, news, social media, classical literature), with a choice of context features (word, n-gram, character, and combinations of these) that capture different morphological and semantic properties. Also includes the CA8 Chinese analogical reasoning benchmark and an evaluation toolkit for intrinsic quality assessment, so users can compare vector sets before applying them to downstream NLP tasks.
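As a rough illustration of what an analogical reasoning benchmark such as CA8 measures, the sketch below parses vectors from word2vec-style text and answers one analogy with the 3CosAdd rule. The file layout (an optional `count dim` header, then one `word v1 v2 ...` row per line) is an assumption, and the toy vectors and the helper names `parse_text_vectors`, `cosine`, and `solve_analogy` are invented for illustration. They are not taken from the project's toolkit, which may score analogies differently.

```python
import math
from typing import Dict, Iterable, List, Optional

Vector = List[float]


def parse_text_vectors(lines: Iterable[str]) -> Dict[str, Vector]:
    """Parse word2vec-style text rows into a {word: vector} dict.

    Assumed layout (not confirmed by this page): an optional "count dim"
    header on the first line, then "word v1 v2 ..." on each later line.
    """
    vectors: Dict[str, Vector] = {}
    for index, line in enumerate(lines):
        parts = line.split()
        if index == 0 and len(parts) == 2 and all(p.isdigit() for p in parts):
            continue  # skip the header line
        if len(parts) < 2:
            continue  # skip blank or malformed rows
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def solve_analogy(vectors: Dict[str, Vector], a: str, b: str, c: str) -> Optional[str]:
    """Answer "a is to b as c is to ?" with 3CosAdd.

    Each candidate d is scored as cos(d, b) - cos(d, a) + cos(d, c);
    the three query words themselves are excluded from the candidates.
    """
    best_word: Optional[str] = None
    best_score = -math.inf
    for word, vec in vectors.items():
        if word in (a, b, c):
            continue
        score = (cosine(vec, vectors[b])
                 - cosine(vec, vectors[a])
                 + cosine(vec, vectors[c]))
        if score > best_score:
            best_word, best_score = word, score
    return best_word


# Toy 2-dimensional vectors, made up for this example only.
# 男人 = man, 女人 = woman, 国王 = king, 王后 = queen, 苹果 = apple
TOY_TEXT = """5 2
男人 1.0 0.0
女人 1.0 1.0
国王 3.0 0.0
王后 3.0 1.0
苹果 -1.0 0.5
"""

toy_vectors = parse_text_vectors(TOY_TEXT.splitlines())
answer = solve_analogy(toy_vectors, "男人", "女人", "国王")
print(answer)  # 王后
```

A benchmark run would repeat this over many analogy questions and report the fraction answered correctly, which is one way to compare vector sets trained with different corpora or context features.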

12,188 stars. No commits in the last 6 months.

Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25


Stars 12,188
Forks 2,325
Language Python
License Apache-2.0
Last pushed Oct 30, 2023
Commits (30d) 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/Embedding/Chinese-Word-Vectors"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
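The same request can be made from Python with only the standard library. This is a minimal sketch: the `category/owner/repo` path pattern is inferred from the single URL shown above, the function names `quality_url` and `fetch_quality` are hypothetical, and the response body is returned as raw text because its schema is not documented here.

```python
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; everything after it is assumed
# to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for one repository (assumed path pattern)."""
    return f"{BASE_URL}/{quote(category)}/{quote(repo)}"


def fetch_quality(url: str, timeout: float = 10.0) -> str:
    """Fetch the endpoint and return the response body as text.

    The body is not parsed, since the response format is not described here.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")


# Example call (performs a network request, so it is left commented out):
# body = fetch_quality(quality_url("embeddings", "Embedding/Chinese-Word-Vectors"))
```

With the keyless limit of 100 requests/day, a script that polls this endpoint would likely want to cache the result rather than refetch on every run.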