dselivanov/text2vec

Fast vectorization, topic modeling, distances and GloVe word embeddings in R.

Score: 55 / 100 (Established)

Implements core NLP tasks in C++ with OpenMP parallelization for near-linear multicore scaling, and uses stream-based processing to handle data larger than RAM. Provides a unified API across vectorization, topic modeling (LSA/LDA), distance metrics, and GloVe embeddings. Supports fork-based parallel backends on Unix systems for embarrassingly parallel operations such as document-term matrix construction.
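
As a rough illustration of the vectorization workflow described above, the sketch below builds a small document-term matrix and computes pairwise cosine similarities. The three toy documents and the variable names are assumptions made up for this example; itoken(), create_vocabulary(), vocab_vectorizer(), create_dtm() and sim2() are text2vec functions, but this is a minimal sketch rather than the package's recommended pipeline.

```r
library(text2vec)

# Toy corpus (hypothetical data, for illustration only)
docs <- c(
  "the cat sat on the mat",
  "the dog sat on the log",
  "cats and dogs"
)

# Token iterator: documents are lower-cased and tokenized in chunks,
# which is what lets the same workflow stream over larger inputs
it <- itoken(docs,
             preprocessor = tolower,
             tokenizer = word_tokenizer,
             progressbar = FALSE)

# Collect the vocabulary, then map terms to matrix columns
vocab <- create_vocabulary(it)
vectorizer <- vocab_vectorizer(vocab)

# Sparse document-term matrix: one row per document, one column per term
dtm <- create_dtm(it, vectorizer)

# Pairwise cosine similarity between documents
sims <- sim2(dtm, method = "cosine", norm = "l2")

print(dim(dtm))
print(dim(sims))
```

The same iterator is passed to both create_vocabulary() and create_dtm(), so the corpus is scanned twice but never needs to be held in memory as one tokenized object.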


No Package, No Dependents
Maintenance 6 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 23 / 25


Stars: 870
Forks: 134
Language: R
License: (not listed)
Last pushed: Dec 01, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/dselivanov/text2vec"

Open to everyone: 100 requests/day with no key needed, or 1,000 requests/day with a free key.