Word Embeddings Training NLP Tools

Tools and data for training, evaluating, and applying word embedding models (word2vec, fastText, GloVe). Includes trained models, training corpora, and statistical analysis of embeddings. Does NOT include downstream NLP tasks like NER or general preprocessing pipelines that use embeddings as components.

There are 6 word embeddings training tools tracked. The highest-rated is veldhub/veld_code__word2vec at 15/100 with 0 stars.

Get all 6 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=word-embeddings-training&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 veldhub/veld_code__word2vec

Code velds encapsulating training and usage of word2vec models.

15
Experimental
2 veldhub/veld_chain__train_infer_wordembeddings_multiple_architectures__amc

Chain velds encapsulating training and evaluating static word embedding...

15
Experimental
3 veldhub/veld_chain__demo_nlp_generic_preprocessing

Demo of encapsulation of several commonly used NLP preprocessing workflows

15
Experimental
4 veldhub/veld_chain__demo_wordembeddings_multiarch

A VELD demonstration, aggregating heterogeneous modular workflows into a...

11
Experimental
5 veldhub/veld_code__glove

Code velds encapsulating training and usage of GloVe models.

11
Experimental
6 veldhub/veld_code__fasttext

Code velds encapsulating training and usage of fastText models.

11
Experimental