Word Embedding Vectors ML Frameworks

Pre-trained and trained word vector models (word2vec, fastText, etc.) and tools for generating, visualizing, or searching with word embeddings. Does NOT include general NLP frameworks, sentence embeddings, or document retrieval systems without word-level focus.

There are 35 word embedding vector frameworks tracked. Two score 50 or higher (Established tier). The highest-rated is kermitt2/delft at 69/100 with 415 stars and 1,230 monthly downloads.

Get all 35 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=word-embedding-vectors&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
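The same request can be made from a script. The following is a minimal sketch using only the Python standard library; the endpoint and query parameters are copied from the curl command above, while the helper names `build_url` and `fetch_projects` are hypothetical. The response schema is not documented here, so the sketch only decodes the JSON and makes no assumption about its fields. Whether the API accepts a `limit` larger than 20 is also not stated.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl example; everything else here is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="ml-frameworks",
              subcategory="word-embedding-vectors",
              limit=20):
    """Assemble the request URL with the same parameters as the curl example."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Perform the GET request and decode the JSON body.

    The structure of the returned object is an unknown here, so it is
    returned as-is for the caller to inspect.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_projects(build_url())` performs the network request, which counts against the daily request quota; printing the result with `json.dumps(data, indent=2)` is a simple way to inspect the payload before relying on any field names.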

| # | Framework | Description | Score | Tier |
|---|---|---|---|---|
| 1 | kermitt2/delft | a Deep Learning Framework for Text https://delft.readthedocs.io/ | 69 | Established |
| 2 | yoeo/guesslang | Detect the programming language of a source code | 50 | Established |
| 3 | matthewdeanmartin/whats_that_code | detect programming language of source in pure python from an ensemble of classifiers | 48 | Emerging |
| 4 | airalcorn2/Deep-Semantic-Similarity-Model | My Keras implementation of the Deep Semantic Similarity Model... | 44 | Emerging |
| 5 | christiansafka/img2vec | 🔥 Use pre-trained models in PyTorch to extract vector embeddings for any image | 43 | Emerging |
| 6 | microsoft/NeuronBlocks | NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego | 41 | Emerging |
| 7 | marian-nmt/sotastream | A library for data streaming and augmentation | 41 | Emerging |
| 8 | inejc/paragraph-vectors | 📄 A PyTorch implementation of Paragraph Vectors (doc2vec). | 41 | Emerging |
| 9 | oalieno/asm2vec-pytorch | Unofficial implementation of asm2vec using pytorch ( with GPU acceleration ) | 37 | Emerging |
| 10 | carlomarxdk/life2vec-light | Basic implementation of the life2vec model with the dummy data. | 35 | Emerging |
| 11 | m-a-n-i-f-e-s-t/retention | Language modeling with linear-cost context | 35 | Emerging |
| 12 | Tixierae/deep_learning_NLP | Keras, PyTorch, and NumPy Implementations of Deep Learning Architectures for NLP | 35 | Emerging |
| 13 | ArslanJajja1/bengio-nplm-pytorch | A from-scratch PyTorch implementation and tutorial of the landmark 2003... | 32 | Emerging |
| 14 | Guillem96/data2vec-vision | PyTorch implementation of Data2Vec self-supervised approach for vision use cases. | 30 | Emerging |
| 15 | greenelab/word-lapse | Explore how a word changes over time | 28 | Experimental |
| 16 | nacayu/CRFNet_Tensorflow2.4.1 | Reproduce CRFNet official implementation on windows10, tensorflow2.4.1 | 28 | Experimental |
| 17 | ashutosh1919/data2vec-pytorch | Ready to run PyTorch implementation of Data2Vec 2.0: Highly efficient... | 28 | Experimental |
| 18 | remykarem/word2vec-demo | Word2Vec demo on the browser | 24 | Experimental |
| 19 | queelius/infinigram | High-speed corpus-based language model using suffix arrays for... | 24 | Experimental |
| 20 | SisonkeBiotik-Africa/MeSH2Matrix | A set of Python codes for the classification of biomedical relations based... | 23 | Experimental |
| 21 | kalnee/trivor-nlp | trivor-nlp leverages the use of NPL (Natural Language Processing) to detect... | 23 | Experimental |
| 22 | czcorpus/cqlizer | Predicting time-consuming CQL queries in language corpora | 20 | Experimental |
| 23 | dpernes/spamhmm | Sparse Mixture of Hidden Markov Models for Graph Connected Entities | 18 | Experimental |
| 24 | euskadi31/go-ngram | an n-gram is a contiguous sequence of n items from a given sequence of text... | 17 | Experimental |
| 25 | Vidhi1290/Word2Vec-and-FastText-Word-Embedding-with-Gensim-in-Python | This project explores the realm of Natural Language Processing (NLP) using... | 16 | Experimental |
| 26 | eliaskempf/ideal_words | A PyTorch implementation of ideal word computation. | 15 | Experimental |
| 27 | yashlad27/neural-network-language-model | Neural Network Language Model & Optimization Engine - Deep learning... | 15 | Experimental |
| 28 | Sikukhayatinamuna/ml-ccg | 🤖 Enhance machine learning with CCG for improved performance and simplified... | 14 | Experimental |
| 29 | knowledge-express/skipgram | For all your n-gram and skip-gram needs 🔠 | 13 | Experimental |
| 30 | estamos/word2vec-thesis | 🎓 Diploma Thesis \| A Word2vec comparative study of CBOW and Skipgram | 13 | Experimental |
| 31 | PritK99/POS-Tagging | Parts-of-Speech Tagging using Hidden Markov Model and Viterbi Algorithm | 13 | Experimental |
| 32 | ryderwishart/ancient-greek-word2vec | Global vector modelling notebooks for Ancient Greek | 13 | Experimental |
| 33 | jinglescode/phrases-extraction-wordcloud | Extracting n-grams from text and display in beautiful D3 word cloud. | 13 | Experimental |
| 34 | amirabbasasadi/persian-wordvectors | An Interactive Visualization of Persian Word Vectors Trained on Wikipedia | 12 | Experimental |
| 35 | pmav/deep-learning-subversive-spell-checker | A subversive spell checker to generate errors. | 12 | Experimental |
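The tier labels in the listing appear to follow score bands. The sketch below maps a score to a tier using cutoffs inferred only from the rows listed here: 50 is the lowest Established score, 30 is the lowest Emerging score, and 28 is the highest Experimental score. The actual scoring rules are not stated on this page, so both the thresholds and the function name `tier_for` are assumptions, and a score of 29 falls in a gap the listing does not cover.

```python
# Cutoffs inferred from the listed rows; not an official definition.
ESTABLISHED_MIN = 50  # lowest listed Established score (yoeo/guesslang)
EMERGING_MIN = 30     # lowest listed Emerging score (Guillem96/data2vec-vision)


def tier_for(score: int) -> str:
    """Return the tier label a score would receive under the assumed cutoffs."""
    if score >= ESTABLISHED_MIN:
        return "Established"
    if score >= EMERGING_MIN:
        return "Emerging"
    return "Experimental"


# Spot-check against a few (score, tier) pairs taken from the listing.
listed = [(69, "Established"), (50, "Established"),
          (48, "Emerging"), (30, "Emerging"),
          (28, "Experimental"), (12, "Experimental")]
mismatches = [(s, t) for s, t in listed if tier_for(s) != t]
print("mismatches:", mismatches)
```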