Hironsan/awesome-embedding-models
A curated list of awesome embedding models tutorials, projects and communities.
Organizes foundational embedding research across word-level (Word2Vec, GloVe, FastText), contextual (ELMo, BERT), and sentence/document models, with curated links to landmark papers, evaluation benchmarks, and pre-trained model repositories. Covers the evolution from count-based to neural prediction methods, including critical evaluation frameworks that distinguish intrinsic versus extrinsic performance metrics. Aggregates training datasets (Wikipedia), benchmark suites (SemEval, WordSimilarity-353), and downloadable model implementations across TensorFlow Hub and AllenNLP.
1,827 stars. No commits in the last 6 months.
Stars
1,827
Forks
249
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Apr 07, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/Hironsan/awesome-embedding-models"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Higher-rated alternatives
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
yannvgn/laserembeddings
LASER multilingual sentence embeddings as a pip package
harmonydata/harmony
The Harmony Python library: a research tool for psychologists to harmonise data and...
embeddings-benchmark/results
Data for the MTEB leaderboard
fresh-stack/freshstack
This repository helps you evaluate your models on the FreshStack benchmark!