Semantic Similarity Measurement Embedding Tools

Tools and benchmarks for measuring semantic similarity between text units (words, sentences, documents) using embeddings and NLP methods. Includes evaluation datasets, comparison studies, and similarity calculation implementations. Does NOT include clustering applications, search systems, or downstream tasks like recommendation or matching.

There are 25 semantic similarity measurement tools tracked. The highest-rated is Garrafao/LSCDetection at 35/100 with 31 stars.

Get all 25 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=semantic-similarity-measurement&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 Garrafao/LSCDetection

Data Sets and Models for Evaluation of Lexical Semantic Change Detection

35
Emerging
2 RepoAnalysis/RepoSim

This repository contains experiments on comparing the similarity of Python...

25
Experimental
3 cod3licious/simec

Similarity Encoder (SimEc) Neural Network Framework for learning low...

24
Experimental
4 kunal4040/hybrid-search-eval

🔍 Benchmark embedding models in hybrid search with Weaviate. Evaluate MRR@K,...

24
Experimental
5 cr1m5onk1ng/text_similarity

A nlp library for text similarity based on Transformer models

22
Experimental
6 taskswithcode/sentence_similarity_app

App to compare state-of-the-art models for sentence similarity task

22
Experimental
7 arclabs561/decksage

Card similarity and deck operations for trading card games (Magic, Pokemon, Yu-Gi-Oh)

22
Experimental
8 MatthewPaver/sentence-similarity-analysis

Semantic sentence similarity demonstration using transformer-based embedding...

22
Experimental
9 jorge-martinez-gil/uwsd

Context-Aware Semantic Similarity Measurement for Unsupervised Word Sense...

22
Experimental
10 pabvald/semantic-similarity

Comparison of methods based on pre-trained Word2Vec, GloVe and FastText...

21
Experimental
11 Reilly-ConceptsCognitionLab/SemanticDistance

Computes Pairwise Semantic Distance Between Tokens (ngrams, words, turns) in...

21
Experimental
12 ymoslem/Sentence-Similarity

Sentence Similarity Approaches

19
Experimental
13 albertrial/SemEval-2012-task-6

Semantic Textual Similarity: task which consists in evaluating the degree of...

17
Experimental
14 paulbricman/semantica

Extending conceptual thinking with semantic embeddings.

16
Experimental
15 Juancinho/similitud-palabras

Es una implementaciĂłn en python para visualizar la idea de los embeddings de...

15
Experimental
16 bloomberg/semantic-similarity-covariance-shrinkage

Code release for Semantic Similarity Covariance Matrix Shrinkage

12
Experimental
17 colindeseroux/semantop

đź§ Semantop is base of word2vec to create a french semantics game

12
Experimental
18 anebz/eu-sim

Exploring semantic similarities between contextualized embeddings

12
Experimental
19 manasRK/semantica

All you need for text preprocessing for NLP

11
Experimental
20 asteriscuz/semantic-similarity-calculator

Semantic similarity calculator using sentence embeddings

11
Experimental
21 xelandr3/jabruuuhtix

Real-time multiplayer word game inspired by Cémantix, based on semantic...

11
Experimental
22 joaquimgomez/BachelorsThesis-TextSimilarityMeasures

Code and models used in my Bachelor’s Degree Thesis about large text...

11
Experimental
23 LironOhana/sentence-similarity-embeddings

M.Sc. assignment — Sentence similarity with embeddings (STS Benchmark;...

11
Experimental
24 cwc09262/psalms-nlp-research

This is a research specific repository based on a foundation built from the...

11
Experimental
25 natylaza89/semantic-similarity-llm-dating-app

Semantic Similarity LLM Dating App using Python 3.12, FastAPI, WebSockets,...

10
Experimental