Semantic Textual Similarity Transformer Models

Tools for measuring and comparing semantic similarity between text passages using transformer embeddings and contextual analysis. Does NOT include general embedding extraction, text classification, or semantic communication without explicit similarity scoring.
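As a rough illustration of what "explicit similarity scoring" over embeddings can look like, the sketch below computes cosine similarity between two vectors. It is an assumption that any given project in this list scores similarity this way, and the vectors in the usage note are toy values standing in for transformer embeddings; the function name `cosine_similarity` is illustrative, not taken from any listed repository.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors.

    The result lies in [-1, 1]: 1 means the vectors point the same way,
    0 means they are orthogonal, -1 means they point in opposite directions.
    """
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")

    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))

    # A zero vector has no direction, so the score is undefined.
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")

    return dot / (norm_a * norm_b)
```

For example, `cosine_similarity([1.0, 0.0], [0.0, 1.0])` evaluates to `0.0` because the two toy vectors are orthogonal. In practice the inputs would be sentence or passage embeddings produced by a model.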

This category tracks 31 semantic textual similarity models. The highest-rated is DebarshiChanda/Amazon-ML-Challenge2021, which scores 30/100 and has 91 stars.

Get the projects as JSON (the request below sets limit=20)

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=semantic-textual-similarity&limit=20"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
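The same request can be made from Python using only the standard library. This is a minimal sketch: the endpoint and query parameters are copied from the curl command above, while the helper names `build_url` and `fetch_projects` are hypothetical. The response schema is not described on this page, so the parsed JSON is returned unchanged rather than interpreted.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl example; nothing else about the API is assumed.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="transformers",
              subcategory="semantic-textual-similarity",
              limit=20):
    """Build the query URL with the same parameters as the curl example."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10.0):
    """Send a GET request and return the decoded JSON body as-is.

    The structure of the response is an unknown here, so no fields
    are accessed; callers should inspect the result themselves.
    """
    request = urllib.request.Request(url, headers={"Accept": "application/json"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_projects(build_url())` performs the request and counts against the daily quota mentioned above. Whether a larger `limit` value is accepted is not stated on this page, so treat any other value as something to verify against the API.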

| # | Model | Description | Score (/100) | Tier |
|---|-------|-------------|--------------|------|
| 1 | DebarshiChanda/Amazon-ML-Challenge2021 | Scripts and Approach for Amazon ML Challenge | 30 | Emerging |
| 2 | shahrukhx01/siamese-nn-semantic-text-similarity | A repository containing comprehensive Neural Networks based PyTorch... | 29 | Experimental |
| 3 | HasanBGit/KSAA2026-Fine-Tashkeel | Official code for "Fine-Tashkeel at KSAA-2026" - Systematic evaluation of 18... | 29 | Experimental |
| 4 | fork123aniket/Contrastive-Learning-for-Sentence-Embeddings | Implementation of Simple Contrastive Learning-based Unsupervised approach to... | 25 | Experimental |
| 5 | toriving/haafor-challenge-2020 | The project for HAAFOR CHALLENGE 2020 | 24 | Experimental |
| 6 | DoctorLai/SimilarString | Compute the score of similarity between two strings | 24 | Experimental |
| 7 | quyethd95/HuggingFace-BAAI--BGERerankerv2m3 | 🔍 Explore BGE Reranker v2 m3 for effective sequence reranking using Hugging... | 22 | Experimental |
| 8 | AniketRajpoot/Automated-Headline-and-Sentiment-Generator | A very simple repo for Text Classification, Sentiment Identification and... | 22 | Experimental |
| 9 | TioAbiyyu/HuggingFace-BAAI--BGERerankerv2m3 | BGE Reranker v2 m3 demo with Hugging Face transformers for local and Azure cloud use. | 22 | Experimental |
| 10 | nuekkis/Turk-NLP | A comprehensive open-source NLP (natural language processing) library for Turkish. | 18 | Experimental |
| 11 | VinniLP/Document-Similarity-Finding-using-BERT | Document-Similarity-Finding-using-BERT | 18 | Experimental |
| 12 | sambitbhaumik/siamese-nn-sts | Project files contain PyTorch implementations for Siamese BiLSTM models for... | 17 | Experimental |
| 13 | ensarakbas77/LIFT-UP-Project-Similarity-Analysis | A system that compares newly submitted projects with previously completed... | 16 | Experimental |
| 14 | fblgit/model-similarity | Simple Model Similarities Analysis | 15 | Experimental |
| 15 | AnuragP004/amazon_ml_challenge_2025 | Multimodal deep learning for product price prediction using DistilBERT and... | 15 | Experimental |
| 16 | MohamedNassih/Evaluation-Pertinence-Juridique-ML | Relevance evaluation (question ↔ legal article) in French.... | 15 | Experimental |
| 17 | Sark-07/Copysafe | This is a document's contextual similarity evaluation tool. It captures the... | 14 | Experimental |
| 18 | nopgae/nlp-text-embedding-comparison | From N-grams to CLIP: comparing NLP embedding techniques including Word2Vec,... | 14 | Experimental |
| 19 | snairaadarsh/pdf-semantic-comparison | PDF comparison tool that uses transformer-based embeddings to identify... | 14 | Experimental |
| 20 | nfragakis/NLP-Knowledge-Extraction | Minimalist adaptation of more in-depth work concerning NLP knowledge... | 14 | Experimental |
| 21 | navneet-raghav/sentence-embedding-behavior-analysis | A controlled empirical study comparing lexical, static, and contextual... | 13 | Experimental |
| 22 | LazaUK/HuggingFace-BAAI-BGERerankerv2m3 | BGE Reranker v2 m3 demo with Hugging Face transformers for local and Azure cloud use. | 12 | Experimental |
| 23 | philipphager/baidu-bert-model | SIGIR 2024 - Train flax-based MonoBERT rankers on Baidu-ULTR | 12 | Experimental |
| 24 | TharinduDR/STS-Transformers | Transformer based Semantic Textual Similarity | 11 | Experimental |
| 25 | pmadruga/ds-jobindex | Machine learning techniques (NLP) applied to the jobindex.dk dataset | 11 | Experimental |
| 26 | Vishnusai17/NLP | Natural Language Processing projects implementing Transformers, BERT, and... | 11 | Experimental |
| 27 | jklsnt/dictembed | A model! | 11 | Experimental |
| 28 | albertopirillo/nlp-project-2023 | Predicting the similarity between the pairs of sentences in the STS dataset.... | 10 | Experimental |
| 29 | ir2718/similarity-embedding-quality | [EMNLP 2024 Findings] Are ELECTRA's Sentence Embeddings Beyond Repair? The... | 10 | Experimental |
| 30 | najafmurtaza/General_Sentence_Embeddings | Extract Sentence Embeddings from Hugging Face pre-trained models. | 10 | Experimental |
| 31 | R-aryan/Text-Similarity-Using-BERT | End to End NLP text similarity project, using Kaggle Quora Dataset, served... | 10 | Experimental |