Text Similarity Matching NLP Tools

Tools for computing similarity scores between text passages, phrases, or documents using embeddings, distance metrics, or statistical methods. Does NOT include semantic search engines, plagiarism detection systems, or applications that use similarity as a component of larger NLP tasks.

There are 49 text similarity matching tools tracked. 1 score above 50 (established tier). The highest-rated is shibing624/similarity at 54/100 with 1,571 stars.

Get all 49 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=text-similarity-matching&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 shibing624/similarity

similarity: Text similarity calculation Toolkit for Java....

54
Established
2 eBay/Sequence-Semantic-Embedding

Tools and recipes to train deep learning models and build services for NLP...

43
Emerging
3 RandolphVI/Text-Pairs-Relation-Classification

About Text Pairs (Sentence Level) Classification (Similarity Modeling) Based...

41
Emerging
4 piotrmaciejbednarski/text-similarity-node

High-performance and memory efficient native C++ text similarity algorithms...

39
Emerging
5 MartinoMensio/spacy-universal-sentence-encoder

Google USE (Universal Sentence Encoder) for spaCy

39
Emerging
6 brianrisk/simphile-text-similarity-nlp

Python Text Similarity NLP Libray

36
Emerging
7 brunoarine/findlike

Command-line tool that finds lexically similar documents in relation to a...

36
Emerging
8 nullnull/simstring

A Python implementation of the SimString, a simple and efficient algorithm...

35
Emerging
9 likejazz/Siamese-LSTM

Siamese LSTM for evaluating semantic similarity between sentences of the...

34
Emerging
10 jinglescode/textual-similarity-universal-sentence-encoder

Extract embeddings and group sentences with universal sentence encoder...

34
Emerging
11 zqhZY/semanaly

semantic analysis using word2vector, doc2vector,lstm and other method....

33
Emerging
12 Neurotech-HQ/pysimilar

A python library for computing the similarity between two string(text) based...

32
Emerging
13 vishnumani2009/siamese-text-similarity

Tensorflow based implementation of deep siamese LSTM network for sentence...

31
Emerging
14 google/unisim

UniSim is a package for efficient similarity computation, fuzzy matching,...

29
Experimental
15 ocramz/ncd-tree

text similarity search trees based on Normalized Compression Distance

27
Experimental
16 adrianmfi/git-semantic-similarity

Search git commit messages by semantic similarity with embeddings from...

26
Experimental
17 eu90h/semantic-dictionary

A Python dictionary that uses semantic similarity for key matching instead...

26
Experimental
18 amansrivastava17/bns-short-text-similarity

📖 Use Bi-normal Separation to find document vectors which is used to...

26
Experimental
19 rameshjes/Semantic-Textual-Similarity

Natural Language Processing using NLTK and Spacy

26
Experimental
20 qiangzi11hao/Semantic-Textual-Similarity

语义相似度

25
Experimental
21 kenneth-lange/java-nlp-text-similarity

Measure the similarity between different text documents.

25
Experimental
22 caiyinqiong/Semantic-Retrieval-Models

A curated list of awesome papers for Semantic Retrieval (TOIS Accepted:...

25
Experimental
23 victormurcia/PyUMLS_Similarity

This package computes a variety of similarity metrics between concepts...

24
Experimental
24 PedroMTQ/UniFunc

Tool for similarity analysis of protein function annotations.

24
Experimental
25 ankushbhatia2/Document_similarity_research_notebook

Jupyter notebook for my research in Document Similarity.

24
Experimental
26 vukbatanovic/STSFineGrain

A collection of Semantic Textual Similarity (STS) models and a framework for...

23
Experimental
27 Aghabeiki/sentences2vector

Get Similarity of two sentences based gensim word2vec trained model.

23
Experimental
28 goliasz/pio-template-text-similarity

Text similarity based on Word2Vec vectors.

22
Experimental
29 khalidbelk/jaccard

🧬 Calculate the similarity index between two texts

22
Experimental
30 Aditya1001001/similarity-and-embedding-app

Learn about text similarity measures & text embedding methods.

22
Experimental
31 mforstenhaeusler/Siamese-LSTM-for-Semantic-Similarity-PyTorch

This repositpory entails an implementation of a Deep Learning Pipeline that...

19
Experimental
32 chigwell/compario

A new package that uses large language models and pattern matching to...

19
Experimental
33 sauravjoshi23/SiameseNet-Loss

Text Classification Using Siamese Neural Networks - Contrastive Loss,...

18
Experimental
34 nurhaankhaled/SentenceToSentenceSemanticSimilarity

First rank winner at the Natural Language Processing competition FCIS-ASU 2021-2022.

17
Experimental
35 hiun/siamese-nn

Tensorflow Implementation of Siamese Neural Network for Learning Sentence Similarity

15
Experimental
36 DavidHarar/Siamese-Networks-for-name-nickname-similarity

Using siamese networks to learn a similarity metric between names and nicknames.

14
Experimental
37 FDSRashid/Semantic-Similarity-Arabic

A package which implements various ways to measure semantic similarity of...

14
Experimental
38 tlack/semantics

Semantic similarity via text embeddings in Elixir - powered by...

13
Experimental
39 likithponnanna/text-similarity-scorer

This repo contains files to code that deploys an API endpoint for a user to...

12
Experimental
40 tos-kamiya/d2vg

A Doc2Vec grep. On your desktop.

12
Experimental
41 mrpawan-gupta/TextTo

The Repository Contains The CPP Program to Calculate the Cosine Similarity...

12
Experimental
42 plain-jane-gray/parse-PDF-NLP-ML

Parses apart a PDF file into separate documents and then uses Natural...

12
Experimental
43 VanekPetr/text-similarity-ranking

Algorithm to rank text similarity between set of strings and given inputs

12
Experimental
44 adrianmfi/semantic-similarity-cli

semantic-similarity-cli is a command-line tool leveraging NLP models to...

12
Experimental
45 t-ski/string-similarity-algorithms

Common string similarity algorithm implementations.

11
Experimental
46 SirapopChu/DSI314_Keyword-similarity-matching-to-assess-the-potential-of-Pathum-Thani-provincial-

The project “Keyword Similarity Matching to Assess the Potential of Pathum...

10
Experimental
47 plandes/spanmatch

Unsupervised Position-Based Semantic Matching

10
Experimental
48 tos-kamiya/stng

An off-the-shelf grep-like tool that performs semantic similarity search

10
Experimental
49 tonyfan08/NLP_TEXT_NER_SIMILARITY

AI NLP Competition

10
Experimental