Text Similarity Matching NLP Tools
Tools for computing similarity scores between text passages, phrases, or documents using embeddings, distance metrics, or statistical methods. Does NOT include semantic search engines, plagiarism detection systems, or applications that use similarity as a component of larger NLP tasks.
There are 49 text similarity matching tools tracked. 1 score above 50 (established tier). The highest-rated is shibing624/similarity at 54/100 with 1,571 stars.
Get all 49 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=text-similarity-matching&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
shibing624/similarity
similarity: Text similarity calculation Toolkit for Java.... |
|
Established |
| 2 |
eBay/Sequence-Semantic-Embedding
Tools and recipes to train deep learning models and build services for NLP... |
|
Emerging |
| 3 |
RandolphVI/Text-Pairs-Relation-Classification
About Text Pairs (Sentence Level) Classification (Similarity Modeling) Based... |
|
Emerging |
| 4 |
piotrmaciejbednarski/text-similarity-node
High-performance and memory efficient native C++ text similarity algorithms... |
|
Emerging |
| 5 |
MartinoMensio/spacy-universal-sentence-encoder
Google USE (Universal Sentence Encoder) for spaCy |
|
Emerging |
| 6 |
brianrisk/simphile-text-similarity-nlp
Python Text Similarity NLP Libray |
|
Emerging |
| 7 |
brunoarine/findlike
Command-line tool that finds lexically similar documents in relation to a... |
|
Emerging |
| 8 |
nullnull/simstring
A Python implementation of the SimString, a simple and efficient algorithm... |
|
Emerging |
| 9 |
likejazz/Siamese-LSTM
Siamese LSTM for evaluating semantic similarity between sentences of the... |
|
Emerging |
| 10 |
jinglescode/textual-similarity-universal-sentence-encoder
Extract embeddings and group sentences with universal sentence encoder... |
|
Emerging |
| 11 |
zqhZY/semanaly
semantic analysis using word2vector, doc2vector,lstm and other method.... |
|
Emerging |
| 12 |
Neurotech-HQ/pysimilar
A python library for computing the similarity between two string(text) based... |
|
Emerging |
| 13 |
vishnumani2009/siamese-text-similarity
Tensorflow based implementation of deep siamese LSTM network for sentence... |
|
Emerging |
| 14 |
google/unisim
UniSim is a package for efficient similarity computation, fuzzy matching,... |
|
Experimental |
| 15 |
ocramz/ncd-tree
text similarity search trees based on Normalized Compression Distance |
|
Experimental |
| 16 |
adrianmfi/git-semantic-similarity
Search git commit messages by semantic similarity with embeddings from... |
|
Experimental |
| 17 |
eu90h/semantic-dictionary
A Python dictionary that uses semantic similarity for key matching instead... |
|
Experimental |
| 18 |
amansrivastava17/bns-short-text-similarity
📖 Use Bi-normal Separation to find document vectors which is used to... |
|
Experimental |
| 19 |
rameshjes/Semantic-Textual-Similarity
Natural Language Processing using NLTK and Spacy |
|
Experimental |
| 20 |
qiangzi11hao/Semantic-Textual-Similarity
è¯ä¹‰ç›¸ä¼¼åº¦ |
|
Experimental |
| 21 |
kenneth-lange/java-nlp-text-similarity
Measure the similarity between different text documents. |
|
Experimental |
| 22 |
caiyinqiong/Semantic-Retrieval-Models
A curated list of awesome papers for Semantic Retrieval (TOIS Accepted:... |
|
Experimental |
| 23 |
victormurcia/PyUMLS_Similarity
This package computes a variety of similarity metrics between concepts... |
|
Experimental |
| 24 |
PedroMTQ/UniFunc
Tool for similarity analysis of protein function annotations. |
|
Experimental |
| 25 |
ankushbhatia2/Document_similarity_research_notebook
Jupyter notebook for my research in Document Similarity. |
|
Experimental |
| 26 |
vukbatanovic/STSFineGrain
A collection of Semantic Textual Similarity (STS) models and a framework for... |
|
Experimental |
| 27 |
Aghabeiki/sentences2vector
Get Similarity of two sentences based gensim word2vec trained model. |
|
Experimental |
| 28 |
goliasz/pio-template-text-similarity
Text similarity based on Word2Vec vectors. |
|
Experimental |
| 29 |
khalidbelk/jaccard
🧬 Calculate the similarity index between two texts |
|
Experimental |
| 30 |
Aditya1001001/similarity-and-embedding-app
Learn about text similarity measures & text embedding methods. |
|
Experimental |
| 31 |
mforstenhaeusler/Siamese-LSTM-for-Semantic-Similarity-PyTorch
This repositpory entails an implementation of a Deep Learning Pipeline that... |
|
Experimental |
| 32 |
chigwell/compario
A new package that uses large language models and pattern matching to... |
|
Experimental |
| 33 |
sauravjoshi23/SiameseNet-Loss
Text Classification Using Siamese Neural Networks - Contrastive Loss,... |
|
Experimental |
| 34 |
nurhaankhaled/SentenceToSentenceSemanticSimilarity
First rank winner at the Natural Language Processing competition FCIS-ASU 2021-2022. |
|
Experimental |
| 35 |
hiun/siamese-nn
Tensorflow Implementation of Siamese Neural Network for Learning Sentence Similarity |
|
Experimental |
| 36 |
DavidHarar/Siamese-Networks-for-name-nickname-similarity
Using siamese networks to learn a similarity metric between names and nicknames. |
|
Experimental |
| 37 |
FDSRashid/Semantic-Similarity-Arabic
A package which implements various ways to measure semantic similarity of... |
|
Experimental |
| 38 |
tlack/semantics
Semantic similarity via text embeddings in Elixir - powered by... |
|
Experimental |
| 39 |
likithponnanna/text-similarity-scorer
This repo contains files to code that deploys an API endpoint for a user to... |
|
Experimental |
| 40 |
tos-kamiya/d2vg
A Doc2Vec grep. On your desktop. |
|
Experimental |
| 41 |
mrpawan-gupta/TextTo
The Repository Contains The CPP Program to Calculate the Cosine Similarity... |
|
Experimental |
| 42 |
plain-jane-gray/parse-PDF-NLP-ML
Parses apart a PDF file into separate documents and then uses Natural... |
|
Experimental |
| 43 |
VanekPetr/text-similarity-ranking
Algorithm to rank text similarity between set of strings and given inputs |
|
Experimental |
| 44 |
adrianmfi/semantic-similarity-cli
semantic-similarity-cli is a command-line tool leveraging NLP models to... |
|
Experimental |
| 45 |
t-ski/string-similarity-algorithms
Common string similarity algorithm implementations. |
|
Experimental |
| 46 |
SirapopChu/DSI314_Keyword-similarity-matching-to-assess-the-potential-of-Pathum-Thani-provincial-
The project “Keyword Similarity Matching to Assess the Potential of Pathum... |
|
Experimental |
| 47 |
plandes/spanmatch
Unsupervised Position-Based Semantic Matching |
|
Experimental |
| 48 |
tos-kamiya/stng
An off-the-shelf grep-like tool that performs semantic similarity search |
|
Experimental |
| 49 |
tonyfan08/NLP_TEXT_NER_SIMILARITY
AI NLP Competition |
|
Experimental |