All Embedding Tools
4,013 tools ranked by quality score
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark |
|
Verified |
| 2 |
aiming-lab/SimpleMem
SimpleMem: Efficient Lifelong Memory for LLM Agents |
|
Verified |
| 3 |
xhluca/bm25s
Fast lexical search implementing BM25 in Python |
|
Verified |
| 4 |
MinishLab/model2vec
Fast State-of-the-Art Static Embeddings |
|
Verified |
| 5 |
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs |
|
Verified |
| 6 |
docarray/docarray
Represent, send, store and search multimodal data |
|
Verified |
| 7 |
srbhr/Resume-Matcher
Improve your resumes with Resume Matcher. Get insights, keyword suggestions... |
|
Verified |
| 8 |
qdrant/fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding |
|
Verified |
| 9 |
typesense/typesense
Open Source alternative to Algolia + Pinecone and an Easier-to-Use... |
|
Verified |
| 10 |
vllm-project/semantic-router
System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge |
|
Verified |
| 11 |
shibing624/text2vec
text2vec, text to vector.... |
|
Verified |
| 12 |
airweave-ai/airweave
Open-source context retrieval layer for AI agents |
|
Verified |
| 13 |
inception-project/inception
INCEpTION provides a semantic annotation platform offering intelligent... |
|
Established |
| 14 |
lfnovo/esperanto
A unified interface for various AI model providers |
|
Established |
| 15 |
Blaizzy/mlx-embeddings
MLX-Embeddings is the best package for running Vision and Language Embedding... |
|
Established |
| 16 |
getzep/zep
Zep | Examples, Integrations, & More |
|
Established |
| 17 |
zilliztech/GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index. |
|
Established |
| 18 |
shibing624/similarities
Similarities: a toolkit for similarity calculation and semantic search.... |
|
Established |
| 19 |
gorse-io/gorse
AI powered open source recommender system engine supports classical/LLM... |
|
Established |
| 20 |
roshan-research/hazm
Persian NLP Toolkit |
|
Established |
| 21 |
cocoindex-io/cocoindex
Data transformation framework for AI. Ultra performant, with incremental... |
|
Established |
| 22 |
huggingface/text-embeddings-inference
A blazing fast inference solution for text embeddings models |
|
Established |
| 23 |
eliorc/node2vec
Implementation of the node2vec algorithm. |
|
Established |
| 24 |
ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors. |
|
Established |
| 25 |
aws-samples/amazon-bedrock-samples
This repository contains examples for customers to get started using the... |
|
Established |
| 26 |
brianpetro/obsidian-smart-connections
Chat with your notes & see links to related content with AI embeddings. Use... |
|
Established |
| 27 |
Merck/Sapiens
Sapiens is a human antibody language model based on BERT. |
|
Established |
| 28 |
zilliztech/memsearch
A Markdown-first memory system, a standalone library for any AI agent.... |
|
Established |
| 29 |
NotJoeMartinez/yt-fts
YouTube Full Text Search - Search all of YouTube from the command line |
|
Established |
| 30 |
jparkerweb/semantic-chunking
🍱 semantic-chunking ⇢ semantically create chunks from large document for... |
|
Established |
| 31 |
dtsola/xiaoyaosearch
小遥搜索,听懂你的话、看懂你的图,用AI找到本地任何文件。让搜索像聊天一样简单。XiaoyaoSearch: Understands your... |
|
Established |
| 32 |
Ryandonofrio3/osgrep
Open Source Semantic Search for your AI Agent |
|
Established |
| 33 |
explosion/sense2vec
🦆 Contextually-keyed word vectors |
|
Established |
| 34 |
Anush008/fastembed-rs
Rust library for vector embeddings and reranking. |
|
Established |
| 35 |
TorchDR/TorchDR
TorchDR - PyTorch Dimensionality Reduction |
|
Established |
| 36 |
codelion/adaptive-classifier
A flexible, adaptive classification system for dynamic text classification |
|
Established |
| 37 |
ssrajadh/sentrysearch
Semantic search over videos using Gemini Embedding 2. |
|
Established |
| 38 |
cosmosgl/graph
GPU-accelerated force graph layout and rendering |
|
Established |
| 39 |
winkjs/wink-bm25-text-search
Fast Full Text Search based on BM25 |
|
Established |
| 40 |
patrickfrank1/chesspos
Embedding based chess position search and embedding learning for chess positions |
|
Established |
| 41 |
unum-cloud/UForm
Pocket-Sized Multimodal AI for content understanding and generation across... |
|
Established |
| 42 |
freedmand/semantra
Multi-tool for semantic search |
|
Established |
| 43 |
MilaNLProc/contextualized-topic-models
A python package to run contextualized topic modeling. CTMs combine... |
|
Established |
| 44 |
jina-ai/clip-as-service
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP |
|
Established |
| 45 |
AmenRa/retriv
A Python Search Engine for Humans 🥸 |
|
Established |
| 46 |
curiosity-ai/catalyst
🚀 Catalyst is a C# Natural Language Processing library built for speed.... |
|
Established |
| 47 |
michaelfeil/infinity
Infinity is a high-throughput, low-latency serving engine for... |
|
Established |
| 48 |
yoanbernabeu/grepai
Semantic Search & Call Graphs for AI Agents (100% Local) |
|
Established |
| 49 |
justincasher/lean-explore
A search engine for Lean 4 declarations |
|
Established |
| 50 |
deepset-ai/haystack-core-integrations
Additional packages (components, document stores and the likes) to extend... |
|
Established |
| 51 |
microsoft/simplechat
Secure AI conversations with documents, video, audio, and more. Personal... |
|
Established |
| 52 |
deepset-ai/haystack-tutorials
Here you can find all the Tutorials for Haystack 📓 |
|
Established |
| 53 |
ContextualAI/gritlm
Generative Representational Instruction Tuning |
|
Established |
| 54 |
byte5ai/palaia
Palaia — Local, crash-safe memory for AI agents. Semantic vector search... |
|
Established |
| 55 |
Yomguithereal/talisman
Straightforward fuzzy matching, information retrieval and NLP building... |
|
Established |
| 56 |
rom1504/clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them |
|
Established |
| 57 |
awinml/voyage-embedders-haystack
Custom components for Haystack for creating embeddings and reranking... |
|
Established |
| 58 |
Accenture/AmpliGraph
Python library for Representation Learning on Knowledge Graphs... |
|
Established |
| 59 |
usc-isi-i2/kgtk
Knowledge Graph Toolkit |
|
Established |
| 60 |
unum-cloud/USearch
Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary... |
|
Established |
| 61 |
ascottbell/maasv
Memory Architecture as a Service — cognition layer for AI assistants.... |
|
Established |
| 62 |
towhee-io/towhee
Towhee is a framework that is dedicated to making neural data processing... |
|
Established |
| 63 |
aryn-ai/sycamore
🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data. |
|
Established |
| 64 |
TeleAI-UAGI/telemem
TeleMem is a high-performance drop-in replacement for Mem0, featuring... |
|
Established |
| 65 |
starthackHQ/Contextinator
Turning messy repos into weapons of mass structured context. |
|
Established |
| 66 |
Clay-foundation/model
The Clay Foundation Model - An open source AI model and interface for Earth |
|
Established |
| 67 |
Azure/azure-search-vector-samples
A repository of code samples for Vector search capabilities in Azure AI Search. |
|
Established |
| 68 |
asreview/asreview-dory
Official extension for ASReview LAB enabling state-of-the-art NLP models... |
|
Established |
| 69 |
neuml/annotateai
📝 Automatically annotate papers using LLMs |
|
Established |
| 70 |
apocas/restai
RESTai is an AIaaS (AI as a Service) open-source platform. Built on top of... |
|
Established |
| 71 |
yannvgn/laserembeddings
LASER multilingual sentence embeddings as a pip package |
|
Established |
| 72 |
NyanNyanovich/nyan
Automatic news aggregator in Telegram / Автоматический агрегатор новостей в Телеграме |
|
Established |
| 73 |
run-llama/semtools
Semantic search and document parsing tools for the command line |
|
Established |
| 74 |
RichmondAlake/memorizz
MemoRizz: A Python library serving as a memory layer for AI applications.... |
|
Established |
| 75 |
fellanH/context-vault
Persistent memory for AI agents — save and search knowledge across sessions... |
|
Established |
| 76 |
artitw/text2text
Text2Text Language Modeling Toolkit |
|
Established |
| 77 |
chakki-works/chakin
Simple downloader for pre-trained word vectors |
|
Established |
| 78 |
joshuaswarren/openclaw-engram
Local-first memory plugin for OpenClaw AI agents. LLM-powered extraction,... |
|
Established |
| 79 |
EBISPOT/ols4
The EMBL-EBI Ontology Lookup Service (OLS) |
|
Established |
| 80 |
supabase/embeddings-generator
GitHub Action to generate embeddings from the markdown files in your repository. |
|
Established |
| 81 |
harmonydata/harmony
The Harmony Python library: a research tool for psychologists to harmonise... |
|
Established |
| 82 |
koursaros-ai/nboost
NBoost is a scalable, search-api-boosting platform for deploying transformer... |
|
Established |
| 83 |
gmickel/gno
Local AI-powered document search and editing with first-in-class hybrid... |
|
Established |
| 84 |
Michael-JB/bm25
A BM25 embedder, scorer, and search engine, written in Rust. |
|
Established |
| 85 |
predict-idlab/pyRDF2Vec
🐍 Python Implementation and Extension of RDF2Vec |
|
Established |
| 86 |
Dadmatech/DadmaTools
DadmaTools is a Persian NLP tools developed by Dadmatech Co. |
|
Established |
| 87 |
Embedding/Chinese-Word-Vectors
100+ Chinese Word Vectors 上百种预训练中文词向量 |
|
Established |
| 88 |
tensorflow/hub
A library for transfer learning by reusing parts of TensorFlow models. |
|
Established |
| 89 |
joelvdhoeven/memord
A local shared memory layer for all your AI tools — Claude, Cursor,... |
|
Established |
| 90 |
vector-ai/vectorai
Vector AI — A platform for building vector based applications. Encode, query... |
|
Established |
| 91 |
lotus-data/lotus
AI-Powered Data Processing: Use LOTUS to process all of your datasets with... |
|
Established |
| 92 |
Hamza5/file-brain
Smart local file search app that understands your files |
|
Established |
| 93 |
scarletkc/vexor
A semantic search engine for files and code. |
|
Established |
| 94 |
prosperitypirate/codexfi
Persistent memory for OpenCode AI agents. Embedded LanceDB + Voyage AI... |
|
Established |
| 95 |
primeqa/primeqa
The prime repository for state-of-the-art Multilingual Question Answering... |
|
Established |
| 96 |
gao-lab/GLUE
Graph-linked unified embedding for single-cell multi-omics data integration |
|
Established |
| 97 |
MinishLab/model2vec-rs
Official Rust Implementation of Model2Vec |
|
Established |
| 98 |
remete618/widemem-ai
Next-gen AI memory layer with importance scoring, temporal decay,... |
|
Established |
| 99 |
mazzzystar/Queryable
Run OpenAI's CLIP and Apple's MobileCLIP model on iOS to search photos. |
|
Emerging |
| 100 |
jayzeng/pi-memory
Persistent memory extension for pi with daily logs, scratchpad, and optional... |
|
Emerging |