All Embedding Tools

4,013 tools ranked by quality score

Showing 1–100 of 4,013
# Tool Score Tier
1 embeddings-benchmark/mteb

MTEB: Massive Text Embedding Benchmark

99
Verified
2 aiming-lab/SimpleMem

SimpleMem: Efficient Lifelong Memory for LLM Agents

81
Verified
3 xhluca/bm25s

Fast lexical search implementing BM25 in Python

80
Verified
4 MinishLab/model2vec

Fast State-of-the-Art Static Embeddings

80
Verified
5 FlagOpen/FlagEmbedding

Retrieval and Retrieval-augmented LLMs

79
Verified
6 docarray/docarray

Represent, send, store and search multimodal data

76
Verified
7 srbhr/Resume-Matcher

Improve your resumes with Resume Matcher. Get insights, keyword suggestions...

75
Verified
8 qdrant/fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

74
Verified
9 typesense/typesense

Open Source alternative to Algolia + Pinecone and an Easier-to-Use...

74
Verified
10 vllm-project/semantic-router

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

73
Verified
11 shibing624/text2vec

text2vec, text to vector....

73
Verified
12 airweave-ai/airweave

Open-source context retrieval layer for AI agents

72
Verified
13 inception-project/inception

INCEpTION provides a semantic annotation platform offering intelligent...

69
Established
14 lfnovo/esperanto

A unified interface for various AI model providers

69
Established
15 Blaizzy/mlx-embeddings

MLX-Embeddings is the best package for running Vision and Language Embedding...

69
Established
16 getzep/zep

Zep | Examples, Integrations, & More

68
Established
17 zilliztech/GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

66
Established
18 shibing624/similarities

Similarities: a toolkit for similarity calculation and semantic search....

66
Established
19 gorse-io/gorse

AI powered open source recommender system engine supports classical/LLM...

66
Established
20 roshan-research/hazm

Persian NLP Toolkit

66
Established
21 cocoindex-io/cocoindex

Data transformation framework for AI. Ultra performant, with incremental...

65
Established
22 huggingface/text-embeddings-inference

A blazing fast inference solution for text embeddings models

65
Established
23 eliorc/node2vec

Implementation of the node2vec algorithm.

65
Established
24 ddangelov/Top2Vec

Top2Vec learns jointly embedded topic, document and word vectors.

65
Established
25 aws-samples/amazon-bedrock-samples

This repository contains examples for customers to get started using the...

64
Established
26 brianpetro/obsidian-smart-connections

Chat with your notes & see links to related content with AI embeddings. Use...

64
Established
27 Merck/Sapiens

Sapiens is a human antibody language model based on BERT.

62
Established
28 zilliztech/memsearch

A Markdown-first memory system, a standalone library for any AI agent....

62
Established
29 NotJoeMartinez/yt-fts

YouTube Full Text Search - Search all of YouTube from the command line

61
Established
30 jparkerweb/semantic-chunking

🍱 semantic-chunking ⇢ semantically create chunks from large document for...

61
Established
31 dtsola/xiaoyaosearch

小遥搜索,听懂你的话、看懂你的图,用AI找到本地任何文件。让搜索像聊天一样简单。XiaoyaoSearch: Understands your...

61
Established
32 Ryandonofrio3/osgrep

Open Source Semantic Search for your AI Agent

60
Established
33 explosion/sense2vec

🦆 Contextually-keyed word vectors

60
Established
34 Anush008/fastembed-rs

Rust library for vector embeddings and reranking.

60
Established
35 TorchDR/TorchDR

TorchDR - PyTorch Dimensionality Reduction

59
Established
36 codelion/adaptive-classifier

A flexible, adaptive classification system for dynamic text classification

59
Established
37 ssrajadh/sentrysearch

Semantic search over videos using Gemini Embedding 2.

58
Established
38 cosmosgl/graph

GPU-accelerated force graph layout and rendering

58
Established
39 winkjs/wink-bm25-text-search

Fast Full Text Search based on BM25

58
Established
40 patrickfrank1/chesspos

Embedding based chess position search and embedding learning for chess positions

58
Established
41 unum-cloud/UForm

Pocket-Sized Multimodal AI for content understanding and generation across...

58
Established
42 freedmand/semantra

Multi-tool for semantic search

58
Established
43 MilaNLProc/contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine...

58
Established
44 jina-ai/clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

58
Established
45 AmenRa/retriv

A Python Search Engine for Humans 🥸

57
Established
46 curiosity-ai/catalyst

🚀 Catalyst is a C# Natural Language Processing library built for speed....

57
Established
47 michaelfeil/infinity

Infinity is a high-throughput, low-latency serving engine for...

57
Established
48 yoanbernabeu/grepai

Semantic Search & Call Graphs for AI Agents (100% Local)

57
Established
49 justincasher/lean-explore

A search engine for Lean 4 declarations

57
Established
50 deepset-ai/haystack-core-integrations

Additional packages (components, document stores and the likes) to extend...

57
Established
51 microsoft/simplechat

Secure AI conversations with documents, video, audio, and more. Personal...

56
Established
52 deepset-ai/haystack-tutorials

Here you can find all the Tutorials for Haystack 📓

56
Established
53 ContextualAI/gritlm

Generative Representational Instruction Tuning

56
Established
54 byte5ai/palaia

Palaia — Local, crash-safe memory for AI agents. Semantic vector search...

56
Established
55 Yomguithereal/talisman

Straightforward fuzzy matching, information retrieval and NLP building...

56
Established
56 rom1504/clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

56
Established
57 awinml/voyage-embedders-haystack

Custom components for Haystack for creating embeddings and reranking...

56
Established
58 Accenture/AmpliGraph

Python library for Representation Learning on Knowledge Graphs...

56
Established
59 usc-isi-i2/kgtk

Knowledge Graph Toolkit

55
Established
60 unum-cloud/USearch

Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary...

55
Established
61 ascottbell/maasv

Memory Architecture as a Service — cognition layer for AI assistants....

55
Established
62 towhee-io/towhee

Towhee is a framework that is dedicated to making neural data processing...

54
Established
63 aryn-ai/sycamore

🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.

54
Established
64 TeleAI-UAGI/telemem

TeleMem is a high-performance drop-in replacement for Mem0, featuring...

54
Established
65 starthackHQ/Contextinator

Turning messy repos into weapons of mass structured context.

54
Established
66 Clay-foundation/model

The Clay Foundation Model - An open source AI model and interface for Earth

54
Established
67 Azure/azure-search-vector-samples

A repository of code samples for Vector search capabilities in Azure AI Search.

54
Established
68 asreview/asreview-dory

Official extension for ASReview LAB enabling state-of-the-art NLP models...

54
Established
69 neuml/annotateai

📝 Automatically annotate papers using LLMs

54
Established
70 apocas/restai

RESTai is an AIaaS (AI as a Service) open-source platform. Built on top of...

53
Established
71 yannvgn/laserembeddings

LASER multilingual sentence embeddings as a pip package

53
Established
72 NyanNyanovich/nyan

Automatic news aggregator in Telegram / Автоматический агрегатор новостей в Телеграме

53
Established
73 run-llama/semtools

Semantic search and document parsing tools for the command line

53
Established
74 RichmondAlake/memorizz

MemoRizz: A Python library serving as a memory layer for AI applications....

53
Established
75 fellanH/context-vault

Persistent memory for AI agents — save and search knowledge across sessions...

53
Established
76 artitw/text2text

Text2Text Language Modeling Toolkit

52
Established
77 chakki-works/chakin

Simple downloader for pre-trained word vectors

52
Established
78 joshuaswarren/openclaw-engram

Local-first memory plugin for OpenClaw AI agents. LLM-powered extraction,...

52
Established
79 EBISPOT/ols4

The EMBL-EBI Ontology Lookup Service (OLS)

52
Established
80 supabase/embeddings-generator

GitHub Action to generate embeddings from the markdown files in your repository.

52
Established
81 harmonydata/harmony

The Harmony Python library: a research tool for psychologists to harmonise...

52
Established
82 koursaros-ai/nboost

NBoost is a scalable, search-api-boosting platform for deploying transformer...

52
Established
83 gmickel/gno

Local AI-powered document search and editing with first-in-class hybrid...

52
Established
84 Michael-JB/bm25

A BM25 embedder, scorer, and search engine, written in Rust.

52
Established
85 predict-idlab/pyRDF2Vec

🐍 Python Implementation and Extension of RDF2Vec

51
Established
86 Dadmatech/DadmaTools

DadmaTools is a Persian NLP tools developed by Dadmatech Co.

51
Established
87 Embedding/Chinese-Word-Vectors

100+ Chinese Word Vectors 上百种预训练中文词向量

51
Established
88 tensorflow/hub

A library for transfer learning by reusing parts of TensorFlow models.

51
Established
89 joelvdhoeven/memord

A local shared memory layer for all your AI tools — Claude, Cursor,...

51
Established
90 vector-ai/vectorai

Vector AI — A platform for building vector based applications. Encode, query...

51
Established
91 lotus-data/lotus

AI-Powered Data Processing: Use LOTUS to process all of your datasets with...

51
Established
92 Hamza5/file-brain

Smart local file search app that understands your files

51
Established
93 scarletkc/vexor

A semantic search engine for files and code.

51
Established
94 prosperitypirate/codexfi

Persistent memory for OpenCode AI agents. Embedded LanceDB + Voyage AI...

51
Established
95 primeqa/primeqa

The prime repository for state-of-the-art Multilingual Question Answering...

50
Established
96 gao-lab/GLUE

Graph-linked unified embedding for single-cell multi-omics data integration

50
Established
97 MinishLab/model2vec-rs

Official Rust Implementation of Model2Vec

50
Established
98 remete618/widemem-ai

Next-gen AI memory layer with importance scoring, temporal decay,...

50
Established
99 mazzzystar/Queryable

Run OpenAI's CLIP and Apple's MobileCLIP model on iOS to search photos.

49
Emerging
100 jayzeng/pi-memory

Persistent memory extension for pi with daily logs, scratchpad, and optional...

49
Emerging
1 2 3 39 40 41 Next »