All Embedding Tools
4,013 tools ranked by quality score · Page 2 of 41
| # | Tool | Score | Tier |
|---|---|---|---|
| 101 | microsoft/kernel-memory: Research project. A Memory solution for users, teams, and applications. | | Emerging |
| 102 | IITH-Compilers/IR2Vec: Implementation of IR2Vec, LLVM IR Based Scalable Program Embeddings | | Emerging |
| 103 | derrickburns/generalized-kmeans-clustering: Production-ready K-Means clustering for Apache Spark with pluggable Bregman... | | Emerging |
| 104 | infinilabs/coco-server: 🥥 Coco AI Server - Search, Connect, Collaborate, AI-powered Enterprise... | | Emerging |
| 105 | caspianmoon/memoripy: An AI memory layer with short- and long-term storage, semantic clustering,... | | Emerging |
| 106 | IntuitionEngineeringTeam/chars2vec: Character-based word embeddings model based on RNN for handling real world texts | | Emerging |
| 107 | stephantul/reach: Load embeddings and featurize your sentences. | | Emerging |
| 108 | probelabs/probe: AI-friendly semantic code search engine for large codebases. Combines... | | Emerging |
| 109 | pdrm83/sent2vec: How to encode sentences in a high-dimensional vector space, a.k.a., sentence... | | Emerging |
| 110 | MaartenGr/PolyFuzz: Fuzzy string matching, grouping, and evaluation. | | Emerging |
| 111 | insideout10/wordlift-plugin: WordLift brings the power of Artificial Intelligence to beautifully organize... | | Emerging |
| 112 | snap-stanford/stark: (NeurIPS D&B 2024) STaRK: Benchmarking LLM Retrieval on Textual and... | | Emerging |
| 113 | EU-ECDC/episomer: ECDC Early warning tool using social media data. | | Emerging |
| 114 | wagtail/wagtail-vector-index: Store Wagtail pages & Django models as embeddings in vector databases | | Emerging |
| 115 | pingcap/pytidb: TiDB AI SDK: Unified Multi-Modal Data Platform for AI Apps & Agents -... | | Emerging |
| 116 | olaflaitinen/citysense: CitySense is an open-source Python library that bridges geospatial urban... | | Emerging |
| 117 | KonstantinosPetrakis/esco-skill-extractor: Extract ESCO skills and ISCO occupations from texts such as job descriptions or CVs | | Emerging |
| 118 | AnswerDotAI/ModernBERT: Bringing BERT into modernity via both architecture changes and scaling | | Emerging |
| 119 | sashakolpakov/dire-jax: DImensionality REduction in JAX | | Emerging |
| 120 | superduper-io/superduper: Superduper: End-to-end framework for building custom AI applications and agents. | | Emerging |
| 121 | ddickmann/vllm-factory: Production inference for encoder models - ColBERT, GLiNER, ColPali,... | | Emerging |
| 122 | sbhjt-gr/InferrLM: On-device AI for iOS & Android | | Emerging |
| 123 | snap-research/GRID: GRID: Generative Recommendation with Semantic IDs | | Emerging |
| 124 | nomic-ai/nomic: Nomic Developer API SDK | | Emerging |
| 125 | cerul-ai/cerul: Real-time video search engine for AI agents. Search by meaning across visual... | | Emerging |
| 126 | Santosh-Gupta/SpeedTorch: Library for faster pinned CPU <-> GPU transfer in Pytorch | | Emerging |
| 127 | AKSW/sante: The Ontology, Dataset and Knowledge Search Engine | | Emerging |
| 128 | Azure-Samples/azure-ai-document-processing-samples: A collection of samples demonstrating techniques for processing documents... | | Emerging |
| 129 | gweidart/rs-bpe: A ridiculously fast Python BPE (Byte Pair Encoder) implementation written in Rust | | Emerging |
| 130 | kelindar/search: Go library for embedded vector search and semantic embeddings using llama.cpp | | Emerging |
| 131 | pinecone-io/pinecone-datasets: An open-source dataset library for pre-embedded dataset: create your own... | | Emerging |
| 132 | Hyper3Labs/HyperView: HyperView curates datasets and provides model introspection in hyperbolic... | | Emerging |
| 133 | abhilash1910/ClusterTransformer: Topic clustering library built on Transformer embeddings and cosine... | | Emerging |
| 134 | vinid/cade: Compass-aligned Distributional Embeddings. Align embeddings from different corpora | | Emerging |
| 135 | embeddings-benchmark/results: Data for the MTEB leaderboard | | Emerging |
| 136 | alexshtf/torchcurves: Parametric differentiable curves with PyTorch for continuous embeddings,... | | Emerging |
| 137 | amirivojdan/shekar: Simplifying Persian NLP for Modern Applications | | Emerging |
| 138 | Terronex-dev/aifbin-pro: AIF-BIN Pro — Professional AI Memory Management with Semantic Search | | Emerging |
| 139 | alexklibisz/elastiknn: Elasticsearch plugin for nearest neighbor search. Store vectors and run... | | Emerging |
| 140 | ferencberes/online-node2vec: Node Embeddings in Dynamic Graphs | | Emerging |
| 141 | similigh/simili-bot: AI-powered GitHub issue intelligence - semantic duplicate detection,... | | Emerging |
| 142 | yusufhilmi/client-vector-search: A client side vector search library that can embed, store, search, and cache... | | Emerging |
| 143 | twang2218/vocab-coverage: Analysis of the Chinese-language cognitive ability of language models | | Emerging |
| 144 | estebanpdl/osintgpt: An open-source intelligence (OSINT) analysis tool leveraging GPT-powered... | | Emerging |
| 145 | amansrivastava17/embedding-as-service: One-Stop Solution to encode sentence to fixed length vectors from various... | | Emerging |
| 146 | eugeneyan/ml-surveys: 📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs,... | | Emerging |
| 147 | choihyunsus/n2-mimir: AI Experience Learning Engine — AI agents remember, but don't learn. Mimir... | | Emerging |
| 148 | LongxingTan/open-retrievals: All-in-One: Text Embedding, Retrieval, Reranking and RAG in Transformers | | Emerging |
| 149 | IlyasMoutawwakil/py-txi: A Python wrapper around HuggingFace's TGI (text-generation-inference) and... | | Emerging |
| 150 | vezlo/assistant-server: AI Assistant Server | | Emerging |
| 151 | langformers/langformers: 🚀 Unified NLP Pipelines for Language Models | | Emerging |
| 152 | voyage-ai/voyageai-python: Voyage AI Official Python Library | | Emerging |
| 153 | lucidrains/discrete-continuous-embed-readout: Embedding and readout for simple multi-categorical and gaussian continuous | | Emerging |
| 154 | raphaelsty/neural-cherche: Neural Search | | Emerging |
| 155 | milvus-io/milvus-model: A library integrating embedding and reranker models from OpenAI,... | | Emerging |
| 156 | Dicklesworthstone/frankensearch: Two-tier hybrid search for Rust: sub-millisecond initial results via... | | Emerging |
| 157 | slicenferqin/universal-memory-mcp: Super personal assistant - a universal, cross-project, 100% local and private AI memory system (based on the MCP protocol) - Super personal assistant for any... | | Emerging |
| 158 | jkrukowski/swift-embeddings: Run embedding models locally in Swift using MLTensor. | | Emerging |
| 159 | SemBench/SemBench: Benchmarking Semantic Query Processing Engines | | Emerging |
| 160 | debnsuma/fcc-ai-engineering-aws: A Practical Course on Embeddings, RAG, Multimodal Models, and Agents with... | | Emerging |
| 161 | baidubce/bce-qianfan-sdk: Provide best practices for LMOps, as well as elegant and convenient access... | | Emerging |
| 162 | openviglet/turing: ✨ 🧬 Turing ES - Enterprise Search, Semantic Navigation, Chatbot... | | Emerging |
| 163 | deepset-ai/haystack-demos: Fully working applications that demonstrate how to use Haystack to implement... | | Emerging |
| 164 | md-experiments/picture_text: Interactive tree-maps with SBERT & Hierarchical Clustering (HAC) | | Emerging |
| 165 | mantisfury/ArkhamMirror: Local-first AI-powered document intelligence platform for investigative journalism | | Emerging |
| 166 | one-bit/oc-mnemoria: Persistent shared memory (hive mind) for OpenCode agents, powered by the... | | Emerging |
| 167 | MilaNLProc/honest: A Python package to compute HONEST, a score to measure hurtful sentence... | | Emerging |
| 168 | jared-goering/ultramemory: Local-first AI memory engine with relational versioning, temporal grounding,... | | Emerging |
| 169 | sebischair/Lbl2Vec: Lbl2Vec learns jointly embedded label, document and word vectors to retrieve... | | Emerging |
| 170 | Terronex-dev/aifbin-lite: AIF-BIN Lite — Free & Open Source CLI for AI Memory Files | | Emerging |
| 171 | datastax/astra-db-java: Java Client for Data API | | Emerging |
| 172 | mims-harvard/nimfa: Nimfa: Nonnegative matrix factorization in Python | | Emerging |
| 173 | jiegzhan/multi-class-text-classification-cnn: Classify Kaggle Consumer Finance Complaints into 11 classes. Build the model... | | Emerging |
| 174 | natasha/navec: Compact high quality word embeddings for Russian language | | Emerging |
| 175 | fresh-stack/freshstack: This repository helps you evaluate your models on the FreshStack benchmark! | | Emerging |
| 176 | Hironsan/bertsearch: Elasticsearch with BERT for advanced document search. | | Emerging |
| 177 | Praful932/Kitabe: Book Recommendation System built for Book Lovers📖. Simply Rate ⭐ some books... | | Emerging |
| 178 | SireJeff/k0ntext: AI Context Engineering - Intelligent context for Claude, Copilot, Cline, and... | | Emerging |
| 179 | mims-harvard/decagon: Graph convolutional neural network for multirelational link prediction | | Emerging |
| 180 | ALucek/QuicKB: Optimize Document Retrieval with Fine-Tuned KnowledgeBases | | Emerging |
| 181 | raphaelsty/cherche: Neural Search | | Emerging |
| 182 | omoindrot/tensorflow-triplet-loss: Implementation of triplet loss in TensorFlow | | Emerging |
| 183 | jina-ai/examples: Jina examples and demos to help you get started | | Emerging |
| 184 | hamelsmu/code_search: Code For Medium Article: "How To Create Natural Language Semantic Search for... | | Emerging |
| 185 | jiegzhan/multi-class-text-classification-cnn-rnn: Classify Kaggle San Francisco Crime Description into 39 classes. Build the... | | Emerging |
| 186 | amzn/pecos: PECOS - Prediction for Enormous and Correlated Spaces | | Emerging |
| 187 | benedekrozemberczki/graph2vec: A parallel implementation of "graph2vec: Learning Distributed... | | Emerging |
| 188 | shibing624/TreeSearch: TreeSearch: Structure-aware document retrieval without embeddings.... | | Emerging |
| 189 | ProviderProtocol/ai: 0-DEP AI DX SDK | | Emerging |
| 190 | freelawproject/inception: Our microservice for generating embeddings from blocks of text | | Emerging |
| 191 | BernhoferM/TMbed: Transmembrane proteins predicted through Language Model embeddings | | Emerging |
| 192 | ina-foss/twembeddings: Sentence embeddings for unsupervised event detection in the Twitter stream:... | | Emerging |
| 193 | n24q02m/qwen3-embed: Lightweight ONNX inference for Qwen3 embedding and reranking models | | Emerging |
| 194 | poloclub/wizmap: Explore and interpret large embeddings in your browser with interactive... | | Emerging |
| 195 | kaushalshetty/Structured-Self-Attention: A Structured Self-attentive Sentence Embedding | | Emerging |
| 196 | realityinspector/waivelets-v0.1: Wavelet-derived structural fingerprints for text. MiniLM embeddings →... | | Emerging |
| 197 | vectara/react-search: UI widget for adding semantic search to your React UI in just a few lines of code | | Emerging |
| 198 | mims-harvard/ClinVec: ClinVec: Unified Embeddings of Clinical Codes Enable Knowledge-Grounded AI... | | Emerging |
| 199 | towhee-io/examples: Analyze the unstructured data with Towhee, such as reverse image search,... | | Emerging |
| 200 | M9nx/CodexA: Codexa is a local semantic code intelligence CLI designed to help AI... | | Emerging |