All Embedding Tools

4,013 tools ranked by quality score · Page 2 of 41

Showing 101–200 of 4,013
# Tool Score Tier
101 microsoft/kernel-memory

Research project. A Memory solution for users, teams, and applications.

49
Emerging
102 IITH-Compilers/IR2Vec

Implementation of IR2Vec, LLVM IR Based Scalable Program Embeddings

49
Emerging
103 derrickburns/generalized-kmeans-clustering

Production-ready K-Means clustering for Apache Spark with pluggable Bregman...

49
Emerging
104 infinilabs/coco-server

🥥 Coco AI Server - Search, Connect, Collaborate, AI-powered Enterprise...

49
Emerging
105 caspianmoon/memoripy

An AI memory layer with short- and long-term storage, semantic clustering,...

49
Emerging
106 IntuitionEngineeringTeam/chars2vec

Character-based word embeddings model based on RNN for handling real world texts

49
Emerging
107 stephantul/reach

Load embeddings and featurize your sentences.

49
Emerging
108 probelabs/probe

AI-friendly semantic code search engine for large codebases. Combines...

49
Emerging
109 pdrm83/sent2vec

How to encode sentences in a high-dimensional vector space, a.k.a., sentence...

49
Emerging
110 MaartenGr/PolyFuzz

Fuzzy string matching, grouping, and evaluation.

49
Emerging
111 insideout10/wordlift-plugin

WordLift brings the power of Artificial Intelligence to beautifully organize...

49
Emerging
112 snap-stanford/stark

(NeurIPS D&B 2024) STaRK: Benchmarking LLM Retrieval on Textual and...

49
Emerging
113 EU-ECDC/episomer

ECDC Early warning tool using social media data.

48
Emerging
114 wagtail/wagtail-vector-index

Store Wagtail pages & Django models as embeddings in vector databases

48
Emerging
115 pingcap/pytidb

TiDB AI SDK: Unified Multi-Modal Data Platform for AI Apps & Agents -...

48
Emerging
116 olaflaitinen/citysense

CitySense is an open-source Python library that bridges geospatial urban...

48
Emerging
117 KonstantinosPetrakis/esco-skill-extractor

Extract ESCO skills and ISCO occupations from texts such as job descriptions or CVs

48
Emerging
118 AnswerDotAI/ModernBERT

Bringing BERT into modernity via both architecture changes and scaling

48
Emerging
119 sashakolpakov/dire-jax

DImensionality REduction in JAX

48
Emerging
120 superduper-io/superduper

Superduper: End-to-end framework for building custom AI applications and agents.

48
Emerging
121 ddickmann/vllm-factory

Production inference for encoder models - ColBERT, GLiNER, ColPali,...

48
Emerging
122 sbhjt-gr/InferrLM

On-device AI for iOS & Android

48
Emerging
123 snap-research/GRID

GRID: Generative Recommendation with Semantic IDs

48
Emerging
124 nomic-ai/nomic

Nomic Developer API SDK

47
Emerging
125 cerul-ai/cerul

Real-time video search engine for AI agents. Search by meaning across visual...

47
Emerging
126 Santosh-Gupta/SpeedTorch

Library for faster pinned CPU <-> GPU transfer in Pytorch

47
Emerging
127 AKSW/sante

The Ontology, Dataset and Knowledge Search Engine

47
Emerging
128 Azure-Samples/azure-ai-document-processing-samples

A collection of samples demonstrating techniques for processing documents...

47
Emerging
129 gweidart/rs-bpe

A ridiculously fast Python BPE (Byte Pair Encoder) implementation written in Rust

47
Emerging
130 kelindar/search

Go library for embedded vector search and semantic embeddings using llama.cpp

47
Emerging
131 pinecone-io/pinecone-datasets

An open-source dataset library for pre-embedded dataset: create your own...

47
Emerging
132 Hyper3Labs/HyperView

HyperView curates datasets and provides model introspection in hyperbolic...

47
Emerging
133 abhilash1910/ClusterTransformer

Topic clustering library built on Transformer embeddings and cosine...

47
Emerging
134 vinid/cade

Compass-aligned Distributional Embeddings. Align embeddings from different corpora

47
Emerging
135 embeddings-benchmark/results

Data for the MTEB leaderboard

47
Emerging
136 alexshtf/torchcurves

Parametric differentiable curves with PyTorch for continuous embeddings,...

47
Emerging
137 amirivojdan/shekar

Simplifying Persian NLP for Modern Applications

47
Emerging
138 Terronex-dev/aifbin-pro

AIF-BIN Pro — Professional AI Memory Management with Semantic Search

47
Emerging
139 alexklibisz/elastiknn

Elasticsearch plugin for nearest neighbor search. Store vectors and run...

47
Emerging
140 ferencberes/online-node2vec

Node Embeddings in Dynamic Graphs

47
Emerging
141 similigh/simili-bot

AI-powered GitHub issue intelligence - semantic duplicate detection,...

47
Emerging
142 yusufhilmi/client-vector-search

A client side vector search library that can embed, store, search, and cache...

47
Emerging
143 twang2218/vocab-coverage

语言模型中文认知能力分析

47
Emerging
144 estebanpdl/osintgpt

An open-source intelligence (OSINT) analysis tool leveraging GPT-powered...

46
Emerging
145 amansrivastava17/embedding-as-service

One-Stop Solution to encode sentence to fixed length vectors from various...

46
Emerging
146 eugeneyan/ml-surveys

📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs,...

46
Emerging
147 choihyunsus/n2-mimir

AI Experience Learning Engine — AI agents remember, but don't learn. Mimir...

46
Emerging
148 LongxingTan/open-retrievals

All-in-One: Text Embedding, Retrieval, Reranking and RAG in Transformers

46
Emerging
149 IlyasMoutawwakil/py-txi

A Python wrapper around HuggingFace's TGI (text-generation-inference) and...

46
Emerging
150 vezlo/assistant-server

AI Assistant Server

46
Emerging
151 langformers/langformers

🚀 Unified NLP Pipelines for Language Models

46
Emerging
152 voyage-ai/voyageai-python

Voyage AI Official Python Library

46
Emerging
153 lucidrains/discrete-continuous-embed-readout

Embedding and readout for simple multi-categorical and gaussian continuous

46
Emerging
154 raphaelsty/neural-cherche

Neural Search

46
Emerging
155 milvus-io/milvus-model

A library integrating embedding and reranker models from OpenAI,...

46
Emerging
156 Dicklesworthstone/frankensearch

Two-tier hybrid search for Rust: sub-millisecond initial results via...

46
Emerging
157 slicenferqin/universal-memory-mcp

超级个人助理 - 通用、跨项目、100%本地私有的AI记忆系统(基于MCP协议)- Super personal assistant for any...

46
Emerging
158 jkrukowski/swift-embeddings

Run embedding models locally in Swift using MLTensor.

45
Emerging
159 SemBench/SemBench

Benchmarking Semantic Query Processing Engines

45
Emerging
160 debnsuma/fcc-ai-engineering-aws

A Practical Course on Embeddings, RAG, Multimodal Models, and Agents with...

45
Emerging
161 baidubce/bce-qianfan-sdk

Provide best practices for LMOps, as well as elegant and convenient access...

45
Emerging
162 openviglet/turing

:sparkles: :dna: Turing ES - Enterprise Search, Semantic Navigation, Chatbot...

45
Emerging
163 deepset-ai/haystack-demos

Fully working applications that demonstrate how to use Haystack to implement...

45
Emerging
164 md-experiments/picture_text

Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)

45
Emerging
165 mantisfury/ArkhamMirror

Local-first AI-powered document intelligence platform for investigative journalism

45
Emerging
166 one-bit/oc-mnemoria

Persistent shared memory (hive mind) for OpenCode agents, powered by the...

45
Emerging
167 MilaNLProc/honest

A Python package to compute HONEST, a score to measure hurtful sentence...

45
Emerging
168 jared-goering/ultramemory

Local-first AI memory engine with relational versioning, temporal grounding,...

45
Emerging
169 sebischair/Lbl2Vec

Lbl2Vec learns jointly embedded label, document and word vectors to retrieve...

45
Emerging
170 Terronex-dev/aifbin-lite

AIF-BIN Lite — Free & Open Source CLI for AI Memory Files

45
Emerging
171 datastax/astra-db-java

Java Client for Data API

44
Emerging
172 mims-harvard/nimfa

Nimfa: Nonnegative matrix factorization in Python

44
Emerging
173 jiegzhan/multi-class-text-classification-cnn

Classify Kaggle Consumer Finance Complaints into 11 classes. Build the model...

44
Emerging
174 natasha/navec

Compact high quality word embeddings for Russian language

44
Emerging
175 fresh-stack/freshstack

This repository helps you evaluate your models on the FreshStack benchmark!

44
Emerging
176 Hironsan/bertsearch

Elasticsearch with BERT for advanced document search.

44
Emerging
177 Praful932/Kitabe

Book Recommendation System built for Book Lovers📖. Simply Rate ⭐ some books...

44
Emerging
178 SireJeff/k0ntext

AI Context Engineering - Intelligent context for Claude, Copilot, Cline, and...

44
Emerging
179 mims-harvard/decagon

Graph convolutional neural network for multirelational link prediction

44
Emerging
180 ALucek/QuicKB

Optimize Document Retrieval with Fine-Tuned KnowledgeBases

44
Emerging
181 raphaelsty/cherche

Neural Search

44
Emerging
182 omoindrot/tensorflow-triplet-loss

Implementation of triplet loss in TensorFlow

44
Emerging
183 jina-ai/examples

Jina examples and demos to help you get started

44
Emerging
184 hamelsmu/code_search

Code For Medium Article: "How To Create Natural Language Semantic Search for...

44
Emerging
185 jiegzhan/multi-class-text-classification-cnn-rnn

Classify Kaggle San Francisco Crime Description into 39 classes. Build the...

44
Emerging
186 amzn/pecos

PECOS - Prediction for Enormous and Correlated Spaces

43
Emerging
187 benedekrozemberczki/graph2vec

A parallel implementation of "graph2vec: Learning Distributed...

43
Emerging
188 shibing624/TreeSearch

TreeSearch: Structure-aware document retrieval without embeddings....

43
Emerging
189 ProviderProtocol/ai

0-DEP AI DX SDK

43
Emerging
190 freelawproject/inception

Our microservice for generating embeddings from blocks of text

43
Emerging
191 BernhoferM/TMbed

Transmembrane proteins predicted through Language Model embeddings

43
Emerging
192 ina-foss/twembeddings

Sentence embeddings for unsupervised event detection in the Twitter stream:...

43
Emerging
193 n24q02m/qwen3-embed

Lightweight ONNX inference for Qwen3 embedding and reranking models

43
Emerging
194 poloclub/wizmap

Explore and interpret large embeddings in your browser with interactive...

43
Emerging
195 kaushalshetty/Structured-Self-Attention

A Structured Self-attentive Sentence Embedding

43
Emerging
196 realityinspector/waivelets-v0.1

Wavelet-derived structural fingerprints for text. MiniLM embeddings →...

43
Emerging
197 vectara/react-search

UI widget for adding semantic search to your React UI in just a few lines of code

43
Emerging
198 mims-harvard/ClinVec

ClinVec: Unified Embeddings of Clinical Codes Enable Knowledge-Grounded AI...

43
Emerging
199 towhee-io/examples

Analyze the unstructured data with Towhee, such as reverse image search,...

43
Emerging
200 M9nx/CodexA

Codexa is a local semantic code intelligence CLI designed to help AI...

43
Emerging