All Embedding Tools
4,013 tools ranked by quality score · Page 3 of 41
| # | Tool | Score | Tier |
|---|---|---|---|
| 201 |
Muvon/octolib
The lib to power AI tools. |
|
Emerging |
| 202 |
build-on-aws/langchain-embeddings
This repository demonstrates the construction of a state-of-the-art... |
|
Emerging |
| 203 |
dalinvip/cw2vec
cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information |
|
Emerging |
| 204 |
dabit3/semantic-search-nextjs-pinecone-langchain-chatgpt
Embeds text files into vectors, stores them on Pinecone, and enables... |
|
Emerging |
| 205 |
JGalego/RAGmap
A simple Streamlit application to visualize document chunks and queries in... |
|
Emerging |
| 206 |
yuniko-software/bge-m3-onnx
ONNX implementation of the BGE-M3 multilingual embedding model and tokenizer... |
|
Emerging |
| 207 |
claws-lab/jodie
A PyTorch implementation of ACM SIGKDD 2019 paper "Predicting Dynamic... |
|
Emerging |
| 208 |
GlobalMaksimum/sadedegel
A General Purpose NLP library for Turkish |
|
Emerging |
| 209 |
GregorBiswanger/SemanticChunker.NET
Embedding-driven, context-aware text chunking for Semantic Kernel and RAG... |
|
Emerging |
| 210 |
GKalliatakis/Keras-VGG16-places365
Keras code and weights files for the VGG16-places365 and VGG16-hybrid1365... |
|
Emerging |
| 211 |
maxent-ai/ocrpy
OCR, Archive, Index and Search: Implementation agnostic OCR framework. |
|
Emerging |
| 212 |
cbaziotis/datastories-semeval2017-task4
Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep... |
|
Emerging |
| 213 |
curiosity-ai/hnsw-sharp
C# library for approximate nearest neighbors search using Hierarchical... |
|
Emerging |
| 214 |
eBay/KPRN
Reasoning Over Knowledge Graph Paths for Recommendation |
|
Emerging |
| 215 |
karadHub/jenkins-ai-optimizer
Meet the beast of Jenkins MCP with AI-powered diagnostics, and more with... |
|
Emerging |
| 216 |
gnes-ai/gnes
GNES is Generic Neural Elastic Search, a cloud-native semantic search system... |
|
Emerging |
| 217 |
gustavz/DataChad
Ask questions about any data source by leveraging langchains |
|
Emerging |
| 218 |
yatendra2001/ai_buddy
Your personal free-to-use AI assistant, built with gemini & flutter. |
|
Emerging |
| 219 |
shehzaadzd/MINERVA
Meandering In Networks of Entities to Reach Verisimilar Answers |
|
Emerging |
| 220 |
NYUMedML/DeepEHR
Chronic Disease Prediction Using Medical Notes |
|
Emerging |
| 221 |
sashakolpakov/graphem-rapids
Graph embedding for influence maximization in networks |
|
Emerging |
| 222 |
pwrdrvr/ghcrawl
Terminal UI and local CLI for crawling GitHub issues and pull requests,... |
|
Emerging |
| 223 |
WolframResearch/wolfram-notebook-embedder
JavaScript embedder for Wolfram Cloud notebooks |
|
Emerging |
| 224 |
Hyper3Labs/clawdrive
Google Drive for AI agents. Store any file and search by meaning across modalities. |
|
Emerging |
| 225 |
mariotoffia/goannoy
go native port of annoy. Approximate Nearest Neighbors in optimized for... |
|
Emerging |
| 226 |
jaanli/food2vec
:hamburger: |
|
Emerging |
| 227 |
dmotz/emdash
๐๐งโโ๏ธ Wisdom indexer โ use AI to organize text snippets so you can actually... |
|
Emerging |
| 228 |
Ubaida-M-Yusuf/Makimus-AI
AI-powered media search โ find images and videos using natural language or... |
|
Emerging |
| 229 |
criteo-research/CausE
Code for the Recsys 2018 paper entitled Causal Embeddings for Recommandation. |
|
Emerging |
| 230 |
ecnu-sea/SEA
[EMNLP 2024 Findings] SEA is an automated paper review framework capable of... |
|
Emerging |
| 231 |
DiceTechJobs/ConceptualSearch
Train a Word2Vec model or LSA model, and Implement Conceptual... |
|
Emerging |
| 232 |
OpenConceptLab/oclmap
OCL Mapper (beta): an open-source AI-supported terminology mapping solution... |
|
Emerging |
| 233 |
mims-harvard/graphml-tutorials
Tutorials for Machine Learning on Graphs |
|
Emerging |
| 234 |
cybergis/rs-embed
A basic repo for integrating Remote Sensing Foundation Models |
|
Emerging |
| 235 |
pnpnpn/dna2vec
dna2vec: Consistent vector representations of variable-length k-mers |
|
Emerging |
| 236 |
filippostanghellini/DocFinder
DocFinder is a local-first indexing and searching documents using semantic... |
|
Emerging |
| 237 |
dawenl/cofactor
CoFactor: Regularizing Matrix Factorization with Item Co-occurrence |
|
Emerging |
| 238 |
mims-harvard/Raindrop
Graph Neural Networks for Irregular Time Series |
|
Emerging |
| 239 |
linkml/linkml-map
Mapping between LinkML schemas |
|
Emerging |
| 240 |
robert-mcdermott/embeddings_plot
A command line utility to create a plots of word embeddings |
|
Emerging |
| 241 |
spcl/ncc
Neural Code Comprehension: A Learnable Representation of Code Semantics |
|
Emerging |
| 242 |
AshokHub/locBLAST
Local NCBI BLAST+ Search |
|
Emerging |
| 243 |
autonomio/signs
A suite of tools for text preparation, vectorization and processing for deep... |
|
Emerging |
| 244 |
MaartenGr/VLAC
Vectors of Locally Aggregated Concepts |
|
Emerging |
| 245 |
Bryptobricks/Structured-Memory-Engine
Persistent, self-maintaining memory for AI agents. 990 tests. <1ms recall.... |
|
Emerging |
| 246 |
devmount/GermanWordEmbeddings
Toolkit to obtain and preprocess German text corpora, train models and... |
|
Emerging |
| 247 |
starkbaknet/project-vectorizer
A CLI tool that vectorizes codebases, stores them in a database, tracks... |
|
Emerging |
| 248 |
Collection-Space-Navigator/CSN
Interactive Visualization Interface for Multidimensional Datasets |
|
Emerging |
| 249 |
noi-techpark/stuart-chatbot
Stuart is simple RAG System, that the Open Data Hub uses as a chatbot to... |
|
Emerging |
| 250 |
vectorlessflow/vectorless
Vectorless is a hierarchical, reasoning-native document intelligence engine.... |
|
Emerging |
| 251 |
vintasoftware/entity-embed
PyTorch library for transforming entities like companies, products, etc.... |
|
Emerging |
| 252 |
Hironsan/awesome-embedding-models
A curated list of awesome embedding models tutorials, projects and communities. |
|
Emerging |
| 253 |
LLukas22/tei-client
Convenience Client for Hugging Face Text Embeddings Inference (TEI) with... |
|
Emerging |
| 254 |
minimaxir/imgbeddings
Python package to generate image embeddings with CLIP without PyTorch/TensorFlow |
|
Emerging |
| 255 |
auyelbekov/rawq
Context retrieval engine for AI agents โ semantic + lexical search over codebases |
|
Emerging |
| 256 |
finalfusion/finalfusion-rust
finalfusion embeddings in Rust |
|
Emerging |
| 257 |
IBM/watsonx-ai-java-sdk
The watsonx.ai Java SDK is an open-source library that simplifies the... |
|
Emerging |
| 258 |
revokslab/codecrawl
๐ Turn entire codebases into LLM-ready data. Extract data, search, and... |
|
Emerging |
| 259 |
Snehil-Shah/Multimodal-Image-Search-Engine
Text to Image & Reverse Image Search Engine built upon Vector Similarity... |
|
Emerging |
| 260 |
MinishLab/tokenlearn
Pre-train Static Word Embeddings |
|
Emerging |
| 261 |
ShunsukeHayashi/context-and-impact
Unified Context-to-Execution pipeline: Obsidian semantic search + GitNexus... |
|
Emerging |
| 262 |
youneslaaroussi/CloudWatchman
Autonomous AI agent for AWS CloudWatch log monitoring. |
|
Emerging |
| 263 |
supabase/headless-vector-search
Supabase Toolkit to perform vector similarity search on your knowledge base... |
|
Emerging |
| 264 |
iamaziz/ar-embeddings
Sentiment Analysis for Arabic Text (tweets, reviews, and standard Arabic)... |
|
Emerging |
| 265 |
DeepGraphLearning/graphvite
GraphVite: A General and High-performance Graph Embedding System |
|
Emerging |
| 266 |
Zefan-Cai/R-KV
[Neurips 2025] R-KV: Redundancy-aware KV Cache Compression for Reasoning Models |
|
Emerging |
| 267 |
xsfa/pointstorm
Real-time embeddings for data on the move |
|
Emerging |
| 268 |
vcal-project/vcal-core
VCAL Core โ high-performance semantic cache and vector cache library for LLM... |
|
Emerging |
| 269 |
abojchevski/graph2gauss
Gaussian node embeddings. Implementation of "Deep Gaussian Embedding of... |
|
Emerging |
| 270 |
mims-harvard/SHEPHERD
SHEPHERD: Few shot learning for phenotype-driven diagnosis of patients with... |
|
Emerging |
| 271 |
eugeneyan/semantic-ids-llm
Semantic IDs: How to train an LLM-Recommender Hybrid with steerability and... |
|
Emerging |
| 272 |
jwizenfeld04/Echo-Guard
Semantic linting CLI that detects codebase redundancy created by AI coding agents. |
|
Emerging |
| 273 |
biocentral/biocentral_server
Compute functionality for biocentral. |
|
Emerging |
| 274 |
BruinGrowly/Semantic-Compressor
DNA-inspired semantic compression for AI reasoning at scale. Compress... |
|
Emerging |
| 275 |
mims-harvard/scikit-fusion
scikit-fusion: Data fusion via collective latent factor models |
|
Emerging |
| 276 |
solygambas/python-openai-projects
13 projects using ChatGPT API, Whisper, Embeddings, and DALL-E with Python. |
|
Emerging |
| 277 |
aws-samples/sample-extreme-text-classifier
A Python text classifier for large-scale multi-class classification using... |
|
Emerging |
| 278 |
FullStackWithLawrence/openai-embeddings
OpenAI chatGPT hybrid search and retrieval augmented generation |
|
Emerging |
| 279 |
curiosity-ai/umap-sharp
C# library for fast embeddings projection using Uniform Manifold... |
|
Emerging |
| 280 |
userFRM/rpg-encoder
Repository Planning Graph โ semantic code understanding via MCP (arXiv:2602.02084) |
|
Emerging |
| 281 |
houtini-ai/fanout-mcp
A Query fan-out analyser for AI Search |
|
Emerging |
| 282 |
D2KLab/entity2rec
entity2rec generates item recommendation using property-specific knowledge... |
|
Emerging |
| 283 |
s-emanuilov/litepali
LitePali is a minimal, efficient implementation of ColPali for image... |
|
Emerging |
| 284 |
Agrover112/awesome-semantic-search
A curated list of awesome resources related to Semantic Search๐ and... |
|
Emerging |
| 285 |
alisonbma/aiSFX
Representation Learning for the Automatic Indexing of Sound Effects... |
|
Emerging |
| 286 |
AliOsm/simplerepresentations
Easy-to-use text representations extraction library based on the... |
|
Emerging |
| 287 |
matzalazar/rhizome
Local-first semantic backlinks for Obsidian and Logseq โ embeds your notes... |
|
Emerging |
| 288 |
yuvrajangadsingh/vemb
httpie for embeddings. Embed text, images, audio, video, and PDFs from the... |
|
Emerging |
| 289 |
balajivis/sutra-mas
36,299 multi-agent systems papers collected, 17,969 analyzed with... |
|
Emerging |
| 290 |
aws-samples/news-clustering-and-summarization
This repository contains code for a near real-time news clustering and... |
|
Emerging |
| 291 |
decodingai-magazine/tabular-semantic-search-tutorial
๐ Tutorial on building a modern search app for Amazon e-commerce products... |
|
Emerging |
| 292 |
arabicapp/everything-claude-code
๐ Build powerful agents and configurations with the complete collection of... |
|
Emerging |
| 293 |
UniverseTBD/platonic-universe
Do foundation models see the same sky? ๐ฎ |
|
Emerging |
| 294 |
sede-open/Fleming
Fleming repo to run semantic search models on databricks on CPU. |
|
Emerging |
| 295 |
mims-harvard/SubGNN
Subgraph Neural Networks (NeurIPS 2020) |
|
Emerging |
| 296 |
Addepto/graph_builder
Open-source toolkit to extract structured knowledge graphs from documents... |
|
Emerging |
| 297 |
IronAdamant/stele-context
Local context cache for LLM agents. 100% offline, zero dependencies. |
|
Emerging |
| 298 |
PkuRainBow/HDC.caffe
Complete Code for "Hard-Aware-Deeply-Cascaded-Embedding" |
|
Emerging |
| 299 |
zrg-team/memorall
Local-first AI extension that turns what you read into a searchable... |
|
Emerging |
| 300 |
code-kern-ai/embedders
With embedders, you can easily convert your texts into sentence- or... |
|
Emerging |