All Embedding Tools

4,013 tools ranked by quality score · Page 3 of 41

Showing 201–300 of 4,013
# Tool Score Tier
201 Muvon/octolib

The lib to power AI tools.

43
Emerging
202 build-on-aws/langchain-embeddings

This repository demonstrates the construction of a state-of-the-art...

42
Emerging
203 dalinvip/cw2vec

cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information

42
Emerging
204 dabit3/semantic-search-nextjs-pinecone-langchain-chatgpt

Embeds text files into vectors, stores them on Pinecone, and enables...

42
Emerging
205 JGalego/RAGmap

A simple Streamlit application to visualize document chunks and queries in...

42
Emerging
206 yuniko-software/bge-m3-onnx

ONNX implementation of the BGE-M3 multilingual embedding model and tokenizer...

42
Emerging
207 claws-lab/jodie

A PyTorch implementation of ACM SIGKDD 2019 paper "Predicting Dynamic...

42
Emerging
208 GlobalMaksimum/sadedegel

A General Purpose NLP library for Turkish

42
Emerging
209 GregorBiswanger/SemanticChunker.NET

Embedding-driven, context-aware text chunking for Semantic Kernel and RAG...

42
Emerging
210 GKalliatakis/Keras-VGG16-places365

Keras code and weights files for the VGG16-places365 and VGG16-hybrid1365...

42
Emerging
211 maxent-ai/ocrpy

OCR, Archive, Index and Search: Implementation agnostic OCR framework.

42
Emerging
212 cbaziotis/datastories-semeval2017-task4

Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep...

42
Emerging
213 curiosity-ai/hnsw-sharp

C# library for approximate nearest neighbors search using Hierarchical...

42
Emerging
214 eBay/KPRN

Reasoning Over Knowledge Graph Paths for Recommendation

42
Emerging
215 karadHub/jenkins-ai-optimizer

Meet the beast of Jenkins MCP with AI-powered diagnostics, and more with...

42
Emerging
216 gnes-ai/gnes

GNES is Generic Neural Elastic Search, a cloud-native semantic search system...

42
Emerging
217 gustavz/DataChad

Ask questions about any data source by leveraging langchains

42
Emerging
218 yatendra2001/ai_buddy

Your personal free-to-use AI assistant, built with gemini & flutter.

42
Emerging
219 shehzaadzd/MINERVA

Meandering In Networks of Entities to Reach Verisimilar Answers

42
Emerging
220 NYUMedML/DeepEHR

Chronic Disease Prediction Using Medical Notes

42
Emerging
221 sashakolpakov/graphem-rapids

Graph embedding for influence maximization in networks

42
Emerging
222 pwrdrvr/ghcrawl

Terminal UI and local CLI for crawling GitHub issues and pull requests,...

42
Emerging
223 WolframResearch/wolfram-notebook-embedder

JavaScript embedder for Wolfram Cloud notebooks

41
Emerging
224 Hyper3Labs/clawdrive

Google Drive for AI agents. Store any file and search by meaning across modalities.

41
Emerging
225 mariotoffia/goannoy

go native port of annoy. Approximate Nearest Neighbors in optimized for...

41
Emerging
226 jaanli/food2vec

:hamburger:

41
Emerging
227 dmotz/emdash

๐Ÿ“š๐Ÿง™โ€โ™‚๏ธ Wisdom indexer โ€” use AI to organize text snippets so you can actually...

41
Emerging
228 Ubaida-M-Yusuf/Makimus-AI

AI-powered media search โ€” find images and videos using natural language or...

41
Emerging
229 criteo-research/CausE

Code for the Recsys 2018 paper entitled Causal Embeddings for Recommandation.

41
Emerging
230 ecnu-sea/SEA

[EMNLP 2024 Findings] SEA is an automated paper review framework capable of...

41
Emerging
231 DiceTechJobs/ConceptualSearch

Train a Word2Vec model or LSA model, and Implement Conceptual...

41
Emerging
232 OpenConceptLab/oclmap

OCL Mapper (beta): an open-source AI-supported terminology mapping solution...

41
Emerging
233 mims-harvard/graphml-tutorials

Tutorials for Machine Learning on Graphs

41
Emerging
234 cybergis/rs-embed

A basic repo for integrating Remote Sensing Foundation Models

41
Emerging
235 pnpnpn/dna2vec

dna2vec: Consistent vector representations of variable-length k-mers

41
Emerging
236 filippostanghellini/DocFinder

DocFinder is a local-first indexing and searching documents using semantic...

41
Emerging
237 dawenl/cofactor

CoFactor: Regularizing Matrix Factorization with Item Co-occurrence

41
Emerging
238 mims-harvard/Raindrop

Graph Neural Networks for Irregular Time Series

41
Emerging
239 linkml/linkml-map

Mapping between LinkML schemas

41
Emerging
240 robert-mcdermott/embeddings_plot

A command line utility to create a plots of word embeddings

41
Emerging
241 spcl/ncc

Neural Code Comprehension: A Learnable Representation of Code Semantics

41
Emerging
242 AshokHub/locBLAST

Local NCBI BLAST+ Search

41
Emerging
243 autonomio/signs

A suite of tools for text preparation, vectorization and processing for deep...

41
Emerging
244 MaartenGr/VLAC

Vectors of Locally Aggregated Concepts

41
Emerging
245 Bryptobricks/Structured-Memory-Engine

Persistent, self-maintaining memory for AI agents. 990 tests. <1ms recall....

41
Emerging
246 devmount/GermanWordEmbeddings

Toolkit to obtain and preprocess German text corpora, train models and...

41
Emerging
247 starkbaknet/project-vectorizer

A CLI tool that vectorizes codebases, stores them in a database, tracks...

41
Emerging
248 Collection-Space-Navigator/CSN

Interactive Visualization Interface for Multidimensional Datasets

41
Emerging
249 noi-techpark/stuart-chatbot

Stuart is simple RAG System, that the Open Data Hub uses as a chatbot to...

41
Emerging
250 vectorlessflow/vectorless

Vectorless is a hierarchical, reasoning-native document intelligence engine....

41
Emerging
251 vintasoftware/entity-embed

PyTorch library for transforming entities like companies, products, etc....

41
Emerging
252 Hironsan/awesome-embedding-models

A curated list of awesome embedding models tutorials, projects and communities.

41
Emerging
253 LLukas22/tei-client

Convenience Client for Hugging Face Text Embeddings Inference (TEI) with...

41
Emerging
254 minimaxir/imgbeddings

Python package to generate image embeddings with CLIP without PyTorch/TensorFlow

41
Emerging
255 auyelbekov/rawq

Context retrieval engine for AI agents โ€” semantic + lexical search over codebases

41
Emerging
256 finalfusion/finalfusion-rust

finalfusion embeddings in Rust

40
Emerging
257 IBM/watsonx-ai-java-sdk

The watsonx.ai Java SDK is an open-source library that simplifies the...

40
Emerging
258 revokslab/codecrawl

๐ŸŒŠ Turn entire codebases into LLM-ready data. Extract data, search, and...

40
Emerging
259 Snehil-Shah/Multimodal-Image-Search-Engine

Text to Image & Reverse Image Search Engine built upon Vector Similarity...

40
Emerging
260 MinishLab/tokenlearn

Pre-train Static Word Embeddings

40
Emerging
261 ShunsukeHayashi/context-and-impact

Unified Context-to-Execution pipeline: Obsidian semantic search + GitNexus...

40
Emerging
262 youneslaaroussi/CloudWatchman

Autonomous AI agent for AWS CloudWatch log monitoring.

40
Emerging
263 supabase/headless-vector-search

Supabase Toolkit to perform vector similarity search on your knowledge base...

40
Emerging
264 iamaziz/ar-embeddings

Sentiment Analysis for Arabic Text (tweets, reviews, and standard Arabic)...

40
Emerging
265 DeepGraphLearning/graphvite

GraphVite: A General and High-performance Graph Embedding System

40
Emerging
266 Zefan-Cai/R-KV

[Neurips 2025] R-KV: Redundancy-aware KV Cache Compression for Reasoning Models

40
Emerging
267 xsfa/pointstorm

Real-time embeddings for data on the move

40
Emerging
268 vcal-project/vcal-core

VCAL Core โ€” high-performance semantic cache and vector cache library for LLM...

40
Emerging
269 abojchevski/graph2gauss

Gaussian node embeddings. Implementation of "Deep Gaussian Embedding of...

40
Emerging
270 mims-harvard/SHEPHERD

SHEPHERD: Few shot learning for phenotype-driven diagnosis of patients with...

40
Emerging
271 eugeneyan/semantic-ids-llm

Semantic IDs: How to train an LLM-Recommender Hybrid with steerability and...

40
Emerging
272 jwizenfeld04/Echo-Guard

Semantic linting CLI that detects codebase redundancy created by AI coding agents.

40
Emerging
273 biocentral/biocentral_server

Compute functionality for biocentral.

40
Emerging
274 BruinGrowly/Semantic-Compressor

DNA-inspired semantic compression for AI reasoning at scale. Compress...

40
Emerging
275 mims-harvard/scikit-fusion

scikit-fusion: Data fusion via collective latent factor models

40
Emerging
276 solygambas/python-openai-projects

13 projects using ChatGPT API, Whisper, Embeddings, and DALL-E with Python.

40
Emerging
277 aws-samples/sample-extreme-text-classifier

A Python text classifier for large-scale multi-class classification using...

40
Emerging
278 FullStackWithLawrence/openai-embeddings

OpenAI chatGPT hybrid search and retrieval augmented generation

40
Emerging
279 curiosity-ai/umap-sharp

C# library for fast embeddings projection using Uniform Manifold...

40
Emerging
280 userFRM/rpg-encoder

Repository Planning Graph โ€” semantic code understanding via MCP (arXiv:2602.02084)

40
Emerging
281 houtini-ai/fanout-mcp

A Query fan-out analyser for AI Search

40
Emerging
282 D2KLab/entity2rec

entity2rec generates item recommendation using property-specific knowledge...

40
Emerging
283 s-emanuilov/litepali

LitePali is a minimal, efficient implementation of ColPali for image...

40
Emerging
284 Agrover112/awesome-semantic-search

A curated list of awesome resources related to Semantic Search๐Ÿ”Ž and...

39
Emerging
285 alisonbma/aiSFX

Representation Learning for the Automatic Indexing of Sound Effects...

39
Emerging
286 AliOsm/simplerepresentations

Easy-to-use text representations extraction library based on the...

39
Emerging
287 matzalazar/rhizome

Local-first semantic backlinks for Obsidian and Logseq โ€” embeds your notes...

39
Emerging
288 yuvrajangadsingh/vemb

httpie for embeddings. Embed text, images, audio, video, and PDFs from the...

39
Emerging
289 balajivis/sutra-mas

36,299 multi-agent systems papers collected, 17,969 analyzed with...

39
Emerging
290 aws-samples/news-clustering-and-summarization

This repository contains code for a near real-time news clustering and...

39
Emerging
291 decodingai-magazine/tabular-semantic-search-tutorial

๐Ÿ“š Tutorial on building a modern search app for Amazon e-commerce products...

39
Emerging
292 arabicapp/everything-claude-code

๐Ÿš€ Build powerful agents and configurations with the complete collection of...

39
Emerging
293 UniverseTBD/platonic-universe

Do foundation models see the same sky? ๐Ÿ”ฎ

39
Emerging
294 sede-open/Fleming

Fleming repo to run semantic search models on databricks on CPU.

39
Emerging
295 mims-harvard/SubGNN

Subgraph Neural Networks (NeurIPS 2020)

39
Emerging
296 Addepto/graph_builder

Open-source toolkit to extract structured knowledge graphs from documents...

39
Emerging
297 IronAdamant/stele-context

Local context cache for LLM agents. 100% offline, zero dependencies.

39
Emerging
298 PkuRainBow/HDC.caffe

Complete Code for "Hard-Aware-Deeply-Cascaded-Embedding"

39
Emerging
299 zrg-team/memorall

Local-first AI extension that turns what you read into a searchable...

39
Emerging
300 code-kern-ai/embedders

With embedders, you can easily convert your texts into sentence- or...

39
Emerging
« Prev 1 2 3 4 5 39 40 41 Next »