All Embedding Tools
4,013 tools ranked by quality score · Page 4 of 41
| # | Tool | Score | Tier |
|---|---|---|---|
| 301 | hayabhay/frogbase<br>Transform audio-visual content into navigable knowledge. | | Emerging |
| 302 | aiplanethub/beyondllm<br>Build, evaluate and observe LLM apps | | Emerging |
| 303 | cofin/mogemma<br>🔥 Python / Mojo Interface for Google Gemma 3 | | Emerging |
| 304 | simonw/llm-embed-jina<br>Embedding models from Jina AI | | Emerging |
| 305 | nomic-ai/semantic-search-app-template<br>Tutorial and template for a semantic search app powered by the Atlas... | | Emerging |
| 306 | lgalke/vec4ir<br>Word Embeddings for Information Retrieval | | Emerging |
| 307 | BryanChasko/kiro-cli-notes<br>Professional Kiro CLI setup guide based on 10 tutorial videos with proper... | | Emerging |
| 308 | kreuzberg-dev/kreuzberg-surrealdb<br>Extract, chunk, and embed documents from 88+ formats directly into SurrealDB. | | Emerging |
| 309 | SeanLee97/AnglE<br>Train and Infer Powerful Sentence Embeddings with AnglE \| 🔥 SOTA on STS and... | | Emerging |
| 310 | calebevans/cordon<br>Reduce logs to their semantic anomalies | | Emerging |
| 311 | oborchers/Fast_Sentence_Embeddings<br>Compute Sentence Embeddings Fast! | | Emerging |
| 312 | tca19/dict2vec<br>Dict2vec is a framework to learn word embeddings using lexical dictionaries. | | Emerging |
| 313 | dayyass/muse-as-service<br>REST API for sentence tokenization and embedding using Multilingual... | | Emerging |
| 314 | ssoudan/gcp-vertex-ai-generative-ai<br>An async client library for GCP Vertex AI Generative models | | Emerging |
| 315 | nlpcloud/nlpcloud-js<br>NLP Cloud serves high performance pre-trained or custom models for NER,... | | Emerging |
| 316 | maragudk/gai<br>Go Artificial Intelligence (GAI) helps you work with foundational models,... | | Emerging |
| 317 | gentaiscool/distfuse<br>A library to calculate similarity scores between two collections of text... | | Emerging |
| 318 | sacdallago/bio_embeddings<br>Get protein embeddings from protein sequences | | Emerging |
| 319 | geetanjaliapp/geetanjali<br>RAG-powered ethical decision guidance from Bhagavad Geeta. Analyze dilemmas,... | | Emerging |
| 320 | flipz357/S3BERT<br>Semantically Structured Sentence Embeddings | | Emerging |
| 321 | DeepChainBio/bio-transformers<br>bio-transformers is a wrapper on top of the ESM/Protbert model, trained on... | | Emerging |
| 322 | ramanujammv1988/edge-veda<br>On-device AI SDK for Flutter — LLM inference, vision, STT, TTS, image... | | Emerging |
| 323 | rom1504/image_embeddings<br>Using efficientnet to provide embeddings for retrieval | | Emerging |
| 324 | mims-harvard/GraphXAI<br>GraphXAI: Resource to support the development and evaluation of GNN explainers | | Emerging |
| 325 | SylphxAI/coderag<br>Lightning-fast semantic code search with AST chunking (15+ languages) -... | | Emerging |
| 326 | matte1782/lecture-mind<br>VL-JEPA Lecture Summarizer - Event-aware lecture summarization using... | | Emerging |
| 327 | Mubelotix/SimRepo<br>Shows similar repositories in the sidebar | | Emerging |
| 328 | spring-petclinic/spring-petclinic-langchain4j<br>Spring Petclinic application with a chatbot powered by OpenAI's Generative... | | Emerging |
| 329 | lpalbou/AbstractCore<br>A unified Python library for interaction with multiple Large Language Model... | | Emerging |
| 330 | bnosac/ruimtehol<br>R package to Embed All the Things! using StarSpace | | Emerging |
| 331 | haven-jeon/LegalQA<br>Korean LegalQA using SentenceKoBART | | Emerging |
| 332 | nixiesearch/nixiesearch<br>Hybrid search engine, combining best features of text and semantic search worlds | | Emerging |
| 333 | IngressTechnology/jimbomesh-holler-server<br>Open source AI inference server with Model Marketplace, Document RAG, and... | | Emerging |
| 334 | wikipedia2vec/wikipedia2vec<br>A tool for learning vector representations of words and entities from Wikipedia | | Emerging |
| 335 | setu4993/convert-labse-tf-pt<br>Convert LaBSE model from TF Hub to PyTorch. | | Emerging |
| 336 | mainlp/semantic_components<br>Finding semantic components in your neural representations. | | Emerging |
| 337 | babylonhealth/fastText_multilingual<br>Multilingual word vectors in 78 languages | | Emerging |
| 338 | tomohiro-owada/devrag<br>Markdown vector search MCP server for Claude Code. Natural language search... | | Emerging |
| 339 | ltgoslo/simple_elmo<br>Simple library to work with pre-trained ELMo models in TensorFlow | | Emerging |
| 340 | DmitryKey/bert-solr-search<br>Search with BERT vectors in Solr, Elasticsearch, OpenSearch and GSI APU | | Emerging |
| 341 | ireapps/ire-archive-frontend<br>SvelteKit frontend for archive.ire.org | | Emerging |
| 342 | jeanCarloMachado/PythonSearch<br>A minimalistic search engine for productivity that stores documents as code | | Emerging |
| 343 | snu-mllab/KVzip<br>[NeurIPS'25 Oral] Query-agnostic KV cache eviction: 3–4× reduction in memory... | | Emerging |
| 344 | dungle-scrubs/hippo<br>Persistent memory for AI agents — facts, semantic search, conflict... | | Emerging |
| 345 | etalab-ia/mediatech<br>Collection of public datasets from the French administration, vectorized and... | | Emerging |
| 346 | drittich/SemanticSlicer<br>🧠✂️ SemanticSlicer — A smart text chunker for LLM-ready documents. | | Emerging |
| 347 | maxoodf/word2vec<br>word2vec++ is a Distributed Representations of Words (word2vec) library and... | | Emerging |
| 348 | veekaybee/what_are_embeddings<br>A deep dive into embeddings starting from fundamentals | | Emerging |
| 349 | finalfusion/finalfusion-python<br>Finalfusion embeddings in Python | | Emerging |
| 350 | HKUDS/XRec<br>[EMNLP'2024] "XRec: Large Language Models for Explainable Recommendation" | | Emerging |
| 351 | rodrigo-arenas/Graph-Embeddings<br>Graph and Nodes embeddings for downstream tasks | | Emerging |
| 352 | Abhinandan-Khurana/GitStarRecall<br>GitStarRecall is a local-first AI enabled web app that turns your GitHub... | | Emerging |
| 353 | Keerthivasan-Venkitajalam/Recall<br>Self-learning data agent delivering insights, not SQL. 6 layers of context... | | Emerging |
| 354 | Azure-Samples/azure-sql-db-session-recommender<br>Build a recommender using OpenAI, Azure Functions, Azure Static Web Apps,... | | Emerging |
| 355 | malllabiisc/cesi<br>WWW 2018: CESI: Canonicalizing Open Knowledge Bases using Embeddings and... | | Emerging |
| 356 | aravindhsampath/hobbyboard<br>hobbyboard is a self hosted image search and organization tool for... | | Emerging |
| 357 | smeznar/HVAE<br>An approach for embedding hierarchical structures into a continuous vector... | | Emerging |
| 358 | ibm-self-serve-assets/Watson-NLP<br>This collection demonstrates how to help you to quickly embed Watson NLP in... | | Emerging |
| 359 | EveripediaNetwork/fastc<br>Unattended Lightweight Text Classifiers with LLM Embeddings | | Emerging |
| 360 | Uni-Creator/RAG-MultiFile-QA<br>A RAG (Retrieval-Augmented Generation) AI chatbot that allows users to... | | Emerging |
| 361 | xlang-ai/instructor-embedding<br>[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings | | Emerging |
| 362 | QuartzUnit/embgrep<br>Local semantic search — embedding-powered grep for files, zero external services | | Emerging |
| 363 | navidgh66/omni_proof<br>Open-source causal-multimodal engine for creative attribution. Answers why a... | | Emerging |
| 364 | chrisfentiman/claude-context-cli<br>Auto-indexing CLI for claude-context-mcp (npm: claude-context-cli) | | Emerging |
| 365 | bnosac/doc2vec<br>Distributed Representations of Sentences and Documents | | Emerging |
| 366 | persiyanov/skip-thought-tf<br>An implementation of skip-thought vectors in Tensorflow | | Emerging |
| 367 | forjd/git-search<br>Semantic search over git commit history — local embeddings, sqlite-vec, terminal UI | | Emerging |
| 368 | cyberytti/ToolHunt<br>This is a local search engine to search for cybersecurity tools. It has... | | Emerging |
| 369 | ElieElDebs/Good-Karma<br>Good Karma is a SaaS that analyzes your Reddit posts and gives you KPIs and advice | | Emerging |
| 370 | colonelwatch/abstracts-search<br>Semantic search engine indexing 110 million academic publications | | Emerging |
| 371 | karlopintaric/omop-concept-automapper<br>An automated system for mapping source medical concepts to OMOP standard... | | Emerging |
| 372 | rekal-dev/rekal-cli<br>Git-anchored decentralised intent (conversation) ledger for teams who build with AI | | Emerging |
| 373 | benoitc/erlang-python<br>Execute Python from Erlang using dirty NIFs with GIL-aware execution, rate... | | Emerging |
| 374 | plasticityai/magnitude<br>A fast, efficient universal vector embedding utility package. | | Emerging |
| 375 | jbmiller10/semantik<br>Semantik is a self-hosted semantic search engine for your documents. | | Emerging |
| 376 | jina-ai/jina-grep-cli<br>Semantic grep powered by Jina embeddings v5 (MLX on Apple Silicon) | | Emerging |
| 377 | lexy-ai/lexy<br>Data pipelines for AI applications | | Emerging |
| 378 | pedugnat/dynnode2vec<br>dynnode2vec is a python package that implements algorithms to embed dynamic graphs | | Emerging |
| 379 | ISDO-TUM/Capstone-2025<br>AI-Agent powered academic paper discovery engine | | Emerging |
| 380 | gcorso/NeuroSEED<br>Implementation of Neural Distance Embeddings for Biological Sequences... | | Emerging |
| 381 | model-architectures/GRAPE<br>[ICLR 2026] GRAPE: Group Representational Position Encoding... | | Emerging |
| 382 | Mihaiii/semantic-autocomplete<br>A blazing-fast semantic search React component. Match by meaning, not just... | | Emerging |
| 383 | tbepler/prose<br>Multi-task and masked language model-based protein sequence embedding models. | | Emerging |
| 384 | ddangelov/RESTful-Top2Vec<br>Expose a Top2Vec model with a REST API. | | Emerging |
| 385 | VincenzoImp/job-search-tool<br>Automated job search and analysis tool powered by JobSpy. Features... | | Emerging |
| 386 | rodrigobressan/entity_embeddings_categorical<br>Discover relevant information about categorical data with entity embeddings... | | Emerging |
| 387 | TC407-api/Titan-Memory<br>Titan-Memory, it is a memory system that is not only for memory, but it... | | Emerging |
| 388 | vgel/repeng<br>A library for making RepE control vectors | | Emerging |
| 389 | specialprocedures/semnet<br>Semnet efficiently constructs graph structures from embeddings, enabling... | | Emerging |
| 390 | lh0x00/lightweight-embeddings<br>LightweightEmbeddings is a fast, free, and unlimited API service for... | | Emerging |
| 391 | fstamatelopoulos/cerefox<br>Personal knowledge base with hybrid search and read/write access for AI agents | | Emerging |
| 392 | itayzit/openai-async<br>A light-weight, asynchronous client for OpenAI API - text completion, image... | | Emerging |
| 393 | isaacus-dev/mleb<br>The code used to evaluate embedding models on the Massive Legal Embedding... | | Emerging |
| 394 | Atik203/Scholar-Flow<br>ScholarFlow is a SaaS platform designed for researchers, students,... | | Emerging |
| 395 | HKUDS/RLMRec<br>[WWW'2024] "RLMRec: Representation Learning with Large Language Models for... | | Emerging |
| 396 | bheinzerling/bpemb<br>Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE) | | Emerging |
| 397 | REASY/k8s-ariadne-rs<br>Query Kubernetes with natural language by compiling English to Cypher. No... | | Emerging |
| 398 | PranavMotarwar/raglineage<br>Lineage-aware RAG engine for auditable, reproducible, versioned retrieval and answers | | Emerging |
| 399 | sagarmk/beacon-plugin<br>Semantic code search plugin for Claude Code using hybrid vector search +... | | Emerging |
| 400 | sismetanin/sentiment-analysis-of-tweets-in-russian<br>Sentiment analysis of tweets in Russian using Convolutional Neural Networks... | | Emerging |