All Embedding Tools

4,013 tools ranked by quality score · Page 4 of 41

Showing 301–400 of 4,013
# Tool Score Tier
301 hayabhay/frogbase

Transform audio-visual content into navigable knowledge.

39
Emerging
302 aiplanethub/beyondllm

Build, evaluate and observe LLM apps

39
Emerging
303 cofin/mogemma

🔥 Python / Mojo Interface for Google Gemma 3

39
Emerging
304 simonw/llm-embed-jina

Embedding models from Jina AI

39
Emerging
305 nomic-ai/semantic-search-app-template

Tutorial and template for a semantic search app powered by the Atlas...

39
Emerging
306 lgalke/vec4ir

Word Embeddings for Information Retrieval

39
Emerging
307 BryanChasko/kiro-cli-notes

Professional Kiro CLI setup guide based on 10 tutorial videos with proper...

39
Emerging
308 kreuzberg-dev/kreuzberg-surrealdb

Extract, chunk, and embed documents from 88+ formats directly into SurrealDB.

39
Emerging
309 SeanLee97/AnglE

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and...

39
Emerging
310 calebevans/cordon

Reduce logs to their semantic anomalies

39
Emerging
311 oborchers/Fast_Sentence_Embeddings

Compute Sentence Embeddings Fast!

39
Emerging
312 tca19/dict2vec

Dict2vec is a framework to learn word embeddings using lexical dictionaries.

39
Emerging
313 dayyass/muse-as-service

REST API for sentence tokenization and embedding using Multilingual...

39
Emerging
314 ssoudan/gcp-vertex-ai-generative-ai

An async client library for GCP Vertex AI Generative models

39
Emerging
315 nlpcloud/nlpcloud-js

NLP Cloud serves high performance pre-trained or custom models for NER,...

39
Emerging
316 maragudk/gai

Go Artificial Intelligence (GAI) helps you work with foundational models,...

39
Emerging
317 gentaiscool/distfuse

A library to calculate similarity scores between two collections of text...

39
Emerging
318 sacdallago/bio_embeddings

Get protein embeddings from protein sequences

39
Emerging
319 geetanjaliapp/geetanjali

RAG-powered ethical decision guidance from Bhagavad Geeta. Analyze dilemmas,...

39
Emerging
320 flipz357/S3BERT

Semantically Structured Sentence Embeddings

39
Emerging
321 DeepChainBio/bio-transformers

bio-transformers is a wrapper on top of the ESM/Protbert model, trained on...

39
Emerging
322 ramanujammv1988/edge-veda

On-device AI SDK for Flutter — LLM inference, vision, STT, TTS, image...

39
Emerging
323 rom1504/image_embeddings

Using efficientnet to provide embeddings for retrieval

39
Emerging
324 mims-harvard/GraphXAI

GraphXAI: Resource to support the development and evaluation of GNN explainers

39
Emerging
325 SylphxAI/coderag

Lightning-fast semantic code search with AST chunking (15+ languages) -...

38
Emerging
326 matte1782/lecture-mind

VL-JEPA Lecture Summarizer - Event-aware lecture summarization using...

38
Emerging
327 Mubelotix/SimRepo

Shows similar repositories in the sidebar

38
Emerging
328 spring-petclinic/spring-petclinic-langchain4j

Spring Petclinic application with a chatbot powered by OpenAI's Generative...

38
Emerging
329 lpalbou/AbstractCore

A unified Python library for interaction with multiple Large Language Model...

38
Emerging
330 bnosac/ruimtehol

R package to Embed All the Things! using StarSpace

38
Emerging
331 haven-jeon/LegalQA

Korean LegalQA using SentenceKoBART

38
Emerging
332 nixiesearch/nixiesearch

Hybrid search engine, combining best features of text and semantic search worlds

38
Emerging
333 IngressTechnology/jimbomesh-holler-server

Open source AI inference server with Model Marketplace, Document RAG, and...

38
Emerging
334 wikipedia2vec/wikipedia2vec

A tool for learning vector representations of words and entities from Wikipedia

38
Emerging
335 setu4993/convert-labse-tf-pt

Convert LaBSE model from TF Hub to PyTorch.

38
Emerging
336 mainlp/semantic_components

Finding semantic components in your neural representations.

38
Emerging
337 babylonhealth/fastText_multilingual

Multilingual word vectors in 78 languages

38
Emerging
338 tomohiro-owada/devrag

Markdown vector search MCP server for Claude Code. Natural language search...

38
Emerging
339 ltgoslo/simple_elmo

Simple library to work with pre-trained ELMo models in TensorFlow

38
Emerging
340 DmitryKey/bert-solr-search

Search with BERT vectors in Solr, Elasticsearch, OpenSearch and GSI APU

38
Emerging
341 ireapps/ire-archive-frontend

SvelteKit frontend for archive.ire.org

38
Emerging
342 jeanCarloMachado/PythonSearch

A minimalistic search engine for productivity that stores documents as code

38
Emerging
343 snu-mllab/KVzip

[NeurIPS'25 Oral] Query-agnostic KV cache eviction: 3–4× reduction in memory...

38
Emerging
344 dungle-scrubs/hippo

Persistent memory for AI agents — facts, semantic search, conflict...

38
Emerging
345 etalab-ia/mediatech

Collection of public datasets from the French administration, vectorized and...

38
Emerging
346 drittich/SemanticSlicer

🧠✂️ SemanticSlicer — A smart text chunker for LLM-ready documents.

38
Emerging
347 maxoodf/word2vec

word2vec++ is a Distributed Representations of Words (word2vec) library and...

38
Emerging
348 veekaybee/what_are_embeddings

A deep dive into embeddings starting from fundamentals

38
Emerging
349 finalfusion/finalfusion-python

Finalfusion embeddings in Python

38
Emerging
350 HKUDS/XRec

[EMNLP'2024] "XRec: Large Language Models for Explainable Recommendation"

38
Emerging
351 rodrigo-arenas/Graph-Embeddings

Graph and Nodes embeddings for downstream tasks

38
Emerging
352 Abhinandan-Khurana/GitStarRecall

GitStarRecall is a local-first AI enabled web app that turns your GitHub...

38
Emerging
353 Keerthivasan-Venkitajalam/Recall

Self-learning data agent delivering insights, not SQL. 6 layers of context...

38
Emerging
354 Azure-Samples/azure-sql-db-session-recommender

Build a recommender using OpenAI, Azure Functions, Azure Static Web Apps,...

38
Emerging
355 malllabiisc/cesi

WWW 2018: CESI: Canonicalizing Open Knowledge Bases using Embeddings and...

38
Emerging
356 aravindhsampath/hobbyboard

hobbyboard is a self hosted image search and organization tool for...

38
Emerging
357 smeznar/HVAE

An approach for embedding hierarchical structures into a continuous vector...

38
Emerging
358 ibm-self-serve-assets/Watson-NLP

This collection demonstrates how to help you to quickly embed Watson NLP in...

38
Emerging
359 EveripediaNetwork/fastc

Unattended Lightweight Text Classifiers with LLM Embeddings

38
Emerging
360 Uni-Creator/RAG-MultiFile-QA

A RAG (Retrieval-Augmented Generation) AI chatbot that allows users to...

38
Emerging
361 xlang-ai/instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

38
Emerging
362 QuartzUnit/embgrep

Local semantic search — embedding-powered grep for files, zero external services

38
Emerging
363 navidgh66/omni_proof

Open-source causal-multimodal engine for creative attribution. Answers why a...

38
Emerging
364 chrisfentiman/claude-context-cli

Auto-indexing CLI for claude-context-mcp (npm: claude-context-cli)

38
Emerging
365 bnosac/doc2vec

Distributed Representations of Sentences and Documents

38
Emerging
366 persiyanov/skip-thought-tf

An implementation of skip-thought vectors in Tensorflow

38
Emerging
367 forjd/git-search

Semantic search over git commit history — local embeddings, sqlite-vec, terminal UI

38
Emerging
368 cyberytti/ToolHunt

This is a local search engine to search for cybersecurity tools. It has...

37
Emerging
369 ElieElDebs/Good-Karma

Good Karma is a SaaS that analyzes your Reddit posts and gives you KPIs and advice

37
Emerging
370 colonelwatch/abstracts-search

Semantic search engine indexing 110 million academic publications

37
Emerging
371 karlopintaric/omop-concept-automapper

An automated system for mapping source medical concepts to OMOP standard...

37
Emerging
372 rekal-dev/rekal-cli

Git-anchored decentralised intent (conversation) ledger for teams who build with AI

37
Emerging
373 benoitc/erlang-python

Execute Python from Erlang using dirty NIFs with GIL-aware execution, rate...

37
Emerging
374 plasticityai/magnitude

A fast, efficient universal vector embedding utility package.

37
Emerging
375 jbmiller10/semantik

Semantik is a self-hosted semantic search engine for your documents.

37
Emerging
376 jina-ai/jina-grep-cli

Semantic grep powered by Jina embeddings v5 (MLX on Apple Silicon)

37
Emerging
377 lexy-ai/lexy

Data pipelines for AI applications

37
Emerging
378 pedugnat/dynnode2vec

dynnode2vec is a python package that implements algorithms to embed dynamic graphs

37
Emerging
379 ISDO-TUM/Capstone-2025

AI-Agent powered academic paper discovery engine

37
Emerging
380 gcorso/NeuroSEED

Implementation of Neural Distance Embeddings for Biological Sequences...

37
Emerging
381 model-architectures/GRAPE

[ICLR 2026] GRAPE: Group Representational Position Encoding...

37
Emerging
382 Mihaiii/semantic-autocomplete

A blazing-fast semantic search React component. Match by meaning, not just...

37
Emerging
383 tbepler/prose

Multi-task and masked language model-based protein sequence embedding models.

37
Emerging
384 ddangelov/RESTful-Top2Vec

Expose a Top2Vec model with a REST API.

37
Emerging
385 VincenzoImp/job-search-tool

Automated job search and analysis tool powered by JobSpy. Features...

37
Emerging
386 rodrigobressan/entity_embeddings_categorical

Discover relevant information about categorical data with entity embeddings...

37
Emerging
387 TC407-api/Titan-Memory

Titan-Memory is a memory system that is not only for memory, but it...

37
Emerging
388 vgel/repeng

A library for making RepE control vectors

37
Emerging
389 specialprocedures/semnet

Semnet efficiently constructs graph structures from embeddings, enabling...

37
Emerging
390 lh0x00/lightweight-embeddings

LightweightEmbeddings is a fast, free, and unlimited API service for...

37
Emerging
391 fstamatelopoulos/cerefox

Personal knowledge base with hybrid search and read/write access for AI agents

37
Emerging
392 itayzit/openai-async

A light-weight, asynchronous client for OpenAI API - text completion, image...

37
Emerging
393 isaacus-dev/mleb

The code used to evaluate embedding models on the Massive Legal Embedding...

37
Emerging
394 Atik203/Scholar-Flow

ScholarFlow is a SaaS platform designed for researchers, students,...

37
Emerging
395 HKUDS/RLMRec

[WWW'2024] "RLMRec: Representation Learning with Large Language Models for...

37
Emerging
396 bheinzerling/bpemb

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

37
Emerging
397 REASY/k8s-ariadne-rs

Query Kubernetes with natural language by compiling English to Cypher. No...

37
Emerging
398 PranavMotarwar/raglineage

Lineage-aware RAG engine for auditable, reproducible, versioned retrieval and answers

37
Emerging
399 sagarmk/beacon-plugin

Semantic code search plugin for Claude Code using hybrid vector search +...

37
Emerging
400 sismetanin/sentiment-analysis-of-tweets-in-russian

Sentiment analysis of tweets in Russian using Convolutional Neural Networks...

37
Emerging