Self-Hosted Embedding Servers Embedding Tools
Deployable embedding API services that run locally or on your own infrastructure, providing OpenAI-compatible or custom endpoints. Does NOT include embedding models themselves, inference libraries, or managed embedding API providers.
There are 83 self-hosted embedding servers tools tracked. 3 score above 70 (verified tier). The highest-rated is FlagOpen/FlagEmbedding at 79/100 with 11,395 stars. 2 of the top 10 are actively maintained.
Get all 83 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=self-hosted-embedding-servers&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs |
|
Verified |
| 2 |
Blaizzy/mlx-embeddings
MLX-Embeddings is the best package for running Vision and Language Embedding... |
|
Verified |
| 3 |
qdrant/fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding |
|
Verified |
| 4 |
Merck/Sapiens
Sapiens is a human antibody language model based on BERT. |
|
Established |
| 5 |
amansrivastava17/embedding-as-service
One-Stop Solution to encode sentence to fixed length vectors from various... |
|
Established |
| 6 |
IlyasMoutawwakil/py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and... |
|
Established |
| 7 |
jkrukowski/swift-embeddings
Run embedding models locally in Swift using MLTensor. |
|
Established |
| 8 |
jina-ai/examples
Jina examples and demos to help you get started |
|
Established |
| 9 |
freelawproject/inception
Our microservice for generating embeddings from blocks of text |
|
Established |
| 10 |
minimaxir/imgbeddings
Python package to generate image embeddings with CLIP without PyTorch/TensorFlow |
|
Emerging |
| 11 |
simonw/llm-embed-jina
Embedding models from Jina AI |
|
Emerging |
| 12 |
dayyass/muse-as-service
REST API for sentence tokenization and embedding using Multilingual... |
|
Emerging |
| 13 |
ddangelov/RESTful-Top2Vec
Expose a Top2Vec model with a REST API. |
|
Emerging |
| 14 |
n24q02m/qwen3-embed
Lightweight ONNX inference for Qwen3 embedding and reranking models |
|
Emerging |
| 15 |
rag-wtf/open-text-embeddings
Open Source Text Embedding Models with OpenAI Compatible API |
|
Emerging |
| 16 |
josephrmartinez/recipe-dataset
Datasette tutorial. Calculate and query embeddings on 5,000 rows in a sqlite... |
|
Emerging |
| 17 |
LLukas22/tei-client
Convenience Client for Hugging Face Text Embeddings Inference (TEI) with... |
|
Emerging |
| 18 |
ART-Group-it/KERMIT
🐸 KERMIT - A lightweight library to encode and interpret Universal... |
|
Emerging |
| 19 |
yuvrajangadsingh/vemb
httpie for embeddings. Embed text, images, audio, video, and PDFs from the... |
|
Emerging |
| 20 |
jina-ai/jina-grep-cli
Semantic grep powered by Jina embeddings v5 (MLX on Apple Silicon) |
|
Emerging |
| 21 |
lh0x00/lightweight-embeddings
LightweightEmbeddings is a fast, free, and unlimited API service for... |
|
Emerging |
| 22 |
jina-ai/mlx-retrieval
Train embedding and reranker models for retrieval tasks on Apple Silicon with MLX |
|
Emerging |
| 23 |
toshsan/embedding-server
Drop in replacement for OpenAI's embedding API. Self Hosted. |
|
Emerging |
| 24 |
ejaasaari/lemur
LEMUR reduces multi-vector retrieval for late interaction models such as... |
|
Emerging |
| 25 |
jina-ai/cli
All Jina AI APIs as Unix CLI commands. Search, read, embed, rerank - with pipes. |
|
Emerging |
| 26 |
623637646/EmbeddedScrollView
Embedded UIScrollView for iOS. |
|
Emerging |
| 27 |
jina-ai/jina-sagemaker
Jina Embedding Models on AWS SageMaker |
|
Emerging |
| 28 |
jakedahn/qwen3-embeddings-mlx
MLX-powered Qwen3 embedding server for Apple Silicon Macs. Features ... |
|
Emerging |
| 29 |
struct-chat/embedding
Vector Embedding Server in under 100 lines of code |
|
Experimental |
| 30 |
Abhishek6353/AllMiniLML6V2-coreml
CoreML conversion of all-MiniLM-L6-v2 with a full SwiftUI demo, tokenizer... |
|
Experimental |
| 31 |
louisbrulenaudet/lemone-api
Lemone: the API for french tax law and embeddings computation 🇫🇷 |
|
Experimental |
| 32 |
MindHackingHappiness/EI-harness-lite
Light Python 3.x+ wrapper for our MHH EI_for_AI super prompt. Also js client. |
|
Experimental |
| 33 |
IvanCampos/openai-text-embedding
Uncover hidden connections and find the most semantically similar text to... |
|
Experimental |
| 34 |
dadoomer/sentence-transformers-server
Your own API endpoint to perform NLP functions like semantic search,... |
|
Experimental |
| 35 |
noe/seqp
Sequence persistence library for Python |
|
Experimental |
| 36 |
theseedship/n8n_embeddings_qwen3_integration
Use this advanced node (tool or embedding) for Qwen3 embeddings (fit all... |
|
Experimental |
| 37 |
dnys1/embedding_explorer
Experiment with text embedding models locally in your browser. |
|
Experimental |
| 38 |
artryazanov/embedding-service
This is a FastAPI-based service for generating text embeddings, supporting... |
|
Experimental |
| 39 |
Maplecoder18/Qwen3-VL-Embedding
🌟 Enhance visual and textual understanding with Qwen3-VL-Embedding and... |
|
Experimental |
| 40 |
dust-ai-mr/dust-nlp
Dust Actor library for interacting with LLMs and embedding engines |
|
Experimental |
| 41 |
thinkbigcd/embedding-service
api service for generating and managing text embeddings |
|
Experimental |
| 42 |
dsjacobsen/embedding-service
A high-performance FastAPI service that generates vector embeddings for... |
|
Experimental |
| 43 |
bambara-martial/jina-grep-cli
Enable semantic grep and code search locally on Apple Silicon using Jina... |
|
Experimental |
| 44 |
rogelioRuiz/dust-embeddings-capacitor
On-device text embedding generation for iOS and Android via Capacitor |
|
Experimental |
| 45 |
elvatis/openclaw-gpu-bridge
OpenClaw plugin: Offload heavy compute (embeddings, BERTScore) to a remote GPU server |
|
Experimental |
| 46 |
aicubetechnology/aicube-embedding2embedding
AICUBE Embedding2Embedding - Unlock advanced embedding translation between... |
|
Experimental |
| 47 |
rogelioRuiz/dust-embeddings-swift
Standalone tokenizers and embedding runtime primitives for Dust — iOS/macOS |
|
Experimental |
| 48 |
ethanlee928/mlx-embeddings-server
This package offers an OpenAI-compatible API server for mlx-embeddings |
|
Experimental |
| 49 |
enot-style/imbeddings
A minimal FastAPI service for generating image embeddings using Hugging Face... |
|
Experimental |
| 50 |
cwccie/netembeddings
Pre-computed vector embeddings for networking concepts — RFCs, CLI commands,... |
|
Experimental |
| 51 |
enot-style/embeddings
OpenAI-compatible /v1/embeddings API for local Hugging Face text embedding... |
|
Experimental |
| 52 |
rogelioRuiz/dust-embeddings-kotlin
Standalone tokenizers and embedding runtime primitives for on-device text embeddings |
|
Experimental |
| 53 |
ayinedjimi/CUDAEmbeddings
GPU-accelerated embedding server for RAG systems - CUDA, FastAPI,... |
|
Experimental |
| 54 |
ChasingBlu/CAIROS_Daemon
Python/ C/C++ embedding pipeline with a 2d-3d vector-coordinates converter.... |
|
Experimental |
| 55 |
thiagosilvahyper/bihe-quantization
BIHE Protocol - Next-generation vector quantization combining E8 lattice... |
|
Experimental |
| 56 |
fahmiaziz98/unified-embedding-api
A modular and open-source RAG-ready Embedding API supporting dense, sparse... |
|
Experimental |
| 57 |
moda20/mes
Multimodal Embedding Service : This is a vibecodded application to serve as... |
|
Experimental |
| 58 |
TriDefender/jina-embedding-server
I rewrote the wheel so you don't have to pay for embed or rerank. The... |
|
Experimental |
| 59 |
startupradar/demo-find-similar-startups
Find similar startups with our API and OpenAI's embeddings |
|
Experimental |
| 60 |
Vokturz/fast-embeddings-api
fast-embeddings-api |
|
Experimental |
| 61 |
Blase-Labs/blase
'blase' is a Python library that enables users to train neural networks... |
|
Experimental |
| 62 |
didinj/embeddings-and-vector-database-examples
Everything You Need to Know About Embeddings and Vector Databases |
|
Experimental |
| 63 |
afriddev/EmbeRankis
EmbeRankis an open-source, production-ready service for embeddings and... |
|
Experimental |
| 64 |
CtrlAltElite-Devs/embedding.worker.faculytics
Embedding Worker for Faculytics 2.0 |
|
Experimental |
| 65 |
tubers9312345/mlx-serve-embeddings
🧠 Run local Apple Silicon embedding models with MLX, offering fast, private,... |
|
Experimental |
| 66 |
back2matching/turboquant-vectors
Compress embeddings 6x instantly with TurboQuant. First pip package using... |
|
Experimental |
| 67 |
aperepel/mlx-serve-embeddings
Local embeddings server for Apple Silicon using MLX, providing... |
|
Experimental |
| 68 |
AlwaysSany/huggingface-local-embedding
A Fast API server that provides local text and multi-modal embedding using... |
|
Experimental |
| 69 |
kemingy/mosec_emb
Embedding service with mosec that is compatible with OpenAI API. |
|
Experimental |
| 70 |
devflowinc/openembeddings
Self-hostable pay for what you use embedding server for bge-large-en and... |
|
Experimental |
| 71 |
arterm-sedov/cmw-infinity
Infinity server setup and management for Infinity embedding and reranking... |
|
Experimental |
| 72 |
nakedcity/zephyr
OpenAI-compatible embedding server built on pure ONNX Runtime—fast starts,... |
|
Experimental |
| 73 |
different-ai/embedbase-js
moved https://github.com/different-ai/embedbase/tree/main/sdk/embedbase-js |
|
Experimental |
| 74 |
ggwozdz90/embed-api
API for text embeddings using BGE-M3 model. Supports dense, sparse, and... |
|
Experimental |
| 75 |
Alex-ML-labs/text-embedding-service-MLA-
FastAPI service for sentence embeddings & cosine similarity (MiniLM-L6-v2).... |
|
Experimental |
| 76 |
SharvenRane/feature-store
Feature store implementation for image embeddings using Redis and Feast |
|
Experimental |
| 77 |
DevWael/fastembed-service
A self-hosted embedding generation API optimized for ARM architecture and... |
|
Experimental |
| 78 |
anvitha-sm/embedvisor
Embeddings web app + package + CLI in Python for data preprocessing:... |
|
Experimental |
| 79 |
K31NER/openai-embeddings-proxy
Proxy de embeddings compatible con la API de OpenAI en FastAPI que expone un... |
|
Experimental |
| 80 |
kaovern/embeddrix
A stupid simple service to generate text embeddings |
|
Experimental |
| 81 |
rhangelxs/russian_embeddings
API server for word embeddings for Russian language |
|
Experimental |
| 82 |
MongoExpUser/Text-and-Image-Embeddings-for-PostgreSQL
Generate Text and Image Embeddings |
|
Experimental |
| 83 |
acantarero/embedding_service
FastAPI service to generate text embeddings. Currently supports instructor... |
|
Experimental |