Self-Hosted Embedding Servers (Embedding Tools)

This category covers deployable embedding API services that run locally or on your own infrastructure and expose OpenAI-compatible or custom endpoints. It does not include embedding models themselves, inference libraries, or managed embedding API providers.

This category tracks 83 self-hosted embedding server tools. Three score above 70 (Verified tier). The highest-rated is FlagOpen/FlagEmbedding at 79/100 with 11,395 stars. Two of the top 10 are actively maintained.

Get all 83 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=self-hosted-embedding-servers&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
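The same request can be made from Python. The following is a minimal sketch using only the standard library; the helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the returned JSON is not documented on this page, so the code returns the parsed payload without assuming any field names. The exact meaning of the `limit` parameter is also an assumption taken from the curl command above.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="embeddings",
              subcategory="self-hosted-embedding-servers",
              limit=20):
    """Build the request URL with the same query parameters as the curl example."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Fetch the URL and return the decoded JSON payload as-is.

    The response schema is an unknown here, so no keys are accessed.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects(build_url())` performs the unauthenticated request, which counts against the 100 requests/day allowance mentioned above.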

# | Tool | Description | Score | Tier
1 | FlagOpen/FlagEmbedding | Retrieval and Retrieval-augmented LLMs | 79 | Verified
2 | Blaizzy/mlx-embeddings | MLX-Embeddings is the best package for running Vision and Language Embedding... | 76 | Verified
3 | qdrant/fastembed | Fast, Accurate, Lightweight Python library to make State of the Art Embedding | 74 | Verified
4 | Merck/Sapiens | Sapiens is a human antibody language model based on BERT. | 69 | Established
5 | amansrivastava17/embedding-as-service | One-Stop Solution to encode sentence to fixed length vectors from various... | 53 | Established
6 | IlyasMoutawwakil/py-txi | A Python wrapper around HuggingFace's TGI (text-generation-inference) and... | 53 | Established
7 | jkrukowski/swift-embeddings | Run embedding models locally in Swift using MLTensor. | 52 | Established
8 | jina-ai/examples | Jina examples and demos to help you get started | 51 | Established
9 | freelawproject/inception | Our microservice for generating embeddings from blocks of text | 50 | Established
10 | minimaxir/imgbeddings | Python package to generate image embeddings with CLIP without PyTorch/TensorFlow | 48 | Emerging
11 | simonw/llm-embed-jina | Embedding models from Jina AI | 46 | Emerging
12 | dayyass/muse-as-service | REST API for sentence tokenization and embedding using Multilingual... | 46 | Emerging
13 | ddangelov/RESTful-Top2Vec | Expose a Top2Vec model with a REST API. | 44 | Emerging
14 | n24q02m/qwen3-embed | Lightweight ONNX inference for Qwen3 embedding and reranking models | 43 | Emerging
15 | rag-wtf/open-text-embeddings | Open Source Text Embedding Models with OpenAI Compatible API | 43 | Emerging
16 | josephrmartinez/recipe-dataset | Datasette tutorial. Calculate and query embeddings on 5,000 rows in a sqlite... | 42 | Emerging
17 | LLukas22/tei-client | Convenience Client for Hugging Face Text Embeddings Inference (TEI) with... | 41 | Emerging
18 | ART-Group-it/KERMIT | 🐸 KERMIT - A lightweight library to encode and interpret Universal... | 39 | Emerging
19 | yuvrajangadsingh/vemb | httpie for embeddings. Embed text, images, audio, video, and PDFs from the... | 39 | Emerging
20 | jina-ai/jina-grep-cli | Semantic grep powered by Jina embeddings v5 (MLX on Apple Silicon) | 39 | Emerging
21 | lh0x00/lightweight-embeddings | LightweightEmbeddings is a fast, free, and unlimited API service for... | 37 | Emerging
22 | jina-ai/mlx-retrieval | Train embedding and reranker models for retrieval tasks on Apple Silicon with MLX | 37 | Emerging
23 | toshsan/embedding-server | Drop in replacement for OpenAI's embedding API. Self Hosted. | 36 | Emerging
24 | ejaasaari/lemur | LEMUR reduces multi-vector retrieval for late interaction models such as... | 36 | Emerging
25 | jina-ai/cli | All Jina AI APIs as Unix CLI commands. Search, read, embed, rerank - with pipes. | 33 | Emerging
26 | 623637646/EmbeddedScrollView | Embedded UIScrollView for iOS. | 33 | Emerging
27 | jina-ai/jina-sagemaker | Jina Embedding Models on AWS SageMaker | 31 | Emerging
28 | jakedahn/qwen3-embeddings-mlx | MLX-powered Qwen3 embedding server for Apple Silicon Macs. Features ... | 31 | Emerging
29 | struct-chat/embedding | Vector Embedding Server in under 100 lines of code | 29 | Experimental
30 | Abhishek6353/AllMiniLML6V2-coreml | CoreML conversion of all-MiniLM-L6-v2 with a full SwiftUI demo, tokenizer... | 28 | Experimental
31 | louisbrulenaudet/lemone-api | Lemone: the API for french tax law and embeddings computation 🇫🇷 | 28 | Experimental
32 | MindHackingHappiness/EI-harness-lite | Light Python 3.x+ wrapper for our MHH EI_for_AI super prompt. Also js client. | 27 | Experimental
33 | IvanCampos/openai-text-embedding | Uncover hidden connections and find the most semantically similar text to... | 26 | Experimental
34 | dadoomer/sentence-transformers-server | Your own API endpoint to perform NLP functions like semantic search,... | 26 | Experimental
35 | noe/seqp | Sequence persistence library for Python | 24 | Experimental
36 | theseedship/n8n_embeddings_qwen3_integration | Use this advanced node (tool or embedding) for Qwen3 embeddings (fit all... | 24 | Experimental
37 | dnys1/embedding_explorer | Experiment with text embedding models locally in your browser. | 24 | Experimental
38 | artryazanov/embedding-service | This is a FastAPI-based service for generating text embeddings, supporting... | 23 | Experimental
39 | Maplecoder18/Qwen3-VL-Embedding | 🌟 Enhance visual and textual understanding with Qwen3-VL-Embedding and... | 23 | Experimental
40 | dust-ai-mr/dust-nlp | Dust Actor library for interacting with LLMs and embedding engines | 23 | Experimental
41 | thinkbigcd/embedding-service | api service for generating and managing text embeddings | 22 | Experimental
42 | dsjacobsen/embedding-service | A high-performance FastAPI service that generates vector embeddings for... | 22 | Experimental
43 | bambara-martial/jina-grep-cli | Enable semantic grep and code search locally on Apple Silicon using Jina... | 22 | Experimental
44 | rogelioRuiz/dust-embeddings-capacitor | On-device text embedding generation for iOS and Android via Capacitor | 22 | Experimental
45 | elvatis/openclaw-gpu-bridge | OpenClaw plugin: Offload heavy compute (embeddings, BERTScore) to a remote GPU server | 22 | Experimental
46 | aicubetechnology/aicube-embedding2embedding | AICUBE Embedding2Embedding - Unlock advanced embedding translation between... | 21 | Experimental
47 | rogelioRuiz/dust-embeddings-swift | Standalone tokenizers and embedding runtime primitives for Dust — iOS/macOS | 19 | Experimental
48 | ethanlee928/mlx-embeddings-server | This package offers an OpenAI-compatible API server for mlx-embeddings | 19 | Experimental
49 | enot-style/imbeddings | A minimal FastAPI service for generating image embeddings using Hugging Face... | 19 | Experimental
50 | cwccie/netembeddings | Pre-computed vector embeddings for networking concepts — RFCs, CLI commands,... | 19 | Experimental
51 | enot-style/embeddings | OpenAI-compatible /v1/embeddings API for local Hugging Face text embedding... | 19 | Experimental
52 | rogelioRuiz/dust-embeddings-kotlin | Standalone tokenizers and embedding runtime primitives for on-device text embeddings | 19 | Experimental
53 | ayinedjimi/CUDAEmbeddings | GPU-accelerated embedding server for RAG systems - CUDA, FastAPI,... | 19 | Experimental
54 | ChasingBlu/CAIROS_Daemon | Python/ C/C++ embedding pipeline with a 2d-3d vector-coordinates converter.... | 19 | Experimental
55 | thiagosilvahyper/bihe-quantization | BIHE Protocol - Next-generation vector quantization combining E8 lattice... | 17 | Experimental
56 | fahmiaziz98/unified-embedding-api | A modular and open-source RAG-ready Embedding API supporting dense, sparse... | 17 | Experimental
57 | moda20/mes | Multimodal Embedding Service : This is a vibecodded application to serve as... | 16 | Experimental
58 | TriDefender/jina-embedding-server | I rewrote the wheel so you don't have to pay for embed or rerank. The... | 16 | Experimental
59 | startupradar/demo-find-similar-startups | Find similar startups with our API and OpenAI's embeddings | 16 | Experimental
60 | Vokturz/fast-embeddings-api | fast-embeddings-api | 16 | Experimental
61 | Blase-Labs/blase | 'blase' is a Python library that enables users to train neural networks... | 15 | Experimental
62 | didinj/embeddings-and-vector-database-examples | Everything You Need to Know About Embeddings and Vector Databases | 15 | Experimental
63 | afriddev/EmbeRankis | EmbeRankis an open-source, production-ready service for embeddings and... | 15 | Experimental
64 | CtrlAltElite-Devs/embedding.worker.faculytics | Embedding Worker for Faculytics 2.0 | 14 | Experimental
65 | tubers9312345/mlx-serve-embeddings | 🧠 Run local Apple Silicon embedding models with MLX, offering fast, private,... | 14 | Experimental
66 | back2matching/turboquant-vectors | Compress embeddings 6x instantly with TurboQuant. First pip package using... | 14 | Experimental
67 | aperepel/mlx-serve-embeddings | Local embeddings server for Apple Silicon using MLX, providing... | 13 | Experimental
68 | AlwaysSany/huggingface-local-embedding | A Fast API server that provides local text and multi-modal embedding using... | 12 | Experimental
69 | kemingy/mosec_emb | Embedding service with mosec that is compatible with OpenAI API. | 12 | Experimental
70 | devflowinc/openembeddings | Self-hostable pay for what you use embedding server for bge-large-en and... | 12 | Experimental
71 | arterm-sedov/cmw-infinity | Infinity server setup and management for Infinity embedding and reranking... | 12 | Experimental
72 | nakedcity/zephyr | OpenAI-compatible embedding server built on pure ONNX Runtime—fast starts,... | 12 | Experimental
73 | different-ai/embedbase-js | moved https://github.com/different-ai/embedbase/tree/main/sdk/embedbase-js | 12 | Experimental
74 | ggwozdz90/embed-api | API for text embeddings using BGE-M3 model. Supports dense, sparse, and... | 11 | Experimental
75 | Alex-ML-labs/text-embedding-service-MLA- | FastAPI service for sentence embeddings & cosine similarity (MiniLM-L6-v2).... | 11 | Experimental
76 | SharvenRane/feature-store | Feature store implementation for image embeddings using Redis and Feast | 11 | Experimental
77 | DevWael/fastembed-service | A self-hosted embedding generation API optimized for ARM architecture and... | 11 | Experimental
78 | anvitha-sm/embedvisor | Embeddings web app + package + CLI in Python for data preprocessing:... | 11 | Experimental
79 | K31NER/openai-embeddings-proxy | OpenAI API-compatible embeddings proxy in FastAPI that exposes a... | 11 | Experimental
80 | kaovern/embeddrix | A stupid simple service to generate text embeddings | 10 | Experimental
81 | rhangelxs/russian_embeddings | API server for word embeddings for Russian language | 10 | Experimental
82 | MongoExpUser/Text-and-Image-Embeddings-for-PostgreSQL | Generate Text and Image Embeddings | 10 | Experimental
83 | acantarero/embedding_service | FastAPI service to generate text embeddings. Currently supports instructor... | 10 | Experimental
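
The relationship between score and tier in the listing above can be sketched as a simple threshold function. The cutoffs below (70, 50, 30) are assumptions inferred from the listed rows, not documented rules: the page only states that scores above 70 fall in the Verified tier, while the other boundaries are guessed from the observed data (50 is Established, 48 and 31 are Emerging, 29 is Experimental). The function name `tier_for` is hypothetical.

```python
# Assumed cutoffs, inferred from the listing; the site may use different rules.
ASSUMED_TIERS = [
    (70, "Verified"),
    (50, "Established"),
    (30, "Emerging"),
]


def tier_for(score):
    """Return the tier label for a 0-100 score under the assumed cutoffs."""
    for minimum, label in ASSUMED_TIERS:
        if score >= minimum:
            return label
    return "Experimental"
```

Under these assumed cutoffs, the function reproduces the tier shown for every row in the listing. Whether a score of exactly 70 or 30 lands in the higher tier cannot be confirmed from this page, since no listed project has either score.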