amansrivastava17/embedding-as-service

One-stop solution for encoding sentences into fixed-length vectors using various embedding techniques

Established

Supports multiple embedding models, from static word vectors (Word2Vec) to transformer encoders (BERT, XLNet), with selectable pooling strategies (reduce_mean, reduce_max, etc.) that aggregate token embeddings into fixed-length sentence vectors. It can be used as a Python module or in a client-server setup, with separate `embedding-as-service` (server) and `embedding-as-service-client` packages, so inference can run on a different machine from the caller. Sequence length is configurable, and inputs are processed in batches.
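
To make the pooling step concrete, here is a minimal NumPy sketch of how per-token embeddings can be reduced to one fixed-length vector per sentence. It does not use the package's own API: the function name `pool_tokens`, the padding mask argument, and the choice to exclude padded positions are assumptions made for illustration. Only the strategy names `reduce_mean` and `reduce_max` come from the description above.

```python
import numpy as np

def pool_tokens(token_embeddings, mask, strategy="reduce_mean"):
    """Collapse (batch, seq_len, dim) token embeddings to (batch, dim).

    `mask` is (batch, seq_len), 1 for real tokens and 0 for padding.
    Hypothetical helper for illustration; not the library's implementation.
    """
    emb = np.asarray(token_embeddings, dtype=np.float64)
    m = np.asarray(mask, dtype=bool)[..., None]  # (batch, seq_len, 1)

    if strategy == "reduce_mean":
        # Sum real tokens only, then divide by the number of real tokens.
        counts = np.maximum(m.sum(axis=1), 1)  # avoid division by zero
        return np.where(m, emb, 0.0).sum(axis=1) / counts
    if strategy == "reduce_max":
        # Padding is set to -inf so it can never win the element-wise max.
        return np.where(m, emb, -np.inf).max(axis=1)
    raise ValueError(f"unknown pooling strategy: {strategy}")

# One sentence, two real tokens plus one padded position, dimension 2.
tokens = np.array([[[1.0, 2.0], [3.0, 4.0], [0.0, 0.0]]])
mask = np.array([[1, 1, 0]])

mean_vec = pool_tokens(tokens, mask, "reduce_mean")  # [[2. 3.]]
max_vec = pool_tokens(tokens, mask, "reduce_max")    # [[3. 4.]]
```

Whatever the sentence length, the output has shape `(batch, dim)`, which is what makes the resulting vectors directly comparable across sentences.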

210 stars. No commits in the last 6 months. Available on PyPI.

Stale 6m

Maintenance 0 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 18 / 25

Stars: 210

Language: Python

License: MIT

Featured in

Embeddings Are Easier Than Whatever You're Doing Instead

Related tools

FlagOpen/FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Blaizzy/mlx-embeddings

MLX-Embeddings is the best package for running Vision and Language Embedding models locally on...

qdrant/fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

Merck/Sapiens

Sapiens is a human antibody language model based on BERT.

IlyasMoutawwakil/py-txi

A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI...
