qdrant/fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
Leverages ONNX Runtime instead of PyTorch to minimize dependencies and enable deployment in serverless environments like AWS Lambda. Supports dense embeddings, sparse embeddings (SPLADE++), late-interaction models (ColBERT), image embeddings, and cross-encoder reranking—with extensibility for custom models. Integrates directly with Qdrant vector database for end-to-end semantic search workflows.
2,771 stars. Used by 37 other packages. Actively maintained with 5 commits in the last 30 days. Available on PyPI.
Stars
2,771
Forks
184
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 12, 2026
Commits (30d)
5
Dependencies
10
Reverse dependents
37
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/qdrant/fastembed"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Blaizzy/mlx-embeddings
MLX-Embeddings is the best package for running Vision and Language Embedding models locally on...
Merck/Sapiens
Sapiens is a human antibody language model based on BERT.
amansrivastava17/embedding-as-service
One-Stop Solution to encode sentence to fixed length vectors from various embedding techniques
IlyasMoutawwakil/py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI...