MinishLab/model2vec-rs
Official Rust Implementation of Model2Vec
Compresses sentence transformers into lightweight static embeddings via Model2Vec distillation, supporting f32/f16/i8 weight types from Hugging Face Hub or local paths. Provides both a library API and CLI tool with batch processing and configurable sequence length/batch size parameters. Optimized for inference-only workloads requiring sub-100ms latency on CPU.
161 stars and 2,871 monthly downloads.
Stars
161
Forks
15
Language
Rust
License
MIT
Category
Last pushed
Feb 05, 2026
Monthly downloads
2,871
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/MinishLab/model2vec-rs"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
huggingface/text-embeddings-inference
A blazing fast inference solution for text embeddings models
Anush008/fastembed-rs
Rust library for vector embeddings and reranking.
finalfusion/finalfusion-rust
finalfusion embeddings in Rust
finalfusion/finalfusion-python
Finalfusion embeddings in Python
benoitc/erlang-python
Execute Python from Erlang using dirty NIFs with GIL-aware execution, rate limiting, and...