outmatic/TurboQuant
High-performance .NET implementation of Google's TurboQuant algorithm (ICLR 2026). Near-optimal vector quantization: compress embeddings to 2-4 bits with cosine > 0.995.
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/outmatic/TurboQuant"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
vcal-project/vcal-core
VCAL Core — high-performance semantic cache and vector cache library for LLM applications.
gorajing/zuhn
Personal knowledge operating system — ingests content, extracts insights via Claude, and stores...
stffns/snapvec
Fast compressed ANN search via randomized Hadamard transform + Lloyd-Max quantization. Pure NumPy.
stffns/hadamax
Fast compressed ANN search via randomized Hadamard transform and optimal Gaussian scalar...
ctx-0/embed-engine
resumable batch image embedding pipeline