vcache-project/vCache
Reliable and Efficient Semantic Prompt Caching with vCache
vCache implements online-learned decision boundaries for semantic similarity detection, which removes manual threshold tuning while guaranteeing user-specified error-rate bounds. It sits between application servers and LLM backends (OpenAI, Anthropic, or on-prem models) and uses embedding-based similarity matching with pluggable vector databases (HNSWLib), eviction policies (FIFO, LRU, MRU, SCU), and similarity evaluators. Its modular configuration lets you swap inference engines, embedding models, and storage backends for RAG pipelines, agentic systems, and database-driven LLM workloads.
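To make the idea of embedding-based cache lookup with an eviction policy concrete, here is a minimal sketch in plain Python. It is not vCache's API: the class name `SemanticCache`, its methods, and the fixed `threshold` are all hypothetical. In particular, it swaps in a hand-set similarity threshold and a brute-force scan where the description above mentions learned decision boundaries and a vector database such as HNSWLib, and it shows only LRU eviction.

```python
import math
from collections import OrderedDict


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class SemanticCache:
    """Hypothetical fixed-threshold semantic cache with LRU eviction."""

    def __init__(self, capacity, threshold):
        self.capacity = capacity
        self.threshold = threshold  # assumption: set by hand, not learned online
        self._entries = OrderedDict()  # prompt -> (embedding, response)

    def get(self, embedding):
        """Return the cached response of the nearest entry, or None on a miss."""
        best_key, best_sim = None, -1.0
        for key, (cached_embedding, _) in self._entries.items():
            sim = cosine_similarity(embedding, cached_embedding)
            if sim > best_sim:
                best_key, best_sim = key, sim
        if best_key is not None and best_sim >= self.threshold:
            self._entries.move_to_end(best_key)  # mark as most recently used
            return self._entries[best_key][1]
        return None

    def put(self, prompt, embedding, response):
        """Store a response and evict the least recently used entry if full."""
        if prompt in self._entries:
            self._entries.move_to_end(prompt)
        self._entries[prompt] = (embedding, response)
        if len(self._entries) > self.capacity:
            self._entries.popitem(last=False)  # drop the least recently used entry
```

On a miss, the caller would forward the prompt to the LLM backend and call `put` with the result. A fixed threshold like this is exactly the manual tuning knob that the project description says vCache is designed to remove.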
Stars: 60
Forks: 3
Language: Python
License: —
Category:
Last pushed: Dec 17, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/vcache-project/vCache"
Open to everyone: 100 requests/day with no key needed, or 1,000/day with a free key.
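The same request can be made from Python using only the standard library. This is a sketch under assumptions: the endpoint URL is the one from the curl command above, while the `Accept` header, the helper names `build_request` and `fetch_quality`, and the expectation that the response body is JSON are not confirmed by this page.

```python
import json
import urllib.request

# URL copied from the curl command on this page.
API_URL = "https://pt-edge.onrender.com/api/v1/quality/vector-db/vcache-project/vCache"


def build_request(url=API_URL):
    """Build an unauthenticated GET request (no API key is sent)."""
    return urllib.request.Request(
        url,
        headers={"Accept": "application/json"},  # assumption: the API returns JSON
        method="GET",
    )


def fetch_quality(url=API_URL, timeout=10.0):
    """Send the request and parse the body as JSON (assumed response format)."""
    with urllib.request.urlopen(build_request(url), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality()` performs the network request and counts against the daily limit. How a key is passed for the higher limit is not stated here, so the sketch omits it.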
Higher-rated alternatives
RediSearch/RediSearch
A query and indexing engine for Redis, providing secondary indexing, full-text search, vector...
redis/redis-vl-python
Redis Vector Library (RedisVL) -- the AI-native Python client for Redis.
redis-developer/redis-ai-resources
✨ A curated list of awesome community resources, integrations, and examples of Redis in the AI ecosystem.
luyug/GradCache
Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint
redis-developer/redis-product-search
Visual and semantic vector similarity with Redis Stack, FastAPI, PyTorch and Huggingface.