vcache-project/vCache

Reliable and Efficient Semantic Prompt Caching with vCache

Emerging

Implements online-learned decision boundaries for semantic similarity detection, eliminating manual threshold tuning while guaranteeing user-specified error rate bounds. Sits between application servers and LLM backends (OpenAI, Anthropic, or on-prem models), using embedding-based similarity matching with pluggable vector databases (HNSWLib), eviction policies (FIFO, LRU, MRU, SCU), and similarity evaluators. Provides modular configuration to swap inference engines, embedding models, and storage backends for RAG pipelines, agentic systems, and database-driven LLM workloads.
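The flow described above (embed the prompt, find the nearest cached prompt, decide whether it is close enough to reuse, otherwise call the backend and evict when full) can be sketched roughly as follows. This is an illustrative toy, not vCache's API: `ToySemanticCache`, `toy_embed`, and `fake_llm` are hypothetical names, the linear scan stands in for a vector database such as HNSWLib, and the fixed `threshold` is a deliberate simplification of the kind of static cutoff that vCache's online-learned decision boundaries are meant to replace. Eviction here is LRU only.

```python
import math
from collections import OrderedDict
from typing import Callable, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


class ToySemanticCache:
    """Hypothetical semantic cache: static threshold, linear scan, LRU eviction."""

    def __init__(self, embed: Callable[[str], List[float]],
                 llm: Callable[[str], str],
                 threshold: float = 0.9, capacity: int = 2) -> None:
        self.embed = embed
        self.llm = llm
        self.threshold = threshold  # assumption: fixed cutoff, not a learned boundary
        self.capacity = capacity
        # prompt -> (embedding, response), ordered from least to most recently used
        self.entries: "OrderedDict[str, Tuple[List[float], str]]" = OrderedDict()

    def query(self, prompt: str) -> Tuple[str, bool]:
        """Return (response, was_cache_hit)."""
        vector = self.embed(prompt)
        best_key, best_score = None, -1.0
        for key, (cached_vector, _) in self.entries.items():
            score = cosine_similarity(vector, cached_vector)
            if score > best_score:
                best_key, best_score = key, score
        if best_key is not None and best_score >= self.threshold:
            self.entries.move_to_end(best_key)  # mark as most recently used
            return self.entries[best_key][1], True
        response = self.llm(prompt)  # cache miss: forward to the backend
        self.entries[prompt] = (vector, response)
        self.entries.move_to_end(prompt)
        if len(self.entries) > self.capacity:
            self.entries.popitem(last=False)  # evict the least recently used entry
        return response, False


# Stand-ins for a real embedding model and LLM backend (both hypothetical).
VOCAB = ["capital", "france", "what", "is", "the", "of", "weather", "today"]
calls: List[str] = []


def toy_embed(text: str) -> List[float]:
    words = text.lower().replace("?", "").split()
    return [float(words.count(word)) for word in VOCAB]


def fake_llm(prompt: str) -> str:
    calls.append(prompt)
    return f"answer to: {prompt}"


cache = ToySemanticCache(toy_embed, fake_llm, threshold=0.9, capacity=2)
first = cache.query("What is the capital of France?")   # miss: backend is called
second = cache.query("what is the capital of france")   # hit: same toy embedding
third = cache.query("What is the weather today?")       # miss: similarity too low
```

A static cutoff like this forces a trade-off between wrong cache hits and missed reuse, which is the manual tuning problem the description says vCache avoids by learning decision boundaries under a user-specified error bound.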

No Package · No Dependents

Maintenance 6 / 25

Adoption 8 / 25

Maturity 15 / 25

Community 7 / 25

Language

Python

License

—

Higher-rated alternatives

RediSearch/RediSearch

A query and indexing engine for Redis, providing secondary indexing, full-text search, vector...

redis/redis-vl-python

Redis Vector Library (RedisVL) -- the AI-native Python client for Redis.

redis-developer/redis-ai-resources

✨ A curated list of awesome community resources, integrations, and examples of Redis in the AI ecosystem.

luyug/GradCache

Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

redis-developer/redis-product-search

Visual and semantic vector similarity with Redis Stack, FastAPI, PyTorch and Huggingface.
