mad011408/IndexCache

Accelerate DeepSeek Sparse Attention models by reusing indexes across layers, cutting computation and speeding up inference with minimal quality loss.

/ 100

Experimental

No License · No Package · No Dependents

Maintenance 13 / 25

Adoption 0 / 25

Maturity 1 / 25

Community 0 / 25

Stars

—

Forks

—

Language

—

License

—

Category

Last pushed

Mar 28, 2026

Commits (30d)

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/mad011408/IndexCache"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
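For scripted use, the same request could be made from Python. The following is a minimal sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path pattern is inferred from the single curl example above, and the response is assumed (not confirmed) to be JSON; its fields are not documented on this page, so the sketch does not read any of them.

```python
import json
import urllib.parse
import urllib.request

# Base path copied from the curl example; everything after it is an inferred pattern.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL as <base>/<category>/<owner>/<repo> (assumed pattern)."""
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record and decode it, assuming the body is JSON."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the network request.
    print(build_quality_url("vector-db", "mad011408", "IndexCache"))
```

Keeping URL construction separate from the network call makes it easy to check the request target offline, and to stay within the unauthenticated daily request limit while experimenting.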

Higher-rated alternatives

RediSearch/RediSearch

A query and indexing engine for Redis, providing secondary indexing, full-text search, vector...

redis/redis-vl-python

Redis Vector Library (RedisVL) -- the AI-native Python client for Redis.

redis-developer/redis-ai-resources

✨ A curated list of awesome community resources, integrations, and examples of Redis in the AI ecosystem.

luyug/GradCache

Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

redis-developer/redis-product-search

Visual and semantic vector similarity with Redis Stack, FastAPI, PyTorch and Huggingface.