Madhan230205/token-reducer
⚡ Cut Claude token usage by 90%+ — free, open-source, local-first context compression for Claude Code. Hybrid RAG (BM25 + ONNX vectors), AST chunking, reranking. No API needed.
Stars: 7
Forks: 1
Language: Python
License: MIT
Category:
Last pushed: Apr 03, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/Madhan230205/token-reducer"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
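The same request can be made from Python. The snippet below is a minimal sketch using only the standard library. The endpoint URL comes from the curl example above. Several things are assumptions not confirmed by this page: that the response body is JSON, that the `embeddings` path segment is a category name, and the helper names `build_url` and `fetch_quality`, which are hypothetical. How an API key would be passed is not stated here, so the sketch covers only the keyless tier.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is
# "<segment>/<owner>/<repo>".
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(segment: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example.

    `segment` is the path part before the repo ("embeddings" in the
    example). Treating it as a category name is an assumption.
    """
    # Keep the slash between owner and repo name unescaped.
    return f"{BASE_URL}/{quote(segment)}/{quote(repo, safe='/')}"


def fetch_quality(segment: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repo.

    Assumes the API returns a JSON body; json.loads raises ValueError
    if it does not.
    """
    url = build_url(segment, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request, counts against the daily limit):
# data = fetch_quality("embeddings", "Madhan230205/token-reducer")
# print(json.dumps(data, indent=2))
```

Because the keyless tier allows 100 requests/day, it may be worth caching the result locally instead of calling `fetch_quality` on every run.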
Higher-rated alternatives
vectorlessflow/vectorless
Vectorless is a hierarchical, reasoning-native document intelligence engine. 🌟 Star if you like it!
MisterTK/semantex
Local semantic code search — hybrid ColBERT + BM25, 34% better than grep, 222x fewer tokens for AI
sanjeevafk/BibleLM
A Bible chatbot powered by Retrieval-Augmented Generation (RAG), designed to uphold fidelity to...
LARIkoz/ai-model-benchmarks
119 AI models × 55 benchmarks with per-score freshness dates, auto-updated pricing, task...
gabonavarroo/faultmap
Automatically discover where and why your LLM is failing — embedding-space clustering +...