peva3/SmarterRouter

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

35
/ 100
Emerging

Performs automatic model profiling and benchmarking on local hardware to build decision models for intelligent routing, then continuously learns from user interactions to refine selections. Supports hot-swapping models without restarts, multi-GPU VRAM awareness across NVIDIA/AMD/Intel/Apple Silicon, and can seamlessly blend local Ollama/llama.cpp models with external cloud providers (OpenAI, Anthropic, etc.) under unified intelligent routing.

No Package No Dependents
Maintenance 10 / 25
Adoption 8 / 25
Maturity 11 / 25
Community 6 / 25

How are scores calculated?

Stars

63

Forks

3

Language

Python

License

MIT

Last pushed

Mar 04, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/peva3/SmarterRouter"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.