AIdevsmartdata/chimere

Rust-native MoE inference runtime with custom CUDA kernels for Blackwell GPUs. Includes DFlash speculative decoding, multi-tier Engram memory, and entropy-adaptive routing. Targets Qwen3.5-35B-A3B on a single RTX 5060 Ti 16GB.

24
/ 100
Experimental
No Package No Dependents
Maintenance 13 / 25
Adoption 2 / 25
Maturity 9 / 25
Community 0 / 25

How are scores calculated?

Stars

2

Forks

Language

Rust

License

Apache-2.0

Last pushed

Apr 07, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/AIdevsmartdata/chimere"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.