and2049/mamba-slm-hybrid-optimizer

Small Language Model Implementation based on Mamba (SSM) architecture. Muon optimizer to the 2D weight matrices while using the stable AdamW optimizer for all other parameters.

15
/ 100
Experimental
No Package No Dependents
Maintenance 6 / 25
Adoption 0 / 25
Maturity 9 / 25
Community 0 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Last pushed

Nov 02, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/and2049/mamba-slm-hybrid-optimizer"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.