FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
45
/ 100
Emerging
2,717 stars. No commits in the last 6 months.
Stale 6m
No Package
No Dependents
Maintenance
0 / 25
Adoption
10 / 25
Maturity
16 / 25
Community
19 / 25
Stars
2,717
Forks
195
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Jun 25, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/FasterDecoding/Medusa"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
NX-AI/xlstm
Official repository of the xLSTM.
70
DashyDashOrg/pandas-llm
Pandas-LLM
56
sinanuozdemir/oreilly-hands-on-gpt-llm
Mastering the Art of Scalable and Efficient AI Model Deployment
52
wxhcore/bumblecore
An LLM training framework built from the ground up, featuring a custom BumbleBee architecture...
49
MiniMax-AI/MiniMax-01
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model &...
48