romsto/Speculative-Decoding

Implementation of the paper Fast Inference from Transformers via Speculative Decoding, Leviathan et al. 2023.

/ 100

Emerging

101 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 20 / 25

Stars

101

Forks

Language

Python

License

MIT

Category

Last pushed

Dec 02, 2024

Commits (30d)

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/romsto/Speculative-Decoding"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

sgl-project/SpecForge

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

structuredllm/syncode

Efficient and general syntactical decoding for Large Language Models

SafeAILab/EAGLE

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

hao-ai-lab/JacobiForcing

Jacobi Forcing: Fast and Accurate Diffusion-style Decoding

torchspec-project/TorchSpec

A PyTorch native library for training speculative decoding models