anmolg1997/SLM-From-Scratch
Build small language models from scratch — BPE tokenizer, composable Transformer (RoPE, GQA, SwiGLU), DeepSpeed training, SFT/DPO/RLHF alignment, GGUF/ONNX export
Stars
—
Forks
—
Language
Python
License
MIT
Category
Last pushed
Mar 21, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/anmolg1997/SLM-From-Scratch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
lvapeab/nmt-keras
Neural Machine Translation with Keras
dair-ai/Transformers-Recipe
🧠 A study guide to learn about Transformers
jaketae/ensemble-transformers
Ensembling Hugging Face transformers made easy
SirawitC/Transformer_from_scratch_pytorch
Build a transformer model from scratch using pytorch to understand its inner workings and gain...
lof310/transformer
PyTorch implementation of the current SOTA Transformer. Configurable, efficient, and...