LLM Implementation Tutorials: Transformer Models
This category tracks 43 LLM implementation tutorial models. Four score above 70 (Verified tier). The highest-rated is AI-Hypercomputer/maxtext at 92/100, with 2,169 stars and 1,029 monthly downloads. Three of the top 10 are actively maintained.
Get all 43 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-implementation-tutorials&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
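The same request can be made from Python. The following is a minimal sketch using only the standard library; the helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the JSON response is not documented on this page, so the code returns the parsed payload without assuming any field names. Whether the endpoint accepts `limit` values above 20 is also an assumption left to the caller.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint and query parameters are copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=20):
    """Return the dataset URL for this category (hypothetical helper)."""
    params = {
        "domain": "transformers",
        "subcategory": "llm-implementation-tutorials",
        "limit": limit,
    }
    return f"{BASE_URL}?{urlencode(params)}"


def fetch_projects(limit=20, timeout=30):
    """Fetch and parse the JSON payload (hypothetical helper).

    The response schema is an unknown here, so the parsed object is
    returned as-is for the caller to inspect.
    """
    with urlopen(build_url(limit), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects()` performs the network request; without a key, the 100 requests/day limit mentioned above applies.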
| # | Model | Score | Tier |
|---|---|---|---|
| 1 | AI-Hypercomputer/maxtext<br>A simple, performant and scalable Jax LLM! | 92 | Verified |
| 2 | mindspore-lab/mindnlp<br>MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore... | | Verified |
| 3 | rasbt/reasoning-from-scratch<br>Implement a reasoning LLM in PyTorch from scratch, step by step | | Verified |
| 4 | mosaicml/llm-foundry<br>LLM training code for Databricks foundation models | | Verified |
| 5 | rickiepark/llm-from-scratch<br>Code repository for the book "Learning LLMs by Building from Scratch" (Gilbut, 2025) | | Established |
| 6 | rllm-team/rllm<br>Pytorch Library for Relational Table Learning with LLMs. | | Established |
| 7 | CASE-Lab-UMD/LLM-Drop<br>The official implementation of the paper "Uncovering the Redundancy in... | | Established |
| 8 | ridgerchu/matmulfreellm<br>Implementation for MatMul-free LM. | | Established |
| 9 | FareedKhan-dev/train-llama4<br>Building LLaMA 4 MoE from Scratch | | Emerging |
| 10 | xinzhanguo/hellollm<br>pre train a new llm | | Emerging |
| 11 | joyehuang/minimind-notes<br>🚀 [Build an LLM from scratch] A minimalist guide to the principles and practice of training large models. Includes core code and controlled experiments for Transformer, Pretraining, and SFT. \| A... | | Emerging |
| 12 | Tongjilibo/build_MiniLLM_from_scratch<br>Build a MiniLLM from 0 to 1 (pretrain+sft+dpo in practice) | | Emerging |
| 13 | shivendrra/SmallLanguageModel<br>a LLM cookbook, for building your own from scratch, all the way from... | | Emerging |
| 14 | AviSoori1x/seemore<br>From scratch implementation of a vision language model in pure PyTorch | | Emerging |
| 15 | JohnMachado11/Build-a-Large-Language-Model-from-Scratch<br>Building a GPT-like LLM from scratch with PyTorch. | | Emerging |
| 16 | NVIDIA/logits-processor-zoo<br>A collection of LogitsProcessors to customize and enhance LLM behavior for... | | Emerging |
| 17 | ChaitanyaK77/Building-a-Small-Language-Model-SLM-<br>This Repository provides a Jupyter Notebook for building a small language... | | Emerging |
| 18 | fangpin/llm-from-scratch<br>Build LLM from scratch | | Emerging |
| 19 | zeyadusf/LLMs-from-Scratch<br>Build a Large Language Model (From Scratch) book and Finetuned Models | | Emerging |
| 20 | donaldafeith/Pytorch_Merge<br>Merge LLM that are split in to parts | | Emerging |
| 21 | rasbt/pytorch-memory-optim<br>This code repository contains the code used for my "Optimizing Memory Usage... | | Emerging |
| 22 | ronniross/attention-heatmap-visualizer<br>A set of scripts to generate full attention-head heatmaps for transformer-based LLMs | | Emerging |
| 23 | GeeeekExplorer/transformers-patch<br>patches for huggingface transformers to save memory | | Emerging |
| 24 | OpenNLPLab/TransnormerLLM<br>Official implementation of TransNormerLLM: A Faster and Better LLM | | Emerging |
| 25 | hitz-zentroa/whisper-lm-transformers<br>Add n-gram and LLM language model support to HF Transformers Whisper models. | | Emerging |
| 26 | hesamsheikh/llm-mechanics<br>Coding an LLM and its building blocks from scratch. | | Emerging |
| 27 | ai-glimpse/toyllm<br>ToyLLM: Learning LLM from Scratch | | Emerging |
| 28 | myscience/x-lstm<br>Pytorch implementation of the xLSTM model by Beck et al. (2024) | | Emerging |
| 29 | microsoft/encoder-decoder-slm<br>Efficient encoder-decoder architecture for small language models (≤1B... | | Emerging |
| 30 | waltonfuture/InstructionGPT-4<br>InstructionGPT-4 | | Emerging |
| 31 | Utshav-paudel/LLM-Zero-to-Hero<br>This repo contains the resources, projects and documentation of mine while... | | Emerging |
| 32 | OpenVanguard/remma-o1<br>Remma-O1: An open-source Language Model with 1.17B Params, built on pytorch... | | Emerging |
| 33 | kmkrofficial/LiteGPT<br>LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and... | | Emerging |
| 34 | KillerShoaib/RLM-From-Scratch<br>Implementation of Recursive Language Model paper from scratch | | Experimental |
| 35 | muna-ai/muna-predictors<br>Interesting Python functions compiled to run anywhere with Muna. | | Experimental |
| 36 | feifeibear/Odysseus-Transformer<br>Odysseus: Playground of LLM Sequence Parallelism | | Experimental |
| 37 | Nikshaan/llm-from-scratch<br>Implementation of build a LLM from scratch by Sebastian Raschka. | | Experimental |
| 38 | ksm26/Pretraining-LLMs<br>Master the essential steps of pretraining large language models (LLMs).... | | Experimental |
| 39 | AIDajiangtang/LLM-from-scratch<br>Learn large models from scratch: Transformer, GPT2, BERT pre-training and fine-tuning from scratch | | Experimental |
| 40 | villagecomputing/superpipe<br>Superpipe - optimized LLM pipelines for structured data | | Experimental |
| 41 | 10-OASIS-01/Autoregressive-Language-Model<br>This project is a comprehensive implementation of a Transformer-based... | | Experimental |
| 42 | sanyalsunny111/Early_Weight_Avg<br>[COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training | | Experimental |
| 43 | Ki-Seki/Awesome-Transformer-Visualization<br>Explore visualization tools for understanding Transformer-based large... | | Experimental |