Llm Implementation Tutorials Transformer Models

There are 43 llm implementation tutorials models tracked. 4 score above 70 (verified tier). The highest-rated is AI-Hypercomputer/maxtext at 92/100 with 2,169 stars and 1,029 monthly downloads. 3 of the top 10 are actively maintained.

Get all 43 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-implementation-tutorials&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 AI-Hypercomputer/maxtext

A simple, performant and scalable Jax LLM!

92
Verified
2 mindspore-lab/mindnlp

MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore...

76
Verified
3 rasbt/reasoning-from-scratch

Implement a reasoning LLM in PyTorch from scratch, step by step

71
Verified
4 mosaicml/llm-foundry

LLM training code for Databricks foundation models

71
Verified
5 rickiepark/llm-from-scratch

<밑바닥부터 만들면서 공부하는 LLM>(길벗, 2025)의 코드 저장소

55
Established
6 rllm-team/rllm

Pytorch Library for Relational Table Learning with LLMs.

54
Established
7 CASE-Lab-UMD/LLM-Drop

The official implementation of the paper "Uncovering the Redundancy in...

52
Established
8 ridgerchu/matmulfreellm

Implementation for MatMul-free LM.

50
Established
9 FareedKhan-dev/train-llama4

Building LLaMA 4 MoE from Scratch

45
Emerging
10 xinzhanguo/hellollm

pre train a new llm

44
Emerging
11 joyehuang/minimind-notes

🚀 [从零构建 LLM] 极简大模型训练原理与实践指南。包含 Transformer, Pretraining, SFT 核心代码与对照实验。 | A...

44
Emerging
12 Tongjilibo/build_MiniLLM_from_scratch

从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)

43
Emerging
13 shivendrra/SmallLanguageModel

a LLM cookbook, for building your own from scratch, all the way from...

42
Emerging
14 AviSoori1x/seemore

From scratch implementation of a vision language model in pure PyTorch

42
Emerging
15 JohnMachado11/Build-a-Large-Language-Model-from-Scratch

Building a GPT-like LLM from scratch with PyTorch.

41
Emerging
16 NVIDIA/logits-processor-zoo

A collection of LogitsProcessors to customize and enhance LLM behavior for...

41
Emerging
17 ChaitanyaK77/Building-a-Small-Language-Model-SLM-

This Repository provides a Jupyter Notebook for building a small language...

41
Emerging
18 fangpin/llm-from-scratch

Build LLM from scratch

41
Emerging
19 zeyadusf/LLMs-from-Scratch

Build a Large Language Model (From Scratch) book and Finetuned Models

40
Emerging
20 donaldafeith/Pytorch_Merge

Merge LLM that are split in to parts

40
Emerging
21 rasbt/pytorch-memory-optim

This code repository contains the code used for my "Optimizing Memory Usage...

38
Emerging
22 ronniross/attention-heatmap-visualizer

A set of scripts to generate full attention-head heatmaps for transformer-based LLMs

35
Emerging
23 GeeeekExplorer/transformers-patch

patches for huggingface transformers to save memory

35
Emerging
24 OpenNLPLab/TransnormerLLM

Official implementation of TransNormerLLM: A Faster and Better LLM

35
Emerging
25 hitz-zentroa/whisper-lm-transformers

Add n-gram and LLM language model support to HF Transformers Whisper models.

35
Emerging
26 hesamsheikh/llm-mechanics

Coding an LLM and its building blocks from scratch.

34
Emerging
27 ai-glimpse/toyllm

ToyLLM: Learning LLM from Scratch

33
Emerging
28 myscience/x-lstm

Pytorch implementation of the xLSTM model by Beck et al. (2024)

32
Emerging
29 microsoft/encoder-decoder-slm

Efficient encoder-decoder architecture for small language models (≤1B...

32
Emerging
30 waltonfuture/InstructionGPT-4

InstructionGPT-4

32
Emerging
31 Utshav-paudel/LLM-Zero-to-Hero

This repo contains the resources, projects and documentation of mine while...

31
Emerging
32 OpenVanguard/remma-o1

Remma-O1: An open-source Language Model with 1.17B Params, built on pytorch...

31
Emerging
33 kmkrofficial/LiteGPT

LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and...

30
Emerging
34 KillerShoaib/RLM-From-Scratch

Implementation of Recursive Language Model paper from scratch

28
Experimental
35 muna-ai/muna-predictors

Interesting Python functions compiled to run anywhere with Muna.

27
Experimental
36 feifeibear/Odysseus-Transformer

Odysseus: Playground of LLM Sequence Parallelism

27
Experimental
37 Nikshaan/llm-from-scratch

Implementation of build a LLM from scratch by Sebastian Raschka.

27
Experimental
38 ksm26/Pretraining-LLMs

Master the essential steps of pretraining large language models (LLMs)....

25
Experimental
39 AIDajiangtang/LLM-from-scratch

从零开始学大模型Transformer、GPT2、BERT pre-training and fine-tuning from scratch

21
Experimental
40 villagecomputing/superpipe

Superpipe - optimized LLM pipelines for structured data

21
Experimental
41 10-OASIS-01/Autoregressive-Language-Model

This project is a comprehensive implementation of a Transformer-based...

14
Experimental
42 sanyalsunny111/Early_Weight_Avg

[COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training

12
Experimental
43 Ki-Seki/Awesome-Transformer-Visualization

Explore visualization tools for understanding Transformer-based large...

11
Experimental