LLM Implementation Tutorials Transformer Models

This category tracks 43 LLM implementation tutorial projects. Four score above 70 (Verified tier). The highest-rated is AI-Hypercomputer/maxtext at 92/100, with 2,169 stars and 1,029 monthly downloads. Three of the top 10 are actively maintained.

Get all 43 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-implementation-tutorials&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
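The same request can be made from Python. The sketch below uses only the standard library and rebuilds the URL from the curl command above. The helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the returned JSON is not documented on this page, so the code parses the response and returns it without assuming any field names. Whether the endpoint accepts a `limit` above 20 is also an assumption.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="transformers",
              subcategory="llm-implementation-tutorials",
              limit=20):
    """Build the query URL (hypothetical helper, not part of any official client)."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Fetch and decode the JSON response.

    The response schema is an assumption: this returns whatever JSON
    the server sends, without relying on specific keys.
    """
    with urllib.request.urlopen(build_url(limit=limit), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects()` performs one request against the daily quota; passing a larger `limit` (for example 43) may or may not return every project, depending on what the API allows.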

#	Model	Score	Tier	Stars	Language
1	AI-Hypercomputer/maxtext A simple, performant and scalable Jax LLM!	92	Verified	2,169	Python
2	mindspore-lab/mindnlp MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore...	76	Verified	913	Python
3	rasbt/reasoning-from-scratch Implement a reasoning LLM in PyTorch from scratch, step by step	71	Verified	3,452	Jupyter Notebook
4	mosaicml/llm-foundry LLM training code for Databricks foundation models	71	Verified	4,397	Python
5	rickiepark/llm-from-scratch Code repository for the book "Learning LLMs by Building Them from the Ground Up" (Gilbut, 2025)	55	Established	97	Jupyter Notebook
6	rllm-team/rllm Pytorch Library for Relational Table Learning with LLMs.	54	Established	440	Python
7	CASE-Lab-UMD/LLM-Drop The official implementation of the paper "Uncovering the Redundancy in...	52	Established	189	Python
8	ridgerchu/matmulfreellm Implementation for MatMul-free LM.	50	Established	3,058	Python
9	FareedKhan-dev/train-llama4 Building LLaMA 4 MoE from Scratch	45	Emerging	72	Jupyter Notebook
10	xinzhanguo/hellollm pre train a new llm	44	Emerging	73	Python
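11	joyehuang/minimind-notes 🚀 [Building an LLM from scratch] A minimalist guide to the principles and practice of training large models. Includes core code and controlled experiments for Transformer, Pretraining, and SFT. | A...	44	Emerging	67	Python
12	Tongjilibo/build_MiniLLM_from_scratch Building a MiniLLM from 0 to 1 (pretrain + SFT + DPO, practice in progress)	43	Emerging	537	Python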
13	shivendrra/SmallLanguageModel a LLM cookbook, for building your own from scratch, all the way from...	42	Emerging	168	Jupyter Notebook
14	AviSoori1x/seemore From scratch implementation of a vision language model in pure PyTorch	42	Emerging	255	Jupyter Notebook
15	JohnMachado11/Build-a-Large-Language-Model-from-Scratch Building a GPT-like LLM from scratch with PyTorch.	41	Emerging	337	Python
16	NVIDIA/logits-processor-zoo A collection of LogitsProcessors to customize and enhance LLM behavior for...	41	Emerging	384	Python
17	ChaitanyaK77/Building-a-Small-Language-Model-SLM- This Repository provides a Jupyter Notebook for building a small language...	41	Emerging	32	Jupyter Notebook
18	fangpin/llm-from-scratch Build LLM from scratch	41	Emerging	97	Python
19	zeyadusf/LLMs-from-Scratch Build a Large Language Model (From Scratch) book and Finetuned Models	40	Emerging	184	Jupyter Notebook
20	donaldafeith/Pytorch_Merge Merge LLM that are split in to parts	40	Emerging	27	Python
21	rasbt/pytorch-memory-optim This code repository contains the code used for my "Optimizing Memory Usage...	38	Emerging	92	Python
22	ronniross/attention-heatmap-visualizer A set of scripts to generate full attention-head heatmaps for transformer-based LLMs	35	Emerging	13	Jupyter Notebook
23	GeeeekExplorer/transformers-patch patches for huggingface transformers to save memory	35	Emerging	35	Python
24	OpenNLPLab/TransnormerLLM Official implementation of TransNormerLLM: A Faster and Better LLM	35	Emerging	252	Python
25	hitz-zentroa/whisper-lm-transformers Add n-gram and LLM language model support to HF Transformers Whisper models.	35	Emerging	14	Python
26	hesamsheikh/llm-mechanics Coding an LLM and its building blocks from scratch.	34	Emerging	116	Jupyter Notebook
27	ai-glimpse/toyllm ToyLLM: Learning LLM from Scratch	33	Emerging	25	Python
28	myscience/x-lstm Pytorch implementation of the xLSTM model by Beck et al. (2024)	32	Emerging	183	Python
29	microsoft/encoder-decoder-slm Efficient encoder-decoder architecture for small language models (≤1B...	32	Emerging	32	Python
30	waltonfuture/InstructionGPT-4 InstructionGPT-4	32	Emerging	42	Python
31	Utshav-paudel/LLM-Zero-to-Hero This repo contains the resources, projects and documentation of mine while...	31	Emerging	34	Jupyter Notebook
32	OpenVanguard/remma-o1 Remma-O1: An open-source Language Model with 1.17B Params, built on pytorch...	31	Emerging	34	Python
33	kmkrofficial/LiteGPT LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and...	30	Emerging	34	Python
34	KillerShoaib/RLM-From-Scratch Implementation of Recursive Language Model paper from scratch	28	Experimental	38	Python
35	muna-ai/muna-predictors Interesting Python functions compiled to run anywhere with Muna.	27	Experimental	11	Python
36	feifeibear/Odysseus-Transformer Odysseus: Playground of LLM Sequence Parallelism	27	Experimental	79	Python
37	Nikshaan/llm-from-scratch Implementation of build a LLM from scratch by Sebastian Raschka.	27	Experimental	15	Python
38	ksm26/Pretraining-LLMs Master the essential steps of pretraining large language models (LLMs)....	25	Experimental	27	Jupyter Notebook
39	AIDajiangtang/LLM-from-scratch Learning large models from scratch: Transformer, GPT2, BERT pre-training and fine-tuning from scratch	21	Experimental	37	Jupyter Notebook
40	villagecomputing/superpipe Superpipe - optimized LLM pipelines for structured data	21	Experimental	109	Python
41	10-OASIS-01/Autoregressive-Language-Model This project is a comprehensive implementation of a Transformer-based...	14	Experimental	9	Python
42	sanyalsunny111/Early_Weight_Avg [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training	12	Experimental	19	Python
43	Ki-Seki/Awesome-Transformer-Visualization Explore visualization tools for understanding Transformer-based large...	11	Experimental	22	—

Comparisons in this category

llm-foundry and rllm (71 vs 54)
llm-foundry and rickiepark/llm-from-scratch (71 vs 55)
reasoning-from-scratch and fangpin/llm-from-scratch (71 vs 41)