LLM Implementation Tutorials LLM Tools
Educational repositories and step-by-step guides for building LLMs from scratch using frameworks like PyTorch and Transformers. Includes basic implementations, minimal examples, and learning-focused projects. Does NOT include production LLM frameworks, pre-trained model fine-tuning, or inference optimization tools.
There are 102 LLM implementation tutorial tools tracked. One scores above 70 (Verified tier). The highest-rated is Lightning-AI/litgpt at 81/100 with 13,225 stars and 15,196 monthly downloads. One of the top 10 is actively maintained.
Get all 102 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=llm-implementation-tutorials&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
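For scripted access, the same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the helper names `build_url` and `fetch_projects` are hypothetical, the response schema is not described on this page (so the decoded JSON is returned unchanged), and whether a larger `limit` returns all 102 projects is an assumption to check against the API.

```python
import json
import urllib.request
from urllib.parse import urlencode

# Endpoint and query parameters are taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(limit=20, domain="llm-tools",
              subcategory="llm-implementation-tutorials"):
    """Build the request URL (hypothetical helper, not part of any SDK)."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Fetch the dataset and decode the JSON body.

    The response structure is not documented on this page, so the
    decoded object is returned as-is rather than unpacked.
    """
    with urllib.request.urlopen(build_url(limit), timeout=timeout) as resp:
        return json.load(resp)


# Example (requires network access; counts against the daily request limit):
# data = fetch_projects()
# print(json.dumps(data, indent=2)[:500])
```

Keeping URL construction separate from the network call makes it easy to inspect the exact request before spending one of the 100 keyless requests per day.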
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 | Lightning-AI/litgpt: 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. | 81 | Verified |
| 2 | SPUTNIKAI/LeechTransformer: Leech-Lila: A Geometric Attention Transformer(Language Model) with the Leech... | | Emerging |
| 3 | liangyuwang/Tiny-DeepSpeed: Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library | | Emerging |
| 4 | Yigtwxx/PredictaLM: PredictaLM is a lightweight Turkish language model designed for next-word... | | Emerging |
| 5 | catherinesyeh/attention-viz: Visualizing query-key interactions in language + vision transformers (VIS 2023) | | Emerging |
| 6 | viralcode/superGPT: Train your own LLM from scratch | | Emerging |
| 7 | microsoft/Text2Grad: 🚀 Text2Grad: Converting natural language feedback into gradient signals for... | | Emerging |
| 8 | huangjia2019/llm-gpt: From classic NLP to modern LLMs: building language models step by step.... | | Emerging |
| 9 | FareedKhan-dev/Building-llama3-from-scratch: LLaMA 3 is one of the most promising open-source model after Mistral, we... | | Emerging |
| 10 | sondotpin/llm-scratch: ChatGPT-like LLM in PyTorch from scratch, step by step | | Emerging |
| 11 | FareedKhan-dev/train-tiny-llm: Train a 29M parameter GPT from Scratch | | Emerging |
| 12 | better-py/learn-py: Test code was written for research and verification of some Python libraries. | | Emerging |
| 13 | CoinCheung/gdGPT: Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline... | | Emerging |
| 14 | Arezkiiiii/mini_llm: 🚀 Build and understand a Large Language Model from scratch using PyTorch... | | Experimental |
| 15 | JoelDeonDsouza/Zenpool_LLM: Zenpool is a compact, fine-tuned MLL (Mini Language Learner) model | | Experimental |
| 16 | RWKV-Wiki/MultilingualShareGPT: MultilingualShareGPT, the free multi-language corpus for LLM training | | Experimental |
| 17 | loke-x/jam-gpt: An Experimental Reimplementation of LLM models for research and development process | | Experimental |
| 18 | GURPREETKAURJETHRA/Multi-GPU-Fine-Training-LLMs: Multi GPU Fine Training LLMs using DeepSpeed and Accelerate. | | Experimental |
| 19 | Ajax0564/VyomAI: VyomAI: state-of-the-art NLP LLM Vision MultiModel transformers ... | | Experimental |
| 20 | MistyozAI/CosmicFish: CosmicFish is a fully original, efficient transformer-based language model... | | Experimental |
| 21 | MuzzammilShah/Road-to-AI: A structured documentation hub for AI and ML concepts, based on Andrej... | | Experimental |
| 22 | Shinichi0713/LLM-fundamental-study: this site is the fundamental page of LLM-mechanism | | Experimental |
| 23 | sheikhcoders/SheikhLM: Blazingly fast and remarkably powerful. | | Experimental |
| 24 | wozeparrot/tinyrwkv: tinygrad port of the RWKV large language model. | | Experimental |
| 25 | harshtiwari01/llm-heatmap-visualizer: A set of scripts to generate full attention-head heatmaps for transformer-based LLMs | | Experimental |
| 26 | MDalamin5/Build-and-Finetune-LLM-From-Scratch-Deploy-via-vLLM-AWS-GCP: A complete end-to-end learning repo covering everything from building Large... | | Experimental |
| 27 | liangyuwang/train-large-model-from-scratch: A minimal, hackable pre-training stack for GPT-style language models | | Experimental |
| 28 | mtkaya/transformer-edge-optimization: Optimize Transformer models for edge devices | | Experimental |
| 29 | mytechnotalent/SARAN: SARAN: Shallow Auto-Regressive Attention Network | | Experimental |
| 30 | max1ab/smol-training-playbook-zh: Chinese translation of the Hugging Face article "The Smol Training Playbook" | | Experimental |
| 31 | Matonice/LLM_From_Scratch: Implementing all the parts and architecture of a large language model from scratch | | Experimental |
| 32 | filipsedivy/SLiM-CZ-V1: 🇨🇿 Slavic Linguistic integrated Micro-model for Czechia | | Experimental |
| 33 | logic-OT/Decoder-Only-LLM: This repository features a custom-built decoder-only language model (LLM)... | | Experimental |
| 34 | fahadsid1770/Aesop-LLM: custom Bengali Language Model built from scratch using PyTorch, featuring... | | Experimental |
| 35 | ag-sc/CRAM-Generation-LLM: Using ChatGPT & GPT-4 to generate CRAM & PyCRAM designators in a one-shot fashion | | Experimental |
| 36 | Michael-Jackson666/Zero2Hero-AI: From first principles to state-of-the-art: A hands-on journey implementing... | | Experimental |
| 37 | IAAR-Shanghai/Awesome-Attention-Heads: An awesome repository & A comprehensive survey on interpretability of LLM... | | Experimental |
| 38 | ranpy13/Learning-LLM: Learning to build LLM from scratch, following rasbt/LLMs-from-scratch footsteps. | | Experimental |
| 39 | kreasof-ai/Homunculus-Project: Long term project about a custom AI architecture. Consist of cutting-edge... | | Experimental |
| 40 | musa-kal/smoll-language-model: Educational language model implementation from scratch, featuring Bigram and... | | Experimental |
| 41 | AnkitaMungalpara/Building-LLM-From-Scratch: This repository provides a step-by-step guide to creating your own large... | | Experimental |
| 42 | adorow/LlmFromScratch: Notes and code from reading "Build a Large Language Model (From Scratch)" | | Experimental |
| 43 | ashrithssreddy/llm-from-scratch: Building GPTs from the ground up. A hands-on journey through attention... | | Experimental |
| 44 | Tek233/Build_a_Large_Language_Model_From_Scratch: Testing and experimenting with code from Sebastian Raschka’s book Build a... | | Experimental |
| 45 | mcbieda/llm-from-scratch: 1. A simple implementation of the LLM gpt-2 "from scratch" that allows easy... | | Experimental |
| 46 | wln20/Attention-Viewer: A plug-and-play tool for visualizing attention-score heatmap in generative... | | Experimental |
| 47 | SreeEswaran/Train-your-LLM: This repository contains code and resources for training, fine-tuning, and... | | Experimental |
| 48 | benthecoder/AI: learning AI from scratch | | Experimental |
| 49 | njmarko/llm-gpt-sort: A new type of sorting algorithm. Use large language model (llm like gpt,... | | Experimental |
| 50 | hulkiciray/llm_from_scratch: Comprehensive fresh start for how LLMs work | | Experimental |
| 51 | simply-pouria/The-LMs-Book: My study notes, code implementations, etc. while reading The Hundred-Page... | | Experimental |
| 52 | ikun-llm/ikun-2.5B: A large AI model with two and a half years of practice (actually 26M params; 2.5B = two and a half years) \| ikun meme-culture chatbot 🐔🏀 | | Experimental |
| 53 | snnclsr/chatgpt-from-scratch: A full-stack ChatGPT-like application built (almost) from scratch | | Experimental |
| 54 | aryagxr/llm-from-scratch: Implementation of LLM ✨from scratch✨ | | Experimental |
| 55 | ariannamethod/RRPRAM: RRPRAM: Recurcsive Resonant Pattern Recognition Attention Mechanism by Arianna Method | | Experimental |
| 56 | MAlshurbaji/LLM-Playground-From-Scratch: A hands-on educational repository demonstrating how to build a miniature LLM... | | Experimental |
| 57 | MaharshPatelX/multi-model-conda-environment: This GitHub repository provides a comprehensive guide on setting up a Conda... | | Experimental |
| 58 | ikun-llm/ikun-pretrain: Pretraining a language model from scratch \| Pretrain LLM from scratch 📚 | | Experimental |
| 59 | RajdeepKushwaha5/LLMs-from-scratch: This repository contains the code for developing, pretraining, and... | | Experimental |
| 60 | Kitsunp/Small-lenguaje-Model-Hybrid-Norm-Furier-Formers: A compact language model implementing HybridNorm and Fourier-based... | | Experimental |
| 61 | moorebrett0/microformer: A minimal, dual-memory transformer language model built from scratch in... | | Experimental |
| 62 | itxmjr/LLM-From-Scratch: A step-by-step Guide: Build a GPT-like LLM From Scratch using PyTorch | | Experimental |
| 63 | rudyon/pipeline: Training pipeline for LLMs in PyTorch. | | Experimental |
| 64 | nachammai779/Large-Language-Models-from-Scratch: 🧠 Building Large Language Models from the ground up: BPE tokenization,... | | Experimental |
| 65 | Adwerse/Mini_LLM: 🧠 Transformer built from scratch: RoPE, SwiGLU, KV-Cache, Flash Attention.... | | Experimental |
| 66 | CastorYu/train-hybrid-llm-from-scratch: A simplistic script for training your own hybrid llm (using autoregressive... | | Experimental |
| 67 | FreezB11/PsyDuck: a 60M parameter LLM from scratch | | Experimental |
| 68 | prasanna00019/Small-Language-Models: A collection of Small Language Models (SLMs) built from scratch in PyTorch. | | Experimental |
| 69 | JerryPan2718/flexgpt: Tradeoff between runtime and RAM usage for large language model inference. | | Experimental |
| 70 | CS-433/ml-project-2-mlp: Advancing Homepage2Vec with LLM-Generated Datasets for Multilingual Website... | | Experimental |
| 71 | ictup/Build-a-LLM-model-from-scratch-simple: LLM pipeline: data→tokenizer→attention→GPT train/eval→instruction... | | Experimental |
| 72 | pprp/smol_training_zh: "The Smol Training Playbook": the secrets to building world-class large models | | Experimental |
| 73 | bobo0810/MiniGPT-4-DeepSpeed: MiniGPT-4 accelerated with DeepSpeed ➕ scaled-up model size ➕ experimental analysis | | Experimental |
| 74 | UEFI-code/PyTorch_For_PoorGuys: This is a special PyTorch For Poor Guys Who can't afford big GPU | | Experimental |
| 75 | Antoinechss/LLM-from-scratch: Building a LLM (GPT-3 like architecture) from scratch in Python and training... | | Experimental |
| 76 | FranxYao/Language-Model-Pretraining-for-Text-Generation: LM pretraining for generation, reading list, resources, conference mappings. | | Experimental |
| 77 | feljost/fgpt: Building a conversational LLM from scratch | | Experimental |
| 78 | daniel-mehta/CooperLM-354M: A 354M parameter GPT-2 model trained on filtered Wikipedia, BookCorpus, and... | | Experimental |
| 79 | ajheshbasnet/model-reincarnated: a collection of re-implementations of renowned artificial intelligence... | | Experimental |
| 80 | Daegybyte/LLM_Training_Pipeline: Modular text-generation training pipeline using PyTorch, Hugging Face, and... | | Experimental |
| 81 | AndrewKM210/micro-llm: A micro-LLM implementation that should be able to train and run on a standard laptop | | Experimental |
| 82 | hichamcharhi/llm-from-scratch: Implement an LLM from scratch in Python: an educational course in... | | Experimental |
| 83 | sugam24/LLM-From-Scratch: Building a Large Language Model from the ground up. | | Experimental |
| 84 | falseywinchnet/ConvexGPT: Mostly Convex ICNN based Large Language Model | | Experimental |
| 85 | insooeric/LLM_Small: Long Language Model from SCRATCH | | Experimental |
| 86 | Bhardwaj-Saurabh/Build_A_Small_Language_Model: A detail guide to build, train, fine-tune and deploye SLM. This repo... | | Experimental |
| 87 | teatonedev/Llama3.2-From-Scratch: This repository serves as an educational resource for building an... | | Experimental |
| 88 | omkarbg100/LLM-GPT: This is a LLM Model with full Architecture Coded ans is based On GPT-2 Weights | | Experimental |
| 89 | jvscursulim/build-a-llm-from-scratch: Studies about LLM | | Experimental |
| 90 | capecoder08/llm-playground: Playing with tokenizers, transformers, and LLMs | | Experimental |
| 91 | Samya-S/Building-LLMs-from-scratch: A hands-on guide to implementing Large Language Models from scratch | | Experimental |
| 92 | patrikwolf/ttt_theory: Specialization after Generalization | | Experimental |
| 93 | storytellingengineer/Learning_LLM_using_Pytorch: Hi, this repo is my learning to build LLMs from scratch \| Foundational... | | Experimental |
| 94 | Jkanishkha0305/LLMs-from-Scratch: A curated collection of Large Language Models(LLMs), Small Language... | | Experimental |
| 95 | jbergq/transformer-language-modeling: Notebook showing how to implement and train a transformer for language modeling | | Experimental |
| 96 | anafisa/Text2Text-Transformer: Large Multi-Language Models for News Translation | | Experimental |
| 97 | T-Mohamed-Shafeek/LLM-for-Language-Translation: This repository contains a simple and beginner-level noteboook which employs... | | Experimental |
| 98 | sasa5linkar/Spacy-Serbian-Transformer: Using Jertehs corpuses to train spacy with tranformer and LLM | | Experimental |
| 99 | aayushsharma-io/tensorix: Tensorix is a minimalistic and highly compatible Python library designed to... | | Experimental |
| 100 | gusye1234/gpt-adapter: Collections of algorithms for efficiently fine-tuning LLM | | Experimental |
| 101 | asiff00/Text-Generation-with-TensorFlow-from-Scratch: Text Generation with TensorFlow from Scratch | | Experimental |
| 102 | aaaastark/NBART-Multilingual-Translator: This repository contains a Python script that uses a pre-trained NBART... | | Experimental |