LLM Training Experimentation Transformer Models
Repositories for training, fine-tuning, and experimenting with large language models including tutorials, frameworks, and custom implementations. Does NOT include deployment tools, specific downstream applications (chatbots, summarization), or model evaluation/analysis.
There are 151 llm training experimentation models tracked. 2 score above 70 (verified tier). The highest-rated is PaddlePaddle/PaddleNLP at 79/100 with 12,929 stars and 41,348 monthly downloads. 2 of the top 10 are actively maintained.
Get all 151 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-training-experimentation&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
PaddlePaddle/PaddleNLP
Easy-to-use and powerful LLM and SLM library with awesome model zoo. |
|
Verified |
| 2 |
meta-llama/llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with... |
|
Verified |
| 3 |
arcee-ai/mergekit
Tools for merging pretrained large language models. |
|
Established |
| 4 |
changyeyu/LLM-RL-Visualized
๐100+ ๅๅ LLM / RL ๅ็ๅพ๐๏ผใๅคงๆจกๅ็ฎๆณใไฝ่ ๅทจ็ฎ๏ผ๐ฅ๏ผ100+ LLM/RL Algorithm Maps ๏ผ |
|
Established |
| 5 |
mindspore-lab/step_into_llm
MindSpore online courses: Step into LLM |
|
Established |
| 6 |
kyegomez/LFM2
A simple and minimal open source implementation of "Introducing LFM2: The... |
|
Established |
| 7 |
kyegomez/LFM
An open source implementation of LFMs from Liquid AI: Liquid Foundation Models |
|
Established |
| 8 |
BeastByteAI/scikit-llm
Seamlessly integrate LLMs into scikit-learn. |
|
Established |
| 9 |
ghimiresunil/LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing
LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best... |
|
Established |
| 10 |
IbrahimSobh/llms
Large Language Models: In this repository Language models are introduced... |
|
Established |
| 11 |
bobazooba/xllm
๐ฆ XโLLM: Cutting Edge & Easy LLM Finetuning |
|
Established |
| 12 |
Leeroo-AI/mergoo
A library for easily merging multiple LLM experts, and efficiently train the... |
|
Established |
| 13 |
r2d4/rellm
Exact structure out of any language model completion. |
|
Established |
| 14 |
iusztinpaul/hands-on-llms
๐ฆ ๐๐ฒ๐ฎ๐ฟ๐ป about ๐๐๐ ๐, ๐๐๐ ๐ข๐ฝ๐, and ๐๐ฒ๐ฐ๐๐ผ๐ฟ ๐๐๐ for free by designing, training,... |
|
Emerging |
| 15 |
socialfoundations/folktexts
Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on... |
|
Emerging |
| 16 |
datawhalechina/base-llm
ไป NLP ๅฐ LLM ็็ฎๆณๅ จๆ ๆ็จ๏ผๅจ็บฟ้ ่ฏปๅฐๅ๏ผhttps://datawhalechina.github.io/base-llm/ |
|
Emerging |
| 17 |
young-geng/EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for... |
|
Emerging |
| 18 |
Tzohar/PassLLM
World's most accurate password guessing AI tool. A PyTorch implementation of... |
|
Emerging |
| 19 |
HamedBabaei/LLMs4OM
LLMs4OM: Matching Ontologies with Large Language Models |
|
Emerging |
| 20 |
EvilFreelancer/impruver
A set of scripts and configurations for pretraining of Large Language Models (LLM) |
|
Emerging |
| 21 |
HamedBabaei/LLMs4OL
LLMs4OL:โ Large Language Models for Ontology Learning |
|
Emerging |
| 22 |
gjbex/Deploying-LLMs-locally
Material for a training on AI tools |
|
Emerging |
| 23 |
johnmai-dev/NotebookMLX
๐ NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama) |
|
Emerging |
| 24 |
souzatharsis/tamingLLMs
Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software |
|
Emerging |
| 25 |
declare-lab/red-instruct
Codes and datasets of the paper Red-Teaming Large Language Models using... |
|
Emerging |
| 26 |
hitz-zentroa/GoLLIE
Guideline following Large Language Model for Information Extraction |
|
Emerging |
| 27 |
SolomonB14D3/knowledge-fidelity
Behavioral auditing & repair toolkit for LLMs. Measures 8 dimensions via... |
|
Emerging |
| 28 |
janelu9/EasyLLM
Running Large Language Model easily. |
|
Emerging |
| 29 |
kyaiooiayk/Awesome-LLM-Large-Language-Models-Notes
What can I do with a LLM model? |
|
Emerging |
| 30 |
Curated-Awesome-Lists/awesome-llms-fine-tuning
Explore a comprehensive collection of resources, tutorials, papers, tools,... |
|
Emerging |
| 31 |
WhereIsAI/BiLLM
Tool for converting LLMs from uni-directional to bi-directional by removing... |
|
Emerging |
| 32 |
stylellm/stylellm_models
StyleLLMๆ้ฃๅคงๆจกๅ๏ผๅบไบๅคง่ฏญ่จๆจกๅ็ๆๆฌ้ฃๆ ผ่ฟ็งป้กน็ฎใText style transfer base on Large Language... |
|
Emerging |
| 33 |
coderonion/awesome-llm-and-aigc
๐๐๐A collection of some awesome public projects about Large Language... |
|
Emerging |
| 34 |
nrimsky/LM-exp
LLM experiments done during SERI MATS - focusing on activation steering /... |
|
Emerging |
| 35 |
virtualramblas/Domain-Specific-Small-Language-Models
Repository for the companion Colab notebook of the Domain-Specific Small... |
|
Emerging |
| 36 |
chanind/linear-relational
Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs)... |
|
Emerging |
| 37 |
PaddlePaddle/PALM
a Fast, Flexible, Extensible and Easy-to-use NLP Large-scale Pretraining and... |
|
Emerging |
| 38 |
dobriban/Principles-of-AI-LLMs
Materials for the course Principles of AI: LLMs at UPenn (Stat 9911, Spring... |
|
Emerging |
| 39 |
JayZhang42/SLED
SLED: Self Logits Evolution Decoding for Improving Factuality in Large... |
|
Emerging |
| 40 |
LISA-ITMO/LLM-resume-moderator
ะะฒัะพะผะฐัะธะทะธััะตั ะผะพะดะตัะฐัะธั ัะตะทัะผะต ะฝะฐ ััััะบะพะผ ัะทัะบะต ั ะฟะพะผะพััั LLM. ะะปั... |
|
Emerging |
| 41 |
ausboss/Local-LLM-Langchain
Load local LLMs effortlessly in a Jupyter notebook for testing purposes... |
|
Emerging |
| 42 |
ictnlp/TruthX
Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large... |
|
Emerging |
| 43 |
Jackksonns/CoVALend
CoVALend: a compliance-aware micro-lending default prediction pipeline with... |
|
Emerging |
| 44 |
JinXins/Awesome-Token-Merge-for-MLLMs
A paper list about Token Merge, Reduce, Resample, Drop for MLLMs. |
|
Emerging |
| 45 |
cahlen/conversation-dataset-generator
Craft conversational datasets (JSONL format with rich metadata) using LLMs.... |
|
Emerging |
| 46 |
danielsobrado/llm_notebooks
Concepts and examples on using and training LLMs |
|
Emerging |
| 47 |
rickiepark/the-lm-book
<๋๊ท๋ชจ ์ธ์ด ๋ชจ๋ธ, ํต์ฌ๋ง ๋น ๋ฅด๊ฒ!>(์ธ์ฌ์ดํธ, 2025)์ ์ฝ๋ ์ ์ฅ์ |
|
Emerging |
| 48 |
zwhe99/X-SIR
[ACL 2024] Can Watermarks Survive Translation? On the Cross-lingual... |
|
Experimental |
| 49 |
wschella/llm-reliability
Code for the paper "Larger and more instructable language models become less... |
|
Experimental |
| 50 |
lfunderburk/automate-tech-post
LLM application: fine tuned model to generate social media posts from... |
|
Experimental |
| 51 |
Furyton/awesome-language-model-analysis
This paper list focuses on the theoretical and empirical analysis of... |
|
Experimental |
| 52 |
apanariello4/merge-and-rebase
Model merging, task-vector rebasin, and fine-tuning for vision and LLM models. |
|
Experimental |
| 53 |
RobinSmits/Dutch-LLMs
Various training, inference and validation code and results related to Open... |
|
Experimental |
| 54 |
CLDiego/SPE_GeoHackathon_2025
Foundational bootcamp on LLM usage (prompting & inference) โ tooling &... |
|
Experimental |
| 55 |
CristiVlad25/ai-papers
Tracing the evolution of AI and large language models from early neural... |
|
Experimental |
| 56 |
an-yongqi/systematic-outliers
[ICLR 2025] Systematic Outliers in Large Language Models. |
|
Experimental |
| 57 |
kvignesh1420/cot-icl-lab
[ACL 2025] Official implementation of the "CoT-ICL Lab" framework |
|
Experimental |
| 58 |
crux82/u-deppllama
Dependency parsing with Large Language Models |
|
Experimental |
| 59 |
North-Shore-AI/tinkex_cookbook
Elixir port of tinker-cookbook: training and evaluation recipes for the... |
|
Experimental |
| 60 |
yubainu/sibainu-engine
Real-time hallucination detection for LLMs via Geometric Drift Analysis in... |
|
Experimental |
| 61 |
jacksonchen1998/LLaMA-Paper-List
Collection of papers using LLaMA as backbone model |
|
Experimental |
| 62 |
Basel-anaya/LoreWeaver
LoreWeaver is a Novel Generation Multimodal LLM based on Mistral 7B LLM |
|
Experimental |
| 63 |
piratheon/LiquidBunny-llm
A bunch of script to train your own offsec LLM |
|
Experimental |
| 64 |
piratheon/LB-llm_training_scripts
A bunch of script to train your own offsec LLM |
|
Experimental |
| 65 |
Koziev/LM-pretrain
Char-level language model pretraining code and scripts |
|
Experimental |
| 66 |
tripathiarpan20/self-improvement-4all
Private self-improvement coaching with open-source LLMs |
|
Experimental |
| 67 |
phonism/llm4cp
Large Language Model for Competitive Programming |
|
Experimental |
| 68 |
GovOn-Org/GovOn
On-device AI ๋ฏผ์ ์ฒ๋ฆฌ ๋ฐ ๋ถ์ ์์คํ | LLM ๊ฒฝ๋ํ & ํ์ธํ๋ | ํ์ฅ๋ฏธ๋ฌํ ์ฐ๊ณ ํ๋ก์ ํธ - ์ฐ์ ์ฒด ์์ ๊ธฐ๋ฐ ํ์ฅ ์ค๋ฌด ์ญ๋ ๊ฐํ |
|
Experimental |
| 69 |
bosszii2709/ai-dataset-generator
๐ค Generate tailored AI training datasets quickly and easily, transforming... |
|
Experimental |
| 70 |
mickymultani/LLM-Architecture
Visualize some important concepts related to LLM architectures. |
|
Experimental |
| 71 |
christopherdanie/GovOn
Develop an on-device AI system that processes and analyzes complaints using... |
|
Experimental |
| 72 |
LaxmanNandi/MCH-Research
Conservation law for LLM context sensitivity: ฮRCI ร Var_Ratio โ K(domain).... |
|
Experimental |
| 73 |
Betswish/Cross-Lingual-Consistency
Easy-to-use framework for evaluating cross-lingual consistency of factual... |
|
Experimental |
| 74 |
mantzaris/KeemenaLM.jl
Language Models in Julia lang (transformers/GPT/decoders/chat etc) |
|
Experimental |
| 75 |
tehw0lf/writing-style-analyzer
Analyze and profile writing styles in German and English text using local... |
|
Experimental |
| 76 |
shuhulx/MergeLens
Pre-merge diagnostic framework for LLM model merging โ analyze... |
|
Experimental |
| 77 |
j341nono/llemb
Unified embedding extraction for decoder-only LLMs with support for pooling... |
|
Experimental |
| 78 |
igorbenav/practical-language-models
An open book that teaches language models starting from the learning problem... |
|
Experimental |
| 79 |
JianxXiong/AAPO
Implementation of AAPO (Arxiv: 2505.14264v2) paper |
|
Experimental |
| 80 |
ChanLiang/CONNER
[EMNLP 2023] Beyond Factuality: A Comprehensive Evaluation of Large Language... |
|
Experimental |
| 81 |
ictnlp/LSG
The code for AAAI 2025 โLarge Language Models Are Read/Write Policy-Makers... |
|
Experimental |
| 82 |
SolomonB14D3/confidence-cartography-toolkit
Teacher-forced confidence analysis for language models. pip install... |
|
Experimental |
| 83 |
hitz-zentroa/This-is-not-a-Dataset
We introduce a large semi-automatically generated dataset of ~400,000... |
|
Experimental |
| 84 |
twitter-research/lmsoc
Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining |
|
Experimental |
| 85 |
isaacus-dev/terge
An easy-to-use Python library for merging PyTorch models. |
|
Experimental |
| 86 |
ExplainableML/in-context-impersonation
[NeurIPS 2023 Spotlight] In-Context Impersonation Reveals Large Language... |
|
Experimental |
| 87 |
U4RASD/dalla-model-training
Dalla training recipe using Huggingface SFT trainer |
|
Experimental |
| 88 |
hquzhuguofeng/LLM-RoadMap
โญ๏ธโญ๏ธโญ๏ธLLMs RoadMap๏ผๅธฎๅฉๅไฝไปtransformersไปๅบ่ง่งไบ่งฃNLPไผ ็ปไปปๅก๏ผๆจกๅ้ซๆๅพฎ่ฐ๏ผไฝ็ฒพๅบฆๅพฎ่ฐ๏ผๅๅธๅผๆจกๅ่ฎญ็ป็ญๅทฅ็จๅ ๅฎน |
|
Experimental |
| 89 |
HROlive/Deep-Learning-Week
This 5 day online course was co-organised by LRZ and NVIDIA Deep Learning... |
|
Experimental |
| 90 |
kyegomez/ai-reading-list
This collection brings together the highest-signal research papers in modern... |
|
Experimental |
| 91 |
mirulili/3Ch-Jamo-Watermark
Capstone Project 2025 (Yonsei Univ.) |
|
Experimental |
| 92 |
VARUN3WARE/pplm-watermark
A research implementation of statistical text watermarking for large... |
|
Experimental |
| 93 |
julienbrasseur/llm-hallucination-detector
A lightweight library for extracting and analysing LLM internal representations |
|
Experimental |
| 94 |
machinelearningzuu/experiments-on-large-language-models
This Repository Contains Different Experiments on LLMs with Hugging Face,... |
|
Experimental |
| 95 |
HKUNLP/multilingual-transfer
Code for paper โLanguage Versatilists vs. Specialists: An Empirical... |
|
Experimental |
| 96 |
h3nock/ai-deep-dive
An open-source interactive learning platform for understanding LLMs through... |
|
Experimental |
| 97 |
Yash-Kavaiya/30-Days-LLM-Mastery-Course
30-Days-LLM-Mastery-Course: A comprehensive, hands-on course diving deep... |
|
Experimental |
| 98 |
juancmacias/Small_Lenguage_Model
Pรญldora formativa sobre SLM (Small Lenguage Model) |
|
Experimental |
| 99 |
NLPForUA/ZNO
Structured test tasks and model tuning scripts for multiple subjects from... |
|
Experimental |
| 100 |
augstentatious/TRuCAL
TRuCAL: Truth-Recursive universal Correction Attention Layer An open-source... |
|
Experimental |
| 101 |
Aminbcf/LLM-Polished-Version
This is lighter version of the llm i built as part pf my intership at expert... |
|
Experimental |
| 102 |
chazciii/rd-net
Inference-time drift experiment demonstrating reduced repetition collapse in... |
|
Experimental |
| 103 |
rraghavkaushik/NLP-Reading-List
A curated collection of NLP and LLM resources. Covers essential papers and... |
|
Experimental |
| 104 |
AlinaMustaqeem/open-LLM
Kickstart with LLMs |
|
Experimental |
| 105 |
jwliao1209/TWLLM-Tutor
๐ Taiwan-LLM Tutor: Large Language Models for Taiwanese Secondary Education |
|
Experimental |
| 106 |
nexageapps/LLM
Hands-on notebooks to understand and build Large Language Models (LLMs) from... |
|
Experimental |
| 107 |
ivangabriele-playground/Trump-0.0-minus42B
A really dumb and opinionated LLM โ exclusively trained on Donald J. Trump's... |
|
Experimental |
| 108 |
dettinjo/LLM-Fact-Auditor
A post-processing pipeline to fact-check, entity-link, and verify answers... |
|
Experimental |
| 109 |
S1LV3RJ1NX/mal-code
This repository contains the code for all the book that I am writing `My... |
|
Experimental |
| 110 |
NJUxlj/llm-hub
Popular Large Language Model's modeling file and finetune+pretrain scripts,... |
|
Experimental |
| 111 |
maximkha/The_Race_for_Intelligent_AI
An article that describes the current state of AI and the next steps to... |
|
Experimental |
| 112 |
one-some/lazy-transformers-merge
Merge transformers without using like a bajillion GB of RAM |
|
Experimental |
| 113 |
samratrajsharma/LLMs
Experimental implementations of core Large Language Model components... |
|
Experimental |
| 114 |
NLPForUA/UA-LLM
The entry point for adapting, training, evaluating, and leveraging various... |
|
Experimental |
| 115 |
HEMANGANI/LLM-Recommendation-Systems
This project fine-tunes large language models (LLMs) for text-based... |
|
Experimental |
| 116 |
ewdlop/LMNotes
Language model |
|
Experimental |
| 117 |
ekunnii/adversarial-feedback-chatbot
EMNLP 2020 finding paper "Learning Improvised Chatbots from Adversarial... |
|
Experimental |
| 118 |
tph-kds/vqa-llm
A Based Large Language Model (LLM) for VQA based on a custom model applying... |
|
Experimental |
| 119 |
CyberMaryVer/llm-notebooks
All the tutorials related to LLM |
|
Experimental |
| 120 |
crux82/advances-in-ai-2024
Materials used during the Lecture about LLMs held in the Summer School... |
|
Experimental |
| 121 |
anakin87/llama2-haystack
Using Llama2 with Haystack, the NLP/LLM framework. |
|
Experimental |
| 122 |
raideno/awesome-motion
A curated list of motion related resources. |
|
Experimental |
| 123 |
Itadori91/best-of-ai-open-source
Curated collection of 150+ exceptional open-source AI projects with a... |
|
Experimental |
| 124 |
NotShrirang/LLM-Garden
Implementing different LLM architectures in single repo |
|
Experimental |
| 125 |
SolomonB14D3/confidence-cartography
Teacher-forced confidence as a false-belief sensor for language models. |
|
Experimental |
| 126 |
Da9TH5e/PyPilot
A ๐๐ข๐ง๐ข-๐๐ ๐๐ฌ๐ฌ๐ข๐ฌ๐ญ๐๐ง๐ญ but in a python package for now (โ ๏ธ ๐ด๐ต๐ช๐ญ๐ญ ๐ช๐ฏ ๐ฆ๐ข๐ณ๐ญ๐บ ๐ฅ๐ฆ๐ท๐ฆ๐ญ๐ฐ๐ฑ๐ฎ๐ฆ๐ฏ๐ต) |
|
Experimental |
| 127 |
FawwazAhmd/msc-group-project
MSc group project evaluating instruction-tuned LLMs for legal clause... |
|
Experimental |
| 128 |
nath54/ChunkedDiffusion_LLM
Chunked Diffusion LLM is an innovative machine learning project exploring a... |
|
Experimental |
| 129 |
kaustpradalab/LLM-sycophancy
[AAAI'26 Main๐] Official code of "When Truth Is Overridden: Uncovering the... |
|
Experimental |
| 130 |
Anonym0usWork1221/JaraConverse-TransformersBased
This JaraConverse model is a cutting-edge Transformer-based supervised... |
|
Experimental |
| 131 |
mattzzz/shakeLLM
Exploration of LLMs using complete works of Shakespeare |
|
Experimental |
| 132 |
wahab-cide/african_languages_llm_project
Training multilingual language models on African languages including... |
|
Experimental |
| 133 |
avirupc/nlp
A curated collection of my learning path in NLP and LLMs. Contains my notes,... |
|
Experimental |
| 134 |
Blue-No1/open-weight-collection
Tracking open-weight LLMs for research, experiments, and inference comparisons. |
|
Experimental |
| 135 |
Adityaram0001/LLM-DeepLearning
A deep dive into the theory and practice of Large Language Models. This... |
|
Experimental |
| 136 |
priyanshujiiii/awesome_LLM
A curated list of papers, datasets, and resources on Large Language Models (LLMs) |
|
Experimental |
| 137 |
maris205/DNAHL
DNAHL Model- DNA sequenceย andย Human Language mixed large language model |
|
Experimental |
| 138 |
CAI991108/Machine-Learning-and-Language-Model
This project explores GPT-2 and Llama models through pre-training,... |
|
Experimental |
| 139 |
gokhaneraslan/llm-dataset-generator
Custom dataset generator from text and pdf |
|
Experimental |
| 140 |
Blue-No1/llm-research-notes
Notes & experiments on LLMs, open-weight models, multimodal systems, and... |
|
Experimental |
| 141 |
Shehrozkashif/AI-For-Organizations
Mitigating Hellucination in Private LLMs |
|
Experimental |
| 142 |
minorprojects/Stable-CAT
Stable Causal Attention Transformer(StableCAT) is a tiny, minimal modern ... |
|
Experimental |
| 143 |
Skwert001/hlft-legality-engine
Legality-gated evaluation for LLMs, a structural fix for hallucinations that... |
|
Experimental |
| 144 |
Alvaro8gb/Pheno-LLM
Step-forward structuring disease phenotypic entities with LLMs for disease... |
|
Experimental |
| 145 |
Francesco-Sovrano/llms_for_vulnerability_detection_are_lost_in_the_end
Replication package of the paper 'Large Language Models for In-File... |
|
Experimental |
| 146 |
mukeshmithrakumar/LLM-POC-2024
Popular Large Language Models from scratch - 2024 |
|
Experimental |
| 147 |
BjornMelin/nlp-engineering-hub
๐ Enterprise NLP systems and LLM applications. Features custom language... |
|
Experimental |
| 148 |
priyanka387/LangChain-Vector-Databases-in-Production
LLMs are deep learning models with billions of parameters that excel at a... |
|
Experimental |
| 149 |
TimKoornstra/learn-like-an-llm
Learn Like An LLM is an interactive tool that helps users understand... |
|
Experimental |
| 150 |
2006coder/LLMs-words-defs-vs-dictionaries-defs
evaluate AI's integrity |
|
Experimental |
| 151 |
thanoskaravangelis/llm-experimentation
Large Languade Model local chat in a Docker container, plus some NLP and... |
|
Experimental |