LLM Training Experimentation Transformer Models

Repositories for training, fine-tuning, and experimenting with large language models including tutorials, frameworks, and custom implementations. Does NOT include deployment tools, specific downstream applications (chatbots, summarization), or model evaluation/analysis.

There are 151 llm training experimentation models tracked. 2 score above 70 (verified tier). The highest-rated is PaddlePaddle/PaddleNLP at 79/100 with 12,929 stars and 41,348 monthly downloads. 2 of the top 10 are actively maintained.

Get all 151 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-training-experimentation&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 PaddlePaddle/PaddleNLP

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

79
Verified
2 meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with...

73
Verified
3 arcee-ai/mergekit

Tools for merging pretrained large language models.

59
Established
4 changyeyu/LLM-RL-Visualized

๐ŸŒŸ100+ ๅŽŸๅˆ› LLM / RL ๅŽŸ็†ๅ›พ๐Ÿ“š๏ผŒใ€Šๅคงๆจกๅž‹็ฎ—ๆณ•ใ€‹ไฝœ่€…ๅทจ็Œฎ๏ผ๐Ÿ’ฅ๏ผˆ100+ LLM/RL Algorithm Maps ๏ผ‰

58
Established
5 mindspore-lab/step_into_llm

MindSpore online courses: Step into LLM

57
Established
6 kyegomez/LFM2

A simple and minimal open source implementation of "Introducing LFM2: The...

56
Established
7 kyegomez/LFM

An open source implementation of LFMs from Liquid AI: Liquid Foundation Models

56
Established
8 BeastByteAI/scikit-llm

Seamlessly integrate LLMs into scikit-learn.

55
Established
9 ghimiresunil/LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing

LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best...

54
Established
10 IbrahimSobh/llms

Large Language Models: In this repository Language models are introduced...

52
Established
11 bobazooba/xllm

๐Ÿฆ– Xโ€”LLM: Cutting Edge & Easy LLM Finetuning

52
Established
12 Leeroo-AI/mergoo

A library for easily merging multiple LLM experts, and efficiently train the...

52
Established
13 r2d4/rellm

Exact structure out of any language model completion.

50
Established
14 iusztinpaul/hands-on-llms

๐Ÿฆ– ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป about ๐—Ÿ๐—Ÿ๐— ๐˜€, ๐—Ÿ๐—Ÿ๐— ๐—ข๐—ฝ๐˜€, and ๐˜ƒ๐—ฒ๐—ฐ๐˜๐—ผ๐—ฟ ๐——๐—•๐˜€ for free by designing, training,...

49
Emerging
15 socialfoundations/folktexts

Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on...

48
Emerging
16 datawhalechina/base-llm

ไปŽ NLP ๅˆฐ LLM ็š„็ฎ—ๆณ•ๅ…จๆ ˆๆ•™็จ‹๏ผŒๅœจ็บฟ้˜…่ฏปๅœฐๅ€๏ผšhttps://datawhalechina.github.io/base-llm/

46
Emerging
17 young-geng/EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for...

46
Emerging
18 Tzohar/PassLLM

World's most accurate password guessing AI tool. A PyTorch implementation of...

45
Emerging
19 HamedBabaei/LLMs4OM

LLMs4OM: Matching Ontologies with Large Language Models

42
Emerging
20 EvilFreelancer/impruver

A set of scripts and configurations for pretraining of Large Language Models (LLM)

42
Emerging
21 HamedBabaei/LLMs4OL

LLMs4OL:โ€Œ Large Language Models for Ontology Learning

41
Emerging
22 gjbex/Deploying-LLMs-locally

Material for a training on AI tools

41
Emerging
23 johnmai-dev/NotebookMLX

๐Ÿ“‹ NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)

40
Emerging
24 souzatharsis/tamingLLMs

Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software

40
Emerging
25 declare-lab/red-instruct

Codes and datasets of the paper Red-Teaming Large Language Models using...

39
Emerging
26 hitz-zentroa/GoLLIE

Guideline following Large Language Model for Information Extraction

39
Emerging
27 SolomonB14D3/knowledge-fidelity

Behavioral auditing & repair toolkit for LLMs. Measures 8 dimensions via...

39
Emerging
28 janelu9/EasyLLM

Running Large Language Model easily.

39
Emerging
29 kyaiooiayk/Awesome-LLM-Large-Language-Models-Notes

What can I do with a LLM model?

38
Emerging
30 Curated-Awesome-Lists/awesome-llms-fine-tuning

Explore a comprehensive collection of resources, tutorials, papers, tools,...

38
Emerging
31 WhereIsAI/BiLLM

Tool for converting LLMs from uni-directional to bi-directional by removing...

38
Emerging
32 stylellm/stylellm_models

StyleLLMๆ–‡้ฃŽๅคงๆจกๅž‹๏ผšๅŸบไบŽๅคง่ฏญ่จ€ๆจกๅž‹็š„ๆ–‡ๆœฌ้ฃŽๆ ผ่ฟ็งป้กน็›ฎใ€‚Text style transfer base on Large Language...

38
Emerging
33 coderonion/awesome-llm-and-aigc

๐Ÿš€๐Ÿš€๐Ÿš€A collection of some awesome public projects about Large Language...

37
Emerging
34 nrimsky/LM-exp

LLM experiments done during SERI MATS - focusing on activation steering /...

37
Emerging
35 virtualramblas/Domain-Specific-Small-Language-Models

Repository for the companion Colab notebook of the Domain-Specific Small...

37
Emerging
36 chanind/linear-relational

Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs)...

36
Emerging
37 PaddlePaddle/PALM

a Fast, Flexible, Extensible and Easy-to-use NLP Large-scale Pretraining and...

36
Emerging
38 dobriban/Principles-of-AI-LLMs

Materials for the course Principles of AI: LLMs at UPenn (Stat 9911, Spring...

36
Emerging
39 JayZhang42/SLED

SLED: Self Logits Evolution Decoding for Improving Factuality in Large...

35
Emerging
40 LISA-ITMO/LLM-resume-moderator

ะะฒั‚ะพะผะฐั‚ะธะทะธั€ัƒะตั‚ ะผะพะดะตั€ะฐั†ะธัŽ ั€ะตะทัŽะผะต ะฝะฐ ั€ัƒััะบะพะผ ัะทั‹ะบะต ั ะฟะพะผะพั‰ัŒัŽ LLM. ะ”ะปั...

35
Emerging
41 ausboss/Local-LLM-Langchain

Load local LLMs effortlessly in a Jupyter notebook for testing purposes...

34
Emerging
42 ictnlp/TruthX

Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large...

34
Emerging
43 Jackksonns/CoVALend

CoVALend: a compliance-aware micro-lending default prediction pipeline with...

33
Emerging
44 JinXins/Awesome-Token-Merge-for-MLLMs

A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.

31
Emerging
45 cahlen/conversation-dataset-generator

Craft conversational datasets (JSONL format with rich metadata) using LLMs....

30
Emerging
46 danielsobrado/llm_notebooks

Concepts and examples on using and training LLMs

30
Emerging
47 rickiepark/the-lm-book

<๋Œ€๊ทœ๋ชจ ์–ธ์–ด ๋ชจ๋ธ, ํ•ต์‹ฌ๋งŒ ๋น ๋ฅด๊ฒŒ!>(์ธ์‚ฌ์ดํŠธ, 2025)์˜ ์ฝ”๋“œ ์ €์žฅ์†Œ

30
Emerging
48 zwhe99/X-SIR

[ACL 2024] Can Watermarks Survive Translation? On the Cross-lingual...

29
Experimental
49 wschella/llm-reliability

Code for the paper "Larger and more instructable language models become less...

29
Experimental
50 lfunderburk/automate-tech-post

LLM application: fine tuned model to generate social media posts from...

28
Experimental
51 Furyton/awesome-language-model-analysis

This paper list focuses on the theoretical and empirical analysis of...

28
Experimental
52 apanariello4/merge-and-rebase

Model merging, task-vector rebasin, and fine-tuning for vision and LLM models.

28
Experimental
53 RobinSmits/Dutch-LLMs

Various training, inference and validation code and results related to Open...

27
Experimental
54 CLDiego/SPE_GeoHackathon_2025

Foundational bootcamp on LLM usage (prompting & inference) โ†’ tooling &...

27
Experimental
55 CristiVlad25/ai-papers

Tracing the evolution of AI and large language models from early neural...

27
Experimental
56 an-yongqi/systematic-outliers

[ICLR 2025] Systematic Outliers in Large Language Models.

27
Experimental
57 kvignesh1420/cot-icl-lab

[ACL 2025] Official implementation of the "CoT-ICL Lab" framework

27
Experimental
58 crux82/u-deppllama

Dependency parsing with Large Language Models

26
Experimental
59 North-Shore-AI/tinkex_cookbook

Elixir port of tinker-cookbook: training and evaluation recipes for the...

26
Experimental
60 yubainu/sibainu-engine

Real-time hallucination detection for LLMs via Geometric Drift Analysis in...

25
Experimental
61 jacksonchen1998/LLaMA-Paper-List

Collection of papers using LLaMA as backbone model

25
Experimental
62 Basel-anaya/LoreWeaver

LoreWeaver is a Novel Generation Multimodal LLM based on Mistral 7B LLM

24
Experimental
63 piratheon/LiquidBunny-llm

A bunch of script to train your own offsec LLM

24
Experimental
64 piratheon/LB-llm_training_scripts

A bunch of script to train your own offsec LLM

24
Experimental
65 Koziev/LM-pretrain

Char-level language model pretraining code and scripts

24
Experimental
66 tripathiarpan20/self-improvement-4all

Private self-improvement coaching with open-source LLMs

24
Experimental
67 phonism/llm4cp

Large Language Model for Competitive Programming

23
Experimental
68 GovOn-Org/GovOn

On-device AI ๋ฏผ์› ์ฒ˜๋ฆฌ ๋ฐ ๋ถ„์„ ์‹œ์Šคํ…œ | LLM ๊ฒฝ๋Ÿ‰ํ™” & ํŒŒ์ธํŠœ๋‹ | ํ˜„์žฅ๋ฏธ๋Ÿฌํ˜• ์—ฐ๊ณ„ ํ”„๋กœ์ ํŠธ - ์‚ฐ์—…์ฒด ์ˆ˜์š” ๊ธฐ๋ฐ˜ ํ˜„์žฅ ์‹ค๋ฌด ์—ญ๋Ÿ‰ ๊ฐ•ํ™”

23
Experimental
69 bosszii2709/ai-dataset-generator

๐Ÿค– Generate tailored AI training datasets quickly and easily, transforming...

23
Experimental
70 mickymultani/LLM-Architecture

Visualize some important concepts related to LLM architectures.

23
Experimental
71 christopherdanie/GovOn

Develop an on-device AI system that processes and analyzes complaints using...

22
Experimental
72 LaxmanNandi/MCH-Research

Conservation law for LLM context sensitivity: ฮ”RCI ร— Var_Ratio โ‰ˆ K(domain)....

22
Experimental
73 Betswish/Cross-Lingual-Consistency

Easy-to-use framework for evaluating cross-lingual consistency of factual...

22
Experimental
74 mantzaris/KeemenaLM.jl

Language Models in Julia lang (transformers/GPT/decoders/chat etc)

22
Experimental
75 tehw0lf/writing-style-analyzer

Analyze and profile writing styles in German and English text using local...

22
Experimental
76 shuhulx/MergeLens

Pre-merge diagnostic framework for LLM model merging โ€” analyze...

22
Experimental
77 j341nono/llemb

Unified embedding extraction for decoder-only LLMs with support for pooling...

21
Experimental
78 igorbenav/practical-language-models

An open book that teaches language models starting from the learning problem...

21
Experimental
79 JianxXiong/AAPO

Implementation of AAPO (Arxiv: 2505.14264v2) paper

21
Experimental
80 ChanLiang/CONNER

[EMNLP 2023] Beyond Factuality: A Comprehensive Evaluation of Large Language...

21
Experimental
81 ictnlp/LSG

The code for AAAI 2025 โ€œLarge Language Models Are Read/Write Policy-Makers...

20
Experimental
82 SolomonB14D3/confidence-cartography-toolkit

Teacher-forced confidence analysis for language models. pip install...

20
Experimental
83 hitz-zentroa/This-is-not-a-Dataset

We introduce a large semi-automatically generated dataset of ~400,000...

20
Experimental
84 twitter-research/lmsoc

Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining

20
Experimental
85 isaacus-dev/terge

An easy-to-use Python library for merging PyTorch models.

20
Experimental
86 ExplainableML/in-context-impersonation

[NeurIPS 2023 Spotlight] In-Context Impersonation Reveals Large Language...

19
Experimental
87 U4RASD/dalla-model-training

Dalla training recipe using Huggingface SFT trainer

19
Experimental
88 hquzhuguofeng/LLM-RoadMap

โญ๏ธโญ๏ธโญ๏ธLLMs RoadMap๏ผŒๅธฎๅŠฉๅ„ไฝไปŽtransformersไป“ๅบ“่ง†่ง’ไบ†่งฃNLPไผ ็ปŸไปปๅŠก๏ผŒๆจกๅž‹้ซ˜ๆ•ˆๅพฎ่ฐƒ๏ผŒไฝŽ็ฒพๅบฆๅพฎ่ฐƒ๏ผŒๅˆ†ๅธƒๅผๆจกๅž‹่ฎญ็ปƒ็ญ‰ๅทฅ็จ‹ๅ†…ๅฎน

19
Experimental
89 HROlive/Deep-Learning-Week

This 5 day online course was co-organised by LRZ and NVIDIA Deep Learning...

19
Experimental
90 kyegomez/ai-reading-list

This collection brings together the highest-signal research papers in modern...

19
Experimental
91 mirulili/3Ch-Jamo-Watermark

Capstone Project 2025 (Yonsei Univ.)

19
Experimental
92 VARUN3WARE/pplm-watermark

A research implementation of statistical text watermarking for large...

19
Experimental
93 julienbrasseur/llm-hallucination-detector

A lightweight library for extracting and analysing LLM internal representations

19
Experimental
94 machinelearningzuu/experiments-on-large-language-models

This Repository Contains Different Experiments on LLMs with Hugging Face,...

18
Experimental
95 HKUNLP/multilingual-transfer

Code for paper โ€Language Versatilists vs. Specialists: An Empirical...

17
Experimental
96 h3nock/ai-deep-dive

An open-source interactive learning platform for understanding LLMs through...

17
Experimental
97 Yash-Kavaiya/30-Days-LLM-Mastery-Course

30-Days-LLM-Mastery-Course: A comprehensive, hands-on course diving deep...

17
Experimental
98 juancmacias/Small_Lenguage_Model

Pรญldora formativa sobre SLM (Small Lenguage Model)

17
Experimental
99 NLPForUA/ZNO

Structured test tasks and model tuning scripts for multiple subjects from...

16
Experimental
100 augstentatious/TRuCAL

TRuCAL: Truth-Recursive universal Correction Attention Layer An open-source...

16
Experimental
101 Aminbcf/LLM-Polished-Version

This is lighter version of the llm i built as part pf my intership at expert...

16
Experimental
102 chazciii/rd-net

Inference-time drift experiment demonstrating reduced repetition collapse in...

16
Experimental
103 rraghavkaushik/NLP-Reading-List

A curated collection of NLP and LLM resources. Covers essential papers and...

16
Experimental
104 AlinaMustaqeem/open-LLM

Kickstart with LLMs

16
Experimental
105 jwliao1209/TWLLM-Tutor

๐Ÿ“˜ Taiwan-LLM Tutor: Large Language Models for Taiwanese Secondary Education

15
Experimental
106 nexageapps/LLM

Hands-on notebooks to understand and build Large Language Models (LLMs) from...

15
Experimental
107 ivangabriele-playground/Trump-0.0-minus42B

A really dumb and opinionated LLM โ€” exclusively trained on Donald J. Trump's...

15
Experimental
108 dettinjo/LLM-Fact-Auditor

A post-processing pipeline to fact-check, entity-link, and verify answers...

15
Experimental
109 S1LV3RJ1NX/mal-code

This repository contains the code for all the book that I am writing `My...

15
Experimental
110 NJUxlj/llm-hub

Popular Large Language Model's modeling file and finetune+pretrain scripts,...

15
Experimental
111 maximkha/The_Race_for_Intelligent_AI

An article that describes the current state of AI and the next steps to...

15
Experimental
112 one-some/lazy-transformers-merge

Merge transformers without using like a bajillion GB of RAM

14
Experimental
113 samratrajsharma/LLMs

Experimental implementations of core Large Language Model components...

14
Experimental
114 NLPForUA/UA-LLM

The entry point for adapting, training, evaluating, and leveraging various...

14
Experimental
115 HEMANGANI/LLM-Recommendation-Systems

This project fine-tunes large language models (LLMs) for text-based...

14
Experimental
116 ewdlop/LMNotes

Language model

13
Experimental
117 ekunnii/adversarial-feedback-chatbot

EMNLP 2020 finding paper "Learning Improvised Chatbots from Adversarial...

13
Experimental
118 tph-kds/vqa-llm

A Based Large Language Model (LLM) for VQA based on a custom model applying...

13
Experimental
119 CyberMaryVer/llm-notebooks

All the tutorials related to LLM

12
Experimental
120 crux82/advances-in-ai-2024

Materials used during the Lecture about LLMs held in the Summer School...

12
Experimental
121 anakin87/llama2-haystack

Using Llama2 with Haystack, the NLP/LLM framework.

12
Experimental
122 raideno/awesome-motion

A curated list of motion related resources.

12
Experimental
123 Itadori91/best-of-ai-open-source

Curated collection of 150+ exceptional open-source AI projects with a...

12
Experimental
124 NotShrirang/LLM-Garden

Implementing different LLM architectures in single repo

12
Experimental
125 SolomonB14D3/confidence-cartography

Teacher-forced confidence as a false-belief sensor for language models.

12
Experimental
126 Da9TH5e/PyPilot

A ๐Œ๐ข๐ง๐ข-๐€๐ˆ ๐€๐ฌ๐ฌ๐ข๐ฌ๐ญ๐š๐ง๐ญ but in a python package for now (โš ๏ธŽ ๐˜ด๐˜ต๐˜ช๐˜ญ๐˜ญ ๐˜ช๐˜ฏ ๐˜ฆ๐˜ข๐˜ณ๐˜ญ๐˜บ ๐˜ฅ๐˜ฆ๐˜ท๐˜ฆ๐˜ญ๐˜ฐ๐˜ฑ๐˜ฎ๐˜ฆ๐˜ฏ๐˜ต)

12
Experimental
127 FawwazAhmd/msc-group-project

MSc group project evaluating instruction-tuned LLMs for legal clause...

12
Experimental
128 nath54/ChunkedDiffusion_LLM

Chunked Diffusion LLM is an innovative machine learning project exploring a...

12
Experimental
129 kaustpradalab/LLM-sycophancy

[AAAI'26 Main๐ŸŽ‰] Official code of "When Truth Is Overridden: Uncovering the...

11
Experimental
130 Anonym0usWork1221/JaraConverse-TransformersBased

This JaraConverse model is a cutting-edge Transformer-based supervised...

11
Experimental
131 mattzzz/shakeLLM

Exploration of LLMs using complete works of Shakespeare

11
Experimental
132 wahab-cide/african_languages_llm_project

Training multilingual language models on African languages including...

11
Experimental
133 avirupc/nlp

A curated collection of my learning path in NLP and LLMs. Contains my notes,...

11
Experimental
134 Blue-No1/open-weight-collection

Tracking open-weight LLMs for research, experiments, and inference comparisons.

11
Experimental
135 Adityaram0001/LLM-DeepLearning

A deep dive into the theory and practice of Large Language Models. This...

11
Experimental
136 priyanshujiiii/awesome_LLM

A curated list of papers, datasets, and resources on Large Language Models (LLMs)

11
Experimental
137 maris205/DNAHL

DNAHL Model- DNA sequenceย andย Human Language mixed large language model

11
Experimental
138 CAI991108/Machine-Learning-and-Language-Model

This project explores GPT-2 and Llama models through pre-training,...

11
Experimental
139 gokhaneraslan/llm-dataset-generator

Custom dataset generator from text and pdf

11
Experimental
140 Blue-No1/llm-research-notes

Notes & experiments on LLMs, open-weight models, multimodal systems, and...

11
Experimental
141 Shehrozkashif/AI-For-Organizations

Mitigating Hellucination in Private LLMs

11
Experimental
142 minorprojects/Stable-CAT

Stable Causal Attention Transformer(StableCAT) is a tiny, minimal modern ...

11
Experimental
143 Skwert001/hlft-legality-engine

Legality-gated evaluation for LLMs, a structural fix for hallucinations that...

11
Experimental
144 Alvaro8gb/Pheno-LLM

Step-forward structuring disease phenotypic entities with LLMs for disease...

10
Experimental
145 Francesco-Sovrano/llms_for_vulnerability_detection_are_lost_in_the_end

Replication package of the paper 'Large Language Models for In-File...

10
Experimental
146 mukeshmithrakumar/LLM-POC-2024

Popular Large Language Models from scratch - 2024

10
Experimental
147 BjornMelin/nlp-engineering-hub

๐Ÿ“š Enterprise NLP systems and LLM applications. Features custom language...

10
Experimental
148 priyanka387/LangChain-Vector-Databases-in-Production

LLMs are deep learning models with billions of parameters that excel at a...

10
Experimental
149 TimKoornstra/learn-like-an-llm

Learn Like An LLM is an interactive tool that helps users understand...

10
Experimental
150 2006coder/LLMs-words-defs-vs-dictionaries-defs

evaluate AI's integrity

10
Experimental
151 thanoskaravangelis/llm-experimentation

Large Languade Model local chat in a Docker container, plus some NLP and...

10
Experimental