LLM Scaling Architecture Transformer Models

There are 56 LLM scaling architecture models tracked. Three score above 50 (Established tier). The highest-rated is jncraton/languagemodels at 60/100, with 1,197 stars and 588 monthly downloads.

Get all 56 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-scaling-architecture&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
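The same request can be made from Python using only the standard library. This is a minimal sketch: the URL and query parameters are copied from the curl command above, while the function names `build_url` and `fetch_projects` are illustrative. The shape of the JSON response is not documented here, so the sketch returns the decoded payload as-is, and whether `limit=20` caps the result at 20 entries is likewise an assumption to verify against the API.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl example; everything else here is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="transformers", subcategory="llm-scaling-architecture", limit=20):
    """Assemble the query URL with the same parameters as the curl example."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Send a GET request and decode the JSON body.

    The response schema is not described on this page, so the decoded
    object is returned unchanged for the caller to inspect.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Usage (performs a network request, which counts against the daily quota):
# data = fetch_projects(build_url())
# print(type(data))
```

Keeping URL construction separate from the network call makes it easy to change `limit` or `subcategory` without touching the request logic.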

#	Repository and description	Score (/100)	Tier	Stars	Language
1	jncraton/languagemodels Explore large language models in 512MB of RAM	60	Established	1,197	HTML
2	microsoft/unilm Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities	57	Established	22,042	Python
3	albertan017/LLM4Decompile Reverse Engineering: Decompiling Binary Code with Large Language Models	54	Established	6,407	Python
4	haizelabs/verdict Inference-time scaling for LLMs-as-a-judge.	48	Emerging	332	Jupyter Notebook
5	bytedance/Sa2VA Official Repo For Pixel-LLM Codebase	47	Emerging	1,558	Python
6	JIA-Lab-research/LISA Project Page for "LISA: Reasoning Segmentation via Large Language Model"	45	Emerging	2,604	Python
7	Tencent-Hunyuan/GradLoc Implementation of GradLoc from the Tencent Hunyuan blog "Stabilizing RLVR...	43	Emerging	89	Python
8	yang-ai-lab/SleepLM SleepLM: Natural-Language Intelligence for Human Sleep	41	Emerging	29	Jupyter Notebook
9	Cardinal-Operations/ORLM ORLM: Training Large Language Models for Optimization Modeling	40	Emerging	237	Python
10	sinanuozdemir/oreilly-optimizing-llms Optimizing LLMs with Fine-Tuning and Prompt Engineering	39	Emerging	88	Jupyter Notebook
11	Victorwz/LongMem Official implementation of our NeurIPS 2023 paper "Augmenting Language...	37	Emerging	822	Python
12	thunlp/InfLLM The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for...	35	Emerging	395	Python
13	pdfosborne/elsciRL The core repository of the elsciRL framework.	35	Emerging	18	Python
14	skit-ai/SpeechLLM This repository contains the training, inference, evaluation code for...	33	Emerging	130	Python
15	huggingface/datablations Scaling Data-Constrained Language Models	33	Emerging	342	Jupyter Notebook
16	luciusssss/ZhuangBench [ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly	33	Emerging	25	Python
17	NiuTrans/LMT Building an inclusive, scalable, and high-performance multilingual translation model	32	Emerging	125	Python
18	UCSC-VLAA/m1 [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical...	32	Emerging	48	Jupyter Notebook
19	sshh12/llm_optimize LLM Optimize is a proof-of-concept library for doing LLM (large language...	31	Emerging	61	Python
20	VityaVitalich/STASC [ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models	30	Emerging	11	Jupyter Notebook
21	mkuchnik/relm ReLM is a Regular Expression engine for Language Models	30	Emerging	107	Python
22	StupidTrees/SplitLLM Split Learning Simulation Framework for LLMs	30	Emerging	38	Python
23	WANGXinyiLinda/concept-based-demonstration-selection Official code of the paper Large Language Models Are Implicitly Topic Models:...	30	Emerging	75	Python
24	locuslab/massive-activations Code accompanying the paper "Massive Activations in Large Language Models"	30	Emerging	197	Python
25	luohongyin/LangCode LangCode - Improving alignment and reasoning of large language models (LLMs)...	30	Emerging	49	Python
26	martin-wey/peft-llm-code Replication package of the paper "Exploring Parameter-Efficient Fine-Tuning...	29	Experimental	25	Python
27	OSU-STARLAB/Simul-LLM [ACL 2024] An easily extensible framework for simultaneous, text-to-text...	29	Experimental	18	Python
28	ai8hyf/llm_split_recall_test Split and Recall: A simple and efficient benchmark to evaluate in-context...	28	Experimental	9	Python
29	NiuTrans/LaMaTE Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine...	27	Experimental	28	Python
30	YuanGongND/ltu Code, Dataset, and Pretrained Models for Audio and Speech Large Language...	26	Experimental	472	Python
31	ZigeW/data_management_LLM Collection of training data management explorations for large language models	26	Experimental	337	—
32	QwenLM/ParScale Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling	26	Experimental	476	Python
33	ymoslem/Adaptive-MT-LLM Adaptive Machine Translation with Large Language Models	25	Experimental	32	JavaScript
34	ryoungj/ObsScaling [NeurIPS'24 Spotlight] Observational Scaling Laws	24	Experimental	60	Jupyter Notebook
35	dinhquy-nguyen-1704/ZaloAI2023-Elementary-Math-Solving Baseline achieving 0.8 accuracy on the private test set in the ZaloAI...	24	Experimental	24	Python
36	zzz47zzz/codebase-for-incremental-learning-with-llm [ACL2024] A Codebase for Incremental Learning with Large Language Models;...	24	Experimental	60	Python
37	mubingshen/MLC-SLM-Baseline The project is associated with the recently-launched INTERSPEECH 2025...	23	Experimental	50	Python
38	yinzhangyue/EoT Exchange-of-Thought: Enhancing Large Language Model Capabilities through...	23	Experimental	21	Python
39	Butanium/llm-lang-agnostic minimal code to reproduce results from Separating Tongue from Thought:...	22	Experimental	13	Jupyter Notebook
40	bminixhofer/zett Code for Zero-Shot Tokenizer Transfer	22	Experimental	143	Python
41	Y-Research-SBU/CSR Official Repository for CSR - ICML 2025 Oral	21	Experimental	21	Python
42	rhubarbwu/linguistic-collapse Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models...	21	Experimental	18	Python
43	hank0316/AdaSearch This includes the original implementation of "AdaSearch: Balancing...	20	Experimental	10	—
44	LSquaredM/mutual_info_scaling_law (NeurIPS 2025) Official Code for L²M: Mutual Information Scaling Law for...	20	Experimental	13	Python
45	millioniron/LLM_exploration_Graph-Attention-Mechanisms-Perspective Code: Attention Mechanisms Perspective: Exploring LLM Processing of...	19	Experimental	12	Jupyter Notebook
46	HKUSTDial/megatran [VLDB'25] Official repo for Paper "Weak-to-Strong Prompts with...	16	Experimental	11	Python
47	IAAR-Shanghai/FastMem Fast Memorization of Prompt Improves Context Awareness of Large Language...	15	Experimental	24	Python
48	Xiaohao-Yang/LLM-ITL [ACL 2025 Main] Neural Topic Modeling with Large Language Models in the Loop	15	Experimental	11	Python
49	efficientscaling/Z1 [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"	15	Experimental	68	Python
50	EastTower16/LLMDataDistill distill large scale web page text	14	Experimental	12	C++
51	ictnlp/FastLongSpeech FastLongSpeech is a novel framework designed to extend the capabilities of...	14	Experimental	14	Python
52	UKPLab/arxiv2025-inherent-limits-plms Code repository for the paper "The Inherent Limits of Pretrained LLMs: The...	14	Experimental	13	Python
53	YutongWang1216/ReflectionLLMMT Code and data releases for the paper -- TasTe: Teaching Large Language...	14	Experimental	13	Python
54	eminorhan/llm-memory Memory experiments with LLMs	13	Experimental	10	Python
55	GeorgeVern/lmcor Code for the EACL 2024 paper: "Small Language Models Improve Giants by...	12	Experimental	12	Python
56	wyt2000/InverseCoder [AAAI 2025] The official code of the paper "InverseCoder: Unleashing the...	12	Experimental	14	Python
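For working with the table offline, the rows can be parsed into records. The sketch below assumes each row is tab-separated in the column order shown above and that the repository name is the first space-delimited token of the second column. The `Project` class and both function names are hypothetical. The tier cut-offs are inferred, not documented: the text says Established means a score above 50, and the table shows 30 as the lowest Emerging score and 29 as the highest Experimental one.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Project:
    rank: int
    repo: str
    description: str
    score: int
    tier: str
    stars: int
    language: Optional[str]  # None when the table shows a dash


def parse_row(line: str) -> Project:
    """Parse one tab-separated table row into a Project record."""
    rank, model, score, tier, stars, language = line.rstrip("\n").split("\t")
    # The second column holds "owner/repo description"; split on the first space.
    repo, _, description = model.partition(" ")
    return Project(
        rank=int(rank),
        repo=repo,
        description=description,
        score=int(score),
        tier=tier,
        stars=int(stars.replace(",", "")),  # "1,197" -> 1197
        language=None if language == "\u2014" else language,
    )


def tier_for(score: int) -> str:
    """Map a score to a tier using cut-offs inferred from the table (an assumption)."""
    if score > 50:
        return "Established"
    if score >= 30:
        return "Emerging"
    return "Experimental"


rows = [
    "1\tjncraton/languagemodels Explore large language models in 512MB of RAM\t60\tEstablished\t1,197\tHTML",
    "31\tZigeW/data_management_LLM Collection of training data management explorations for large language models\t26\tExperimental\t337\t\u2014",
]
projects = [parse_row(r) for r in rows]
for p in projects:
    print(p.rank, p.repo, p.score, p.tier, p.stars, p.language)
```

Comparing `tier_for(p.score)` with the parsed `p.tier` for every row is a quick way to check whether the inferred cut-offs hold across the full list.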