Llm Scaling Architecture Transformer Models
There are 56 llm scaling architecture models tracked. 3 score above 50 (established tier). The highest-rated is jncraton/languagemodels at 60/100 with 1,197 stars and 588 monthly downloads.
Get all 56 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-scaling-architecture&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
jncraton/languagemodels
Explore large language models in 512MB of RAM |
|
Established |
| 2 |
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities |
|
Established |
| 3 |
albertan017/LLM4Decompile
Reverse Engineering: Decompiling Binary Code with Large Language Models |
|
Established |
| 4 |
haizelabs/verdict
Inference-time scaling for LLMs-as-a-judge. |
|
Emerging |
| 5 |
bytedance/Sa2VA
Official Repo For Pixel-LLM Codebase |
|
Emerging |
| 6 |
JIA-Lab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model" |
|
Emerging |
| 7 |
Tencent-Hunyuan/GradLoc
Implementation of GradLoc from the Tencent Hunyuan blog "Stabilizing RLVR... |
|
Emerging |
| 8 |
yang-ai-lab/SleepLM
SleepLM: Natural-Language Intelligence for Human Sleep |
|
Emerging |
| 9 |
Cardinal-Operations/ORLM
ORLM: Training Large Language Models for Optimization Modeling |
|
Emerging |
| 10 |
sinanuozdemir/oreilly-optimizing-llms
Optimizing LLMs with Fine-Tuning and Prompt Engineering |
|
Emerging |
| 11 |
Victorwz/LongMem
Official implementation of our NeurIPS 2023 paper "Augmenting Language... |
|
Emerging |
| 12 |
thunlp/InfLLM
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for... |
|
Emerging |
| 13 |
pdfosborne/elsciRL
The core repository of the elsciRL framework. |
|
Emerging |
| 14 |
skit-ai/SpeechLLM
This repository contains the training, inference, evaluation code for... |
|
Emerging |
| 15 |
huggingface/datablations
Scaling Data-Constrained Language Models |
|
Emerging |
| 16 |
luciusssss/ZhuangBench
[ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly |
|
Emerging |
| 17 |
NiuTrans/LMT
Building a inclusive, scalable, and high-performance multilingual translation model |
|
Emerging |
| 18 |
UCSC-VLAA/m1
[ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical... |
|
Emerging |
| 19 |
sshh12/llm_optimize
LLM Optimize is a proof-of-concept library for doing LLM (large language... |
|
Emerging |
| 20 |
VityaVitalich/STASC
[ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models |
|
Emerging |
| 21 |
mkuchnik/relm
ReLM is a Regular Expression engine for Language Models |
|
Emerging |
| 22 |
StupidTrees/SplitLLM
Split Learning Simulation Framework for LLMs |
|
Emerging |
| 23 |
WANGXinyiLinda/concept-based-demonstration-selection
Offical code of the paper Large Language Models Are Implicitly Topic Models:... |
|
Emerging |
| 24 |
locuslab/massive-activations
Code accompanying the paper "Massive Activations in Large Language Models" |
|
Emerging |
| 25 |
luohongyin/LangCode
LangCode - Improving alignment and reasoning of large language models (LLMs)... |
|
Emerging |
| 26 |
martin-wey/peft-llm-code
Replication package of the paper "Exploring Parameter-Efficient Fine-Tuning... |
|
Experimental |
| 27 |
OSU-STARLAB/Simul-LLM
[ACL 2024] An easily extensible framework for simultaneous, text-to-text... |
|
Experimental |
| 28 |
ai8hyf/llm_split_recall_test
Split and Recall: A simple and efficient benchmark to evaluate in-context... |
|
Experimental |
| 29 |
NiuTrans/LaMaTE
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine... |
|
Experimental |
| 30 |
YuanGongND/ltu
Code, Dataset, and Pretrained Models for Audio and Speech Large Language... |
|
Experimental |
| 31 |
ZigeW/data_management_LLM
Collection of training data management explorations for large language models |
|
Experimental |
| 32 |
QwenLM/ParScale
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling |
|
Experimental |
| 33 |
ymoslem/Adaptive-MT-LLM
Adaptive Machine Translation with Large Language Models |
|
Experimental |
| 34 |
ryoungj/ObsScaling
[NeurIPS'24 Spotlight] Observational Scaling Laws |
|
Experimental |
| 35 |
dinhquy-nguyen-1704/ZaloAI2023-Elementary-Math-Solving
Baseline achieving 0.8 accuracy on the private test set in the ZaloAI... |
|
Experimental |
| 36 |
zzz47zzz/codebase-for-incremental-learning-with-llm
[ACL2024] A Codebase for Incremental Learning with Large Language Models;... |
|
Experimental |
| 37 |
mubingshen/MLC-SLM-Baseline
The project is associated with the recently-launched INTERSPEECH 2025... |
|
Experimental |
| 38 |
yinzhangyue/EoT
Exchange-of-Thought: Enhancing Large Language Model Capabilities through... |
|
Experimental |
| 39 |
Butanium/llm-lang-agnostic
minimal code to reproduce results from Separating Tongue from Thought:... |
|
Experimental |
| 40 |
bminixhofer/zett
Code for Zero-Shot Tokenizer Transfer |
|
Experimental |
| 41 |
Y-Research-SBU/CSR
Official Repository for CSR - ICML 2025 Oral |
|
Experimental |
| 42 |
rhubarbwu/linguistic-collapse
Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models... |
|
Experimental |
| 43 |
hank0316/AdaSearch
This includes the original implementation of "AdaSearch: Balancing... |
|
Experimental |
| 44 |
LSquaredM/mutual_info_scaling_law
(NeurIPS 2025) Official Code for L²M: Mutual Information Scaling Law for... |
|
Experimental |
| 45 |
millioniron/LLM_exploration_Graph-Attention-Mechanisms-Perspective
Code: Attention Mechanisms Perspective: Exploring LLM Processing of... |
|
Experimental |
| 46 |
HKUSTDial/megatran
[VLDB'25] Official repo for Paper "Weak-to-Strong Prompts with... |
|
Experimental |
| 47 |
IAAR-Shanghai/FastMem
Fast Memorization of Prompt Improves Context Awareness of Large Language... |
|
Experimental |
| 48 |
Xiaohao-Yang/LLM-ITL
[ACL 2025 Main] Neural Topic Modeling with Large Language Models in the Loop |
|
Experimental |
| 49 |
efficientscaling/Z1
[EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code" |
|
Experimental |
| 50 |
EastTower16/LLMDataDistill
distill large scale web page text |
|
Experimental |
| 51 |
ictnlp/FastLongSpeech
FastLongSpeech is a novel framework designed to extend the capabilities of... |
|
Experimental |
| 52 |
UKPLab/arxiv2025-inherent-limits-plms
Code repository for the paper "The Inherent Limits of Pretrained LLMs: The... |
|
Experimental |
| 53 |
YutongWang1216/ReflectionLLMMT
Code and data realeases for the paper -- TasTe: Teaching Large Language... |
|
Experimental |
| 54 |
eminorhan/llm-memory
Memory experiments with LLMs |
|
Experimental |
| 55 |
GeorgeVern/lmcor
Code for the EACL 2024 paper: "Small Language Models Improve Giants by... |
|
Experimental |
| 56 |
wyt2000/InverseCoder
[AAAI 2025] The official code of the paper "InverseCoder: Unleashing the... |
|
Experimental |