LLM Scaling Architecture Transformer Models

56 LLM scaling architecture models are tracked. Three score above 50 (Established tier). The highest-rated is jncraton/languagemodels at 60/100, with 1,197 stars and 588 monthly downloads.

Get the projects as JSON (the example request sets limit=20; 56 projects are tracked in total):

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-scaling-architecture&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
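The same request can be issued from Python. The following is a minimal sketch using only the standard library; the helper names `build_url` and `fetch_json` are hypothetical, the query parameters are copied from the curl example above, and the shape of the JSON response is not documented here, so the sketch returns the parsed payload without assuming any field names.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl example; nothing else about the API is assumed.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain: str, subcategory: str, limit: int) -> str:
    """Assemble the request URL with the same query parameters as the curl call."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_json(url: str, timeout: float = 30.0):
    """Fetch the URL and parse the body as JSON (response schema is not assumed)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example usage (performs a network request, so it is left commented out):
# data = fetch_json(build_url("transformers", "llm-scaling-architecture", 20))
# print(json.dumps(data, indent=2)[:500])
```

Whether a larger `limit` value returns all 56 projects in one call is not stated on this page, so treat that as something to verify against the API itself.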

| # | Model | Description | Score | Tier |
|---|-------|-------------|-------|------|
| 1 | jncraton/languagemodels | Explore large language models in 512MB of RAM | 60 | Established |
| 2 | microsoft/unilm | Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities | 57 | Established |
| 3 | albertan017/LLM4Decompile | Reverse Engineering: Decompiling Binary Code with Large Language Models | 54 | Established |
| 4 | haizelabs/verdict | Inference-time scaling for LLMs-as-a-judge. | 48 | Emerging |
| 5 | bytedance/Sa2VA | Official Repo For Pixel-LLM Codebase | 47 | Emerging |
| 6 | JIA-Lab-research/LISA | Project Page for "LISA: Reasoning Segmentation via Large Language Model" | 45 | Emerging |
| 7 | Tencent-Hunyuan/GradLoc | Implementation of GradLoc from the Tencent Hunyuan blog "Stabilizing RLVR... | 43 | Emerging |
| 8 | yang-ai-lab/SleepLM | SleepLM: Natural-Language Intelligence for Human Sleep | 41 | Emerging |
| 9 | Cardinal-Operations/ORLM | ORLM: Training Large Language Models for Optimization Modeling | 40 | Emerging |
| 10 | sinanuozdemir/oreilly-optimizing-llms | Optimizing LLMs with Fine-Tuning and Prompt Engineering | 39 | Emerging |
| 11 | Victorwz/LongMem | Official implementation of our NeurIPS 2023 paper "Augmenting Language... | 37 | Emerging |
| 12 | thunlp/InfLLM | The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for... | 35 | Emerging |
| 13 | pdfosborne/elsciRL | The core repository of the elsciRL framework. | 35 | Emerging |
| 14 | skit-ai/SpeechLLM | This repository contains the training, inference, evaluation code for... | 33 | Emerging |
| 15 | huggingface/datablations | Scaling Data-Constrained Language Models | 33 | Emerging |
| 16 | luciusssss/ZhuangBench | [ACL'24 Findings] Teaching Large Language Models an Unseen Language on the Fly | 33 | Emerging |
| 17 | NiuTrans/LMT | Building an inclusive, scalable, and high-performance multilingual translation model | 32 | Emerging |
| 18 | UCSC-VLAA/m1 | [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical... | 32 | Emerging |
| 19 | sshh12/llm_optimize | LLM Optimize is a proof-of-concept library for doing LLM (large language... | 31 | Emerging |
| 20 | VityaVitalich/STASC | [ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models | 30 | Emerging |
| 21 | mkuchnik/relm | ReLM is a Regular Expression engine for Language Models | 30 | Emerging |
| 22 | StupidTrees/SplitLLM | Split Learning Simulation Framework for LLMs | 30 | Emerging |
| 23 | WANGXinyiLinda/concept-based-demonstration-selection | Official code of the paper Large Language Models Are Implicitly Topic Models:... | 30 | Emerging |
| 24 | locuslab/massive-activations | Code accompanying the paper "Massive Activations in Large Language Models" | 30 | Emerging |
| 25 | luohongyin/LangCode | LangCode - Improving alignment and reasoning of large language models (LLMs)... | 30 | Emerging |
| 26 | martin-wey/peft-llm-code | Replication package of the paper "Exploring Parameter-Efficient Fine-Tuning... | 29 | Experimental |
| 27 | OSU-STARLAB/Simul-LLM | [ACL 2024] An easily extensible framework for simultaneous, text-to-text... | 29 | Experimental |
| 28 | ai8hyf/llm_split_recall_test | Split and Recall: A simple and efficient benchmark to evaluate in-context... | 28 | Experimental |
| 29 | NiuTrans/LaMaTE | Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine... | 27 | Experimental |
| 30 | YuanGongND/ltu | Code, Dataset, and Pretrained Models for Audio and Speech Large Language... | 26 | Experimental |
| 31 | ZigeW/data_management_LLM | Collection of training data management explorations for large language models | 26 | Experimental |
| 32 | QwenLM/ParScale | Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling | 26 | Experimental |
| 33 | ymoslem/Adaptive-MT-LLM | Adaptive Machine Translation with Large Language Models | 25 | Experimental |
| 34 | ryoungj/ObsScaling | [NeurIPS'24 Spotlight] Observational Scaling Laws | 24 | Experimental |
| 35 | dinhquy-nguyen-1704/ZaloAI2023-Elementary-Math-Solving | Baseline achieving 0.8 accuracy on the private test set in the ZaloAI... | 24 | Experimental |
| 36 | zzz47zzz/codebase-for-incremental-learning-with-llm | [ACL2024] A Codebase for Incremental Learning with Large Language Models;... | 24 | Experimental |
| 37 | mubingshen/MLC-SLM-Baseline | The project is associated with the recently-launched INTERSPEECH 2025... | 23 | Experimental |
| 38 | yinzhangyue/EoT | Exchange-of-Thought: Enhancing Large Language Model Capabilities through... | 23 | Experimental |
| 39 | Butanium/llm-lang-agnostic | minimal code to reproduce results from Separating Tongue from Thought:... | 22 | Experimental |
| 40 | bminixhofer/zett | Code for Zero-Shot Tokenizer Transfer | 22 | Experimental |
| 41 | Y-Research-SBU/CSR | Official Repository for CSR - ICML 2025 Oral | 21 | Experimental |
| 42 | rhubarbwu/linguistic-collapse | Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models... | 21 | Experimental |
| 43 | hank0316/AdaSearch | This includes the original implementation of "AdaSearch: Balancing... | 20 | Experimental |
| 44 | LSquaredM/mutual_info_scaling_law | (NeurIPS 2025) Official Code for L²M: Mutual Information Scaling Law for... | 20 | Experimental |
| 45 | millioniron/LLM_exploration_Graph-Attention-Mechanisms-Perspective | Code: Attention Mechanisms Perspective: Exploring LLM Processing of... | 19 | Experimental |
| 46 | HKUSTDial/megatran | [VLDB'25] Official repo for Paper "Weak-to-Strong Prompts with... | 16 | Experimental |
| 47 | IAAR-Shanghai/FastMem | Fast Memorization of Prompt Improves Context Awareness of Large Language... | 15 | Experimental |
| 48 | Xiaohao-Yang/LLM-ITL | [ACL 2025 Main] Neural Topic Modeling with Large Language Models in the Loop | 15 | Experimental |
| 49 | efficientscaling/Z1 | [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code" | 15 | Experimental |
| 50 | EastTower16/LLMDataDistill | distill large scale web page text | 14 | Experimental |
| 51 | ictnlp/FastLongSpeech | FastLongSpeech is a novel framework designed to extend the capabilities of... | 14 | Experimental |
| 52 | UKPLab/arxiv2025-inherent-limits-plms | Code repository for the paper "The Inherent Limits of Pretrained LLMs: The... | 14 | Experimental |
| 53 | YutongWang1216/ReflectionLLMMT | Code and data releases for the paper -- TasTe: Teaching Large Language... | 14 | Experimental |
| 54 | eminorhan/llm-memory | Memory experiments with LLMs | 13 | Experimental |
| 55 | GeorgeVern/lmcor | Code for the EACL 2024 paper: "Small Language Models Improve Giants by... | 12 | Experimental |
| 56 | wyt2000/InverseCoder | [AAAI 2025] The official code of the paper "InverseCoder: Unleashing the... | 12 | Experimental |
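The relationship between score and tier in the listing can be sketched in a few lines of Python. This is an illustration only: the page states that scores above 50 are Established, while the Emerging/Experimental cutoff used here is inferred from the rows (30 is listed as Emerging, 29 as Experimental). No score of 49 or 50 appears, so how the site treats those values is an assumption. The function name `tier_for` is hypothetical.

```python
def tier_for(score: int) -> str:
    """Map a 0-100 score to a tier label, using cutoffs inferred from the listing."""
    if score > 50:
        # Stated on the page: scores above 50 are in the Established tier.
        return "Established"
    if score >= 30:
        # Inferred: the lowest Emerging score listed is 30.
        return "Emerging"
    # Inferred: the highest Experimental score listed is 29.
    return "Experimental"


# A few (model, score) pairs copied from the listing.
sample = [
    ("jncraton/languagemodels", 60),
    ("haizelabs/verdict", 48),
    ("luohongyin/LangCode", 30),
    ("martin-wey/peft-llm-code", 29),
    ("wyt2000/InverseCoder", 12),
]

for name, score in sample:
    print(f"{name}: {score} -> {tier_for(score)}")
```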