Llm Reasoning Research Transformer Models

There are 39 llm reasoning research models tracked. 1 score above 70 (verified tier). The highest-rated is cvs-health/uqlm at 75/100 with 1,121 stars and 1,881 monthly downloads. 2 of the top 10 are actively maintained.

Get all 39 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-reasoning-research&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 cvs-health/uqlm

UQLM: Uncertainty Quantification for Language Models, is a Python package...

75
Verified
2 PRIME-RL/TTRL

[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

52
Established
3 sapientinc/HRM

Hierarchical Reasoning Model Official Release

49
Emerging
4 tigerchen52/query_level_uncertainty

query-level uncertainty in LLMs

41
Emerging
5 reasoning-survey/Awesome-Reasoning-Foundation-Models

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

38
Emerging
6 HKUDS/LightReasoner

"LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"

38
Emerging
7 spcl/x1

Official Implementation of "Reasoning Language Models: A Blueprint"

37
Emerging
8 hao-ai-lab/Dynasor

[NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model...

37
Emerging
9 mbzuai-oryx/Awesome-LLM-Post-training

Awesome Reasoning LLM Tutorial/Survey/Guide

35
Emerging
10 sail-sg/understand-r1-zero

Understanding R1-Zero-Like Training: A Critical Perspective

35
Emerging
11 TIGER-AI-Lab/Pixel-Reasoner

Pixel-Level Reasoning Model trained with RL [NeuIPS25]

34
Emerging
12 Eclipsess/Awesome-Efficient-Reasoning-LLMs

[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large...

34
Emerging
13 lqzxt/Time-R1

Time-R1 is a two-stage reinforcement fine-tuning framework that trains large...

33
Emerging
14 Qwen-Applications/CLIPO

CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

32
Emerging
15 jqtangust/Robust-R1

🔥🔥🔥[AAAI 2026 Oral] Official Implementation of Robust-R1: Degradation-Aware...

32
Emerging
16 Lanerra/reasoning-bank-slm

An experiment that applies Google Research's `ReasoningBank` technique to...

31
Emerging
17 TIGER-AI-Lab/VL-Rethinker

The official code of "VL-Rethinker: Incentivizing Self-Reflection of...

30
Emerging
18 iiis-ai/cumulative-reasoning

[TMLR] Cumulative Reasoning With Large Language Models...

30
Emerging
19 AlexanderVNikitin/kernel-language-entropy

Code for Fine-grained Uncertainty Quantification for LLMs from Semantic...

30
Emerging
20 Alsace08/Chain-of-Embedding

[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding...

29
Experimental
21 yongchao98/R1-Code-Interpreter

R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and...

29
Experimental
22 TIGER-AI-Lab/General-Reasoner

General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]

28
Experimental
23 andrewliao11/LongPerceptualThoughts

[COLM'25] The official implementation of "LongPerceptualThoughts: Distilling...

28
Experimental
24 StringNLPLAB/MGS

Repository for the paper "Advancing General-Purpose Reasoning Models with...

28
Experimental
25 InternLM/OREAL

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

27
Experimental
26 rkinas/reasoning_models_how_to

This repository serves as a collection of research notes and resources on...

26
Experimental
27 SalesforceAIResearch/Elastic-Reasoning

Make reasoning models scalable

26
Experimental
28 The-Martyr/CausalMM

[ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal...

25
Experimental
29 Tebmer/Rereading-LLM-Reasoning

EMNLP 2024 "Re-reading improves reasoning in large language models". Simply...

25
Experimental
30 ulab-uiuc/Time-R1

Time-R1: Framework and resources for endowing LLMs with comprehensive...

24
Experimental
31 PRIME-RL/Entropy-Mechanism-of-RL

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

23
Experimental
32 czg1225/VeriThinker

[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient

22
Experimental
33 WooooDyy/LLM-Reverse-Curriculum-RL

Implementation of the ICML 2024 paper "Training Large Language Models for...

22
Experimental
34 sparkle-reasoning/sparkle

[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs...

21
Experimental
35 Hyun-Ryu/clover

Official code for "Divide and Translate: Compositional First-Order Logic...

20
Experimental
36 Eric2i/LLM-MindMap

EMNLP 2025 - "Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning...

18
Experimental
37 sastpg/RFTT

RFTT: Reasoning with Reinforced Functional Token Tuning

18
Experimental
38 zhaochen0110/Cotempqa

Code and data for "Living in the Moment: Can Large Language Models Grasp...

12
Experimental
39 hewei2001/ReachQA

[EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs

11
Experimental