Llm Reasoning Research Transformer Models
There are 39 llm reasoning research models tracked. 1 score above 70 (verified tier). The highest-rated is cvs-health/uqlm at 75/100 with 1,121 stars and 1,881 monthly downloads. 2 of the top 10 are actively maintained.
Get all 39 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-reasoning-research&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
cvs-health/uqlm
UQLM: Uncertainty Quantification for Language Models, is a Python package... |
|
Verified |
| 2 |
PRIME-RL/TTRL
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning |
|
Established |
| 3 |
sapientinc/HRM
Hierarchical Reasoning Model Official Release |
|
Emerging |
| 4 |
tigerchen52/query_level_uncertainty
query-level uncertainty in LLMs |
|
Emerging |
| 5 |
reasoning-survey/Awesome-Reasoning-Foundation-Models
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models |
|
Emerging |
| 6 |
HKUDS/LightReasoner
"LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?" |
|
Emerging |
| 7 |
spcl/x1
Official Implementation of "Reasoning Language Models: A Blueprint" |
|
Emerging |
| 8 |
hao-ai-lab/Dynasor
[NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model... |
|
Emerging |
| 9 |
mbzuai-oryx/Awesome-LLM-Post-training
Awesome Reasoning LLM Tutorial/Survey/Guide |
|
Emerging |
| 10 |
sail-sg/understand-r1-zero
Understanding R1-Zero-Like Training: A Critical Perspective |
|
Emerging |
| 11 |
TIGER-AI-Lab/Pixel-Reasoner
Pixel-Level Reasoning Model trained with RL [NeuIPS25] |
|
Emerging |
| 12 |
Eclipsess/Awesome-Efficient-Reasoning-LLMs
[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large... |
|
Emerging |
| 13 |
lqzxt/Time-R1
Time-R1 is a two-stage reinforcement fine-tuning framework that trains large... |
|
Emerging |
| 14 |
Qwen-Applications/CLIPO
CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR |
|
Emerging |
| 15 |
jqtangust/Robust-R1
🔥🔥🔥[AAAI 2026 Oral] Official Implementation of Robust-R1: Degradation-Aware... |
|
Emerging |
| 16 |
Lanerra/reasoning-bank-slm
An experiment that applies Google Research's `ReasoningBank` technique to... |
|
Emerging |
| 17 |
TIGER-AI-Lab/VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of... |
|
Emerging |
| 18 |
iiis-ai/cumulative-reasoning
[TMLR] Cumulative Reasoning With Large Language Models... |
|
Emerging |
| 19 |
AlexanderVNikitin/kernel-language-entropy
Code for Fine-grained Uncertainty Quantification for LLMs from Semantic... |
|
Emerging |
| 20 |
Alsace08/Chain-of-Embedding
[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding... |
|
Experimental |
| 21 |
yongchao98/R1-Code-Interpreter
R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and... |
|
Experimental |
| 22 |
TIGER-AI-Lab/General-Reasoner
General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25] |
|
Experimental |
| 23 |
andrewliao11/LongPerceptualThoughts
[COLM'25] The official implementation of "LongPerceptualThoughts: Distilling... |
|
Experimental |
| 24 |
StringNLPLAB/MGS
Repository for the paper "Advancing General-Purpose Reasoning Models with... |
|
Experimental |
| 25 |
InternLM/OREAL
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning |
|
Experimental |
| 26 |
rkinas/reasoning_models_how_to
This repository serves as a collection of research notes and resources on... |
|
Experimental |
| 27 |
SalesforceAIResearch/Elastic-Reasoning
Make reasoning models scalable |
|
Experimental |
| 28 |
The-Martyr/CausalMM
[ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal... |
|
Experimental |
| 29 |
Tebmer/Rereading-LLM-Reasoning
EMNLP 2024 "Re-reading improves reasoning in large language models". Simply... |
|
Experimental |
| 30 |
ulab-uiuc/Time-R1
Time-R1: Framework and resources for endowing LLMs with comprehensive... |
|
Experimental |
| 31 |
PRIME-RL/Entropy-Mechanism-of-RL
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning. |
|
Experimental |
| 32 |
czg1225/VeriThinker
[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient |
|
Experimental |
| 33 |
WooooDyy/LLM-Reverse-Curriculum-RL
Implementation of the ICML 2024 paper "Training Large Language Models for... |
|
Experimental |
| 34 |
sparkle-reasoning/sparkle
[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs... |
|
Experimental |
| 35 |
Hyun-Ryu/clover
Official code for "Divide and Translate: Compositional First-Order Logic... |
|
Experimental |
| 36 |
Eric2i/LLM-MindMap
EMNLP 2025 - "Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning... |
|
Experimental |
| 37 |
sastpg/RFTT
RFTT: Reasoning with Reinforced Functional Token Tuning |
|
Experimental |
| 38 |
zhaochen0110/Cotempqa
Code and data for "Living in the Moment: Can Large Language Models Grasp... |
|
Experimental |
| 39 |
hewei2001/ReachQA
[EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs |
|
Experimental |