Llm Reasoning Research Transformer Models

There are 39 llm reasoning research models tracked. 1 score above 70 (verified tier). The highest-rated is cvs-health/uqlm at 75/100 with 1,121 stars and 1,881 monthly downloads. 2 of the top 10 are actively maintained.

Get all 39 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-reasoning-research&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

#	Model	Score	Tier	Stars	Language
1	cvs-health/uqlm UQLM: Uncertainty Quantification for Language Models, is a Python package...	75	Verified	1,121	Python
2	PRIME-RL/TTRL [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning	52	Established	1,014	Python
3	sapientinc/HRM Hierarchical Reasoning Model Official Release	49	Emerging	12,358	Python
4	tigerchen52/query_level_uncertainty query-level uncertainty in LLMs	41	Emerging	9	Python
5	reasoning-survey/Awesome-Reasoning-Foundation-Models ✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models	38	Emerging	652	—
6	HKUDS/LightReasoner "LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"	38	Emerging	594	Python
7	spcl/x1 Official Implementation of "Reasoning Language Models: A Blueprint"	37	Emerging	94	Python
8	hao-ai-lab/Dynasor [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model...	37	Emerging	224	Python
9	mbzuai-oryx/Awesome-LLM-Post-training Awesome Reasoning LLM Tutorial/Survey/Guide	35	Emerging	2,321	Python
10	sail-sg/understand-r1-zero Understanding R1-Zero-Like Training: A Critical Perspective	35	Emerging	1,224	Python
11	TIGER-AI-Lab/Pixel-Reasoner Pixel-Level Reasoning Model trained with RL [NeuIPS25]	34	Emerging	282	Python
12	Eclipsess/Awesome-Efficient-Reasoning-LLMs [TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large...	34	Emerging	752	—
13	lqzxt/Time-R1 Time-R1 is a two-stage reinforcement fine-tuning framework that trains large...	33	Emerging	94	Python
14	Qwen-Applications/CLIPO CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR	32	Emerging	10	Python
15	jqtangust/Robust-R1 🔥🔥🔥[AAAI 2026 Oral] Official Implementation of Robust-R1: Degradation-Aware...	32	Emerging	520	Python
16	Lanerra/reasoning-bank-slm An experiment that applies Google Research's `ReasoningBank` technique to...	31	Emerging	99	Python
17	TIGER-AI-Lab/VL-Rethinker The official code of "VL-Rethinker: Incentivizing Self-Reflection of...	30	Emerging	184	Python
18	iiis-ai/cumulative-reasoning [TMLR] Cumulative Reasoning With Large Language Models...	30	Emerging	308	Python
19	AlexanderVNikitin/kernel-language-entropy Code for Fine-grained Uncertainty Quantification for LLMs from Semantic...	30	Emerging	36	Python
20	Alsace08/Chain-of-Embedding [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding...	29	Experimental	95	Python
21	yongchao98/R1-Code-Interpreter R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and...	29	Experimental	31	Python
22	TIGER-AI-Lab/General-Reasoner General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]	28	Experimental	222	Python
23	andrewliao11/LongPerceptualThoughts [COLM'25] The official implementation of "LongPerceptualThoughts: Distilling...	28	Experimental	11	Python
24	StringNLPLAB/MGS Repository for the paper "Advancing General-Purpose Reasoning Models with...	28	Experimental	19	Python
25	InternLM/OREAL Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning	27	Experimental	193	Python
26	rkinas/reasoning_models_how_to This repository serves as a collection of research notes and resources on...	26	Experimental	132	Python
27	SalesforceAIResearch/Elastic-Reasoning Make reasoning models scalable	26	Experimental	47	Jupyter Notebook
28	The-Martyr/CausalMM [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal...	25	Experimental	61	Python
29	Tebmer/Rereading-LLM-Reasoning EMNLP 2024 "Re-reading improves reasoning in large language models". Simply...	25	Experimental	29	Python
30	ulab-uiuc/Time-R1 Time-R1: Framework and resources for endowing LLMs with comprehensive...	24	Experimental	66	Python
31	PRIME-RL/Entropy-Mechanism-of-RL The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.	23	Experimental	421	Python
32	czg1225/VeriThinker [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient	22	Experimental	65	Python
33	WooooDyy/LLM-Reverse-Curriculum-RL Implementation of the ICML 2024 paper "Training Large Language Models for...	22	Experimental	116	Python
34	sparkle-reasoning/sparkle [NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs...	21	Experimental	16	Python
35	Hyun-Ryu/clover Official code for "Divide and Translate: Compositional First-Order Logic...	20	Experimental	27	Python
36	Eric2i/LLM-MindMap EMNLP 2025 - "Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning...	18	Experimental	12	Python
37	sastpg/RFTT RFTT: Reasoning with Reinforced Functional Token Tuning	18	Experimental	29	Python
38	zhaochen0110/Cotempqa Code and data for "Living in the Moment: Can Large Language Models Grasp...	12	Experimental	32	Python
39	hewei2001/ReachQA [EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs	11	Experimental	59	Python

Comparisons in this category

uqlm and kernel-language-entropy (75 vs 30) uqlm and query_level_uncertainty (75 vs 41)