All Transformer Models
6,429 models ranked by quality score · Page 22 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 2101 |
AIRI-Institute/Probing_framework
Framework for probing tasks |
|
Emerging |
| 2102 |
RishabSA/interp-refusal-tokens
We study whether categorical refusal tokens enable controllable and... |
|
Emerging |
| 2103 |
dirmacs/lancor
A Rust client library for llama.cpp's OpenAI-compatible API server |
|
Emerging |
| 2104 |
anthonyfoust/ai-stack-homelab
Complete AI automation stack optimized for Mac Mini M4, but can work in... |
|
Emerging |
| 2105 |
taesiri/ArXivQA
WIP - Automated Question Answering for ArXiv Papers with Large Language... |
|
Emerging |
| 2106 |
nsi319/Finetune-Transformers
Abstractive text summarization by fine-tuning seq2seq models. |
|
Emerging |
| 2107 |
AspirinCode/AlphaPPImd
Exploring the conformational ensembles of protein-protein complexes with... |
|
Emerging |
| 2108 |
deep-div/Fine-Tuning-LLMs-and-VisionModels
Fine-Tuning LLMs (Gemma, LLaMA, Mistral, etc.) A practical guide to... |
|
Emerging |
| 2109 |
sixfingerdev/-Sixfinger-API---10-20x-Faster-AI-Chat-API
# ⚡ Sixfinger API - 10-20x Faster AI Chat API. İncludes 9 models. |
|
Emerging |
| 2110 |
styfeng/TinyDialogues
Code & data for the EMNLP 2024 paper: Is Child-Directed Speech Effective... |
|
Emerging |
| 2111 |
NTU-SQUAD/transformers-coqa
Albert for Conversational Question Answering Challenge |
|
Emerging |
| 2112 |
titanml/takeoff-community
TitanML Takeoff Server is an optimization, compression and deployment... |
|
Emerging |
| 2113 |
codefuse-ai/GALLa
[ACL 2025] Graph Aligned Large Language Models for Improved Source Code Understanding |
|
Emerging |
| 2114 |
pagraf/Seabed-Net
Quick start guide for Seabed-Net |
|
Emerging |
| 2115 |
deep-symbolic-mathematics/Multimodal-Symbolic-Regression
[ICLR 2024 Spotlight] SNIP on Symbolic Regression: Deep Symbolic Regression... |
|
Emerging |
| 2116 |
wassemgtk/llm.scala
Extensible implementation of a Language Model (LLM) training framework in Scala. |
|
Emerging |
| 2117 |
dropbox/grallama-panel
GraLLAMA panel for LLAMA data |
|
Emerging |
| 2118 |
FranxYao/FlanT5-CoT-Specialization
Implementation of ICML 23 Paper: Specializing Smaller Language Models... |
|
Emerging |
| 2119 |
IParraMartin/An-Explanation-Is-All-You-Need
The original transformer implementation from scratch. It contains... |
|
Emerging |
| 2120 |
xiaoachen98/Open-LLaVA-NeXT
An open-source implementation for training LLaVA-NeXT. |
|
Emerging |
| 2121 |
SCRN-VRC/Language-Translation-with-Fragment-Shaders
EN to JP and JP to EN with transformer models |
|
Emerging |
| 2122 |
Chunjiang-Intelligence/Credal-Transformer
论文「Credal Transformer: A Principled Approach for Quantifying and Mitigating... |
|
Emerging |
| 2123 |
RaptorMai/MLLM-CompBench
[NeurIPS'25] MLLM-CompBench evaluates the comparative reasoning of MLLMs... |
|
Emerging |
| 2124 |
FudanDISC/ReForm-Eval
An benchmark for evaluating the capabilities of large vision-language models (LVLMs) |
|
Emerging |
| 2125 |
Yifan-Song793/ETO
Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents... |
|
Emerging |
| 2126 |
nlp-uoregon/Okapi
Okapi: Instruction-tuned Large Language Models in Multiple Languages with... |
|
Emerging |
| 2127 |
ivanovitchm/PPGEEC2318
Repository for EEC2318, a graduate course on PPgEEC about Machine Learning |
|
Emerging |
| 2128 |
TamSiuhin/LLM-UM-Reading
A list of large language models for user modeling (LLM-UM) papers, based on... |
|
Emerging |
| 2129 |
tongnie/ImputeFormer
[KDD 2024] "ImputeFormer: Low Rankness-Induced Transformers for... |
|
Emerging |
| 2130 |
smpanaro/coreml-llm-cli
CLI to demonstrate running a large language model (LLM) on Apple Neural Engine. |
|
Emerging |
| 2131 |
makllama/makllama
MaK(Mac+Kubernetes)llama - Running and orchestrating large language models... |
|
Emerging |
| 2132 |
Relaxed-System-Lab/HexGen
[ICML 2024] Serving LLMs on heterogeneous decentralized clusters. |
|
Emerging |
| 2133 |
AGI-Edgerunners/LLM-Optimizers-Papers
Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic... |
|
Emerging |
| 2134 |
juzhengz/LoRI
[COLM 2025] LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation |
|
Emerging |
| 2135 |
QwenLM/PolyMath
[NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath:... |
|
Emerging |
| 2136 |
Saivineeth147/llm-testlab
Comprehensive Testing Tool for Large Language Models |
|
Emerging |
| 2137 |
miranthajayatilake/nanoQA
Question-answering on your own data with Large Language Models (LLMs) |
|
Emerging |
| 2138 |
ZongXR/8th-National-AI-Training-Competition
第八届全国职工职业技能大赛人工智能训练师赛项 |
|
Emerging |
| 2139 |
frankluise5220/ComfyUI-Lorahelper
A professional automation toolkit for ComfyUI to prepare LoRA training data... |
|
Emerging |
| 2140 |
DomHudson/bert-in-production
A collection of resources on using BERT (https://arxiv.org/abs/1810.04805 )... |
|
Emerging |
| 2141 |
danieloquelis/natural-language-git
Offline LLM-powered Git CLI tool. NLGit interprets your natural language... |
|
Emerging |
| 2142 |
JonSnow1807/Medical-Prescription-OCR
OCR system for handwritten medical prescriptions using Donut transformer and... |
|
Emerging |
| 2143 |
vbario/sleeping-llm
A language model that forms persistent memories from conversation and... |
|
Emerging |
| 2144 |
OpenMOSS/LongLLaDA
[AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs |
|
Emerging |
| 2145 |
singhsidhukuldeep/Text-Summarizer
Comparing state of the art models for text summary generation |
|
Emerging |
| 2146 |
RahulSChand/llama2.c-for-dummies
Step by step explanation/tutorial of llama2.c |
|
Emerging |
| 2147 |
KishanBagaria/dAbot
🤖 CLI tool to automate stuff on DeviantArt.com |
|
Emerging |
| 2148 |
xmed-lab/TAM
[ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs |
|
Emerging |
| 2149 |
EagleW/Stage-wise-Fine-tuning
Code for Stage-wise Fine-tuning for Graph-to-Text Generation |
|
Emerging |
| 2150 |
jshuadvd/LongRoPE
Implementation of the LongRoPE: Extending LLM Context Window Beyond 2... |
|
Emerging |
| 2151 |
alan-turing-institute/prompto
An open source library for asynchronous querying of LLM endpoints |
|
Emerging |
| 2152 |
HLTCHKUST/VG-GPLMs
The code repository for EMNLP 2021 paper "Vision Guided Generative... |
|
Emerging |
| 2153 |
Orion-AI-Lab/televit
Teleconnection-driven vision transformers for improved long-term forecasting |
|
Emerging |
| 2154 |
ryoungj/ObsScaling
[NeurIPS'24 Spotlight] Observational Scaling Laws |
|
Emerging |
| 2155 |
vmarinowski/infini-attention
An unofficial pytorch implementation of 'Efficient Infinite Context... |
|
Emerging |
| 2156 |
ant-louis/belgpt2
🇧🇪 BelGPT-2: the 1st GPT model pretrained in French. |
|
Emerging |
| 2157 |
raymin0223/fast_robust_early_exit
Fast and Robust Early-Exiting Framework for Autoregressive Language Models... |
|
Emerging |
| 2158 |
AlexIoannides/transformers-gen-ai
Developing generative language models using transformers. |
|
Emerging |
| 2159 |
iVishalr/GPT
A minimal and efficient Pytorch implementation of OpenAI's GPT (Generative... |
|
Emerging |
| 2160 |
mts-ai/OpenAutoNLU
An open-source pipeline for training natural language understanding models |
|
Emerging |
| 2161 |
otvam/pyscalexfmr
Optimization and Scaling of Medium-Frequency Transformers |
|
Emerging |
| 2162 |
yangjianxin1/LongQLoRA
LongQLoRA: Extent Context Length of LLMs Efficiently |
|
Emerging |
| 2163 |
Mmorgan-ML/Phase-Slip-Sampler
Phase-Slip is a stochastic intervention architecture that operates on the... |
|
Emerging |
| 2164 |
UIC-Liu-Lab/ContinualLM
An Extensible Continual Learning Framework Focused on Language Models (LMs) |
|
Emerging |
| 2165 |
kyegomez/MambaDecoderBlock
MambaDecoderBlock is a novel decoder architecture that replaces traditional... |
|
Emerging |
| 2166 |
ChanMeng666/interactive-story-generator
【Join our constellation of stargazers!⭐️】An interactive AI-powered story... |
|
Emerging |
| 2167 |
shikiw/Modality-Integration-Rate
[ICCV 2025] The official code of the paper "Deciphering Cross-Modal... |
|
Emerging |
| 2168 |
curtisgray/wingman
Wingman is the fastest and easiest way to run Llama models on your PC or Mac. |
|
Emerging |
| 2169 |
obss/turkish-question-generation
Automated question generation and question answering from Turkish texts... |
|
Emerging |
| 2170 |
ntropy-network/enrichment_models
This repository benchmark Ntropy API against different Large Language Models... |
|
Emerging |
| 2171 |
Utshav-paudel/LLM-Zero-to-Hero
This repo contains the resources, projects and documentation of mine while... |
|
Emerging |
| 2172 |
dsdanielpark/hf-transllm
LLMtranslator translates and generates text in multiple languages. |
|
Emerging |
| 2173 |
Kagamma/llama-pas
Free Pascal bindings for llama.cpp |
|
Emerging |
| 2174 |
qiqiApink/MotionGPT
The official PyTorch implementation of the paper "MotionGPT: Finetuned LLMs... |
|
Emerging |
| 2175 |
vipulraheja/coedit
Official implementation of the paper "CoEdIT: Text Editing by Task-Specific... |
|
Emerging |
| 2176 |
katanaml/table-query-model
Table Query with ML |
|
Emerging |
| 2177 |
Riko0/messenger_logger_callback
messenger-logger-callback — Send ML training logs to Telegram. Standalone... |
|
Emerging |
| 2178 |
luiskugel/AI-Writing-Assistant-for-Thunderbird
A Thunderbird extension that helps improve your email writing using various... |
|
Emerging |
| 2179 |
Phildram1/myantfarm-ai
Multi-Agent LLM Orchestration for High-Quality Incident Response - 100%... |
|
Emerging |
| 2180 |
LostBeard/SpawnDev.BlazorJS.TransformersJS
Use Transformers.js from Blazor WebAssembly to run pretrained models with... |
|
Emerging |
| 2181 |
Kirill-Kravtsov/drophead-pytorch
An implementation of drophead regularization for pytorch transformers |
|
Emerging |
| 2182 |
iboing/CorDA
CorDA: Context-Oriented Decomposition Adaptation of Large Language Models... |
|
Emerging |
| 2183 |
rohit901/VANE-Bench
[NAACL'25] Contains code and documentation for our VANE-Bench paper. |
|
Emerging |
| 2184 |
baldoarbol/BodyShapeGPT
Fine-tuned LLMs generate accurate 3D human avatars from textual descriptions... |
|
Emerging |
| 2185 |
black-roland/homeassistant-cloud-ru-ai
Cloud.ru Foundation Models — cloud-based AI assistants for Home Assistant |
|
Emerging |
| 2186 |
pdaicode/awesome-LLMs-finetuning
Collection of resources for finetuning Large Language Models (LLMs). |
|
Emerging |
| 2187 |
naity/finetune-esm
Scalable Protein Language Model Finetuning with Distributed Learning and... |
|
Emerging |
| 2188 |
yinizhilian/ICLR2025-Papers-with-Code
历年ICLR论文和开源项目合集,包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025. |
|
Emerging |
| 2189 |
hscspring/llama.np
Inference Llama/Llama2/Llama3 Modes in NumPy |
|
Emerging |
| 2190 |
samestrin/llm-newsletter-generator
llm-newsletter-generator transforms a valid RSS feed into a "Newsletter"... |
|
Emerging |
| 2191 |
Roboflow-Universe/finetune-RF-DETR
Modular CLI pipeline for fine‑tuning RF‑DETR object detection models on... |
|
Emerging |
| 2192 |
shinomakoi/magi_llm_gui
A Qt GUI for large language models |
|
Emerging |
| 2193 |
zzz47zzz/codebase-for-incremental-learning-with-llm
[ACL2024] A Codebase for Incremental Learning with Large Language Models;... |
|
Emerging |
| 2194 |
princeton-pli/AdaptMI
[COLM 2025] Adaptive Skill-based In-context Math Instruction for Small... |
|
Emerging |
| 2195 |
prajjwal1/generalize_lm_nli
Code for the paper EMNLP 2021 workshop paper "Generalization in NLI: Ways... |
|
Emerging |
| 2196 |
dmis-lab/Outlier-Safe-Pre-Training
[ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large... |
|
Emerging |
| 2197 |
botisan-ai/sentence-transformers.js
Run sentence-transformers (SBERT) compatible models in Node.js or browser. |
|
Emerging |
| 2198 |
hao-ai-lab/d3LLM
d3LLM: Ultra-Fast Diffusion LLM 🚀 |
|
Emerging |
| 2199 |
amin-tehrani/ollama-colab
Serve Ollama LLMs on Google Colab (free plan) using Ngrok |
|
Emerging |
| 2200 |
Zalexanninev15/GetFreeChat
Automatic collection of free instances of AI text models (ChatGPT, Claude,... |
|
Emerging |