All Transformer Models
6,427 models ranked by quality score · Page 2 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 101 |
NX-AI/xlstm
Official repository of the xLSTM. |
|
Established |
| 102 |
inseq-team/inseq
Interpretability for sequence generation models 🐛 🔍 |
|
Established |
| 103 |
csinva/imodelsX
Interpret text data with LLMs (sklearn compatible). |
|
Established |
| 104 |
EricLBuehler/mistral.rs
Fast, flexible LLM inference |
|
Established |
| 105 |
sauravpanda/BrowserAI
Run local LLMs like llama, deepseek-distill, kokoro and more inside your browser |
|
Established |
| 106 |
NVlabs/MambaVision
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid... |
|
Established |
| 107 |
lucidrains/dreamer4
Implementation of Danijar's latest iteration for his Dreamer line of work |
|
Established |
| 108 |
NVIDIA/sphinx-llm
LLM extensions for Sphinx Documentation |
|
Established |
| 109 |
RBLN-SW/optimum-rbln
⚡ A seamless integration of HuggingFace Transformers & Diffusers with RBLN... |
|
Established |
| 110 |
sintel-dev/sigllm
Using Large Language Models for Time Series Anomaly Detection |
|
Established |
| 111 |
cyberchitta/llm-context.py
Share code with LLMs via Model Context Protocol or clipboard. Rule-based... |
|
Established |
| 112 |
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences |
|
Established |
| 113 |
hassancs91/SimplerLLM
Simplify interactions with Large Language Models |
|
Established |
| 114 |
deeppavlov/AutoIntent
Automated machine learning for text classification |
|
Established |
| 115 |
jncraton/languagemodels
Explore large language models in 512MB of RAM |
|
Established |
| 116 |
Michael-A-Kuykendall/shimmy
⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF +... |
|
Established |
| 117 |
UbiquitousLearning/mllm
Fast Multimodal LLM on Mobile Devices |
|
Established |
| 118 |
om-ai-lab/VLM-R1
Solve Visual Understanding with Reinforced VLMs |
|
Established |
| 119 |
skyzh/tiny-llm
A course of learning LLM inference serving on Apple Silicon for systems... |
|
Established |
| 120 |
kaito-project/aikit
🏗️ Fine-tune, build, and deploy open-source LLMs easily! |
|
Established |
| 121 |
zjunlp/EasyEdit
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs. |
|
Established |
| 122 |
stas00/ml-engineering
Machine Learning Engineering Open Book |
|
Established |
| 123 |
mybigday/llama.rn
React Native binding of llama.cpp |
|
Established |
| 124 |
ModelTC/LightCompress
[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models... |
|
Established |
| 125 |
poloclub/transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with... |
|
Established |
| 126 |
FastFlowLM/FastFlowLM
Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but... |
|
Established |
| 127 |
arcee-ai/mergekit
Tools for merging pretrained large language models. |
|
Established |
| 128 |
structuredllm/syncode
Efficient and general syntactical decoding for Large Language Models |
|
Established |
| 129 |
zhihu/ZhiLight
A highly optimized LLM inference acceleration engine for Llama and its variants. |
|
Established |
| 130 |
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation... |
|
Established |
| 131 |
changyeyu/LLM-RL-Visualized
🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps ) |
|
Established |
| 132 |
peremartra/optipfair
Structured pruning and bias visualization for Large Language Models. Tools... |
|
Established |
| 133 |
eole-nlp/eole
Open language modeling toolkit based on PyTorch |
|
Established |
| 134 |
lucidrains/simple-hierarchical-transformer
Experiments around a simple idea for inducing multiple hierarchical... |
|
Established |
| 135 |
roboflow/maestro
streamline the fine-tuning process for multimodal models: PaliGemma 2,... |
|
Established |
| 136 |
argosopentech/argos-translate
Open-source offline translation library written in Python |
|
Established |
| 137 |
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs,... |
|
Established |
| 138 |
BlinkDL/RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can... |
|
Established |
| 139 |
azukds/tubular
Python package implementing ML feature engineering and pre-processing for... |
|
Established |
| 140 |
HandsOnLLM/Hands-On-Large-Language-Models
Official code repo for the O'Reilly Book - "Hands-On Large Language Models" |
|
Established |
| 141 |
kyegomez/BitNet
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language... |
|
Established |
| 142 |
huggingface/optimum-habana
Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU) |
|
Established |
| 143 |
clusterzx/paperless-ai
An automated document analyzer for Paperless-ngx using OpenAI API, Ollama,... |
|
Established |
| 144 |
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. |
|
Established |
| 145 |
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities |
|
Established |
| 146 |
xaviviro/python-toon
🐍 TOON for Python (Token-Oriented Object Notation) Encoder/Decoder - Reduce... |
|
Established |
| 147 |
NexaAI/nexa-sdk
Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and... |
|
Established |
| 148 |
thu-ml/SageAttention
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves... |
|
Established |
| 149 |
sign-language-translator/sign-language-translator
Python library & framework to build custom translators for the... |
|
Established |
| 150 |
nyu-mll/jiant
jiant is an nlp toolkit |
|
Established |
| 151 |
scaleapi/llm-engine
Scale LLM Engine public repository |
|
Established |
| 152 |
kyegomez/LFM2
A simple and minimal open source implementation of "Introducing LFM2: The... |
|
Established |
| 153 |
NVIDIA-NeMo/Automodel
Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging... |
|
Established |
| 154 |
OpenNMT/CTranslate2
Fast inference engine for Transformer models |
|
Established |
| 155 |
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models |
|
Established |
| 156 |
niedev/RTranslator
Open source real-time translation app for Android that runs locally |
|
Established |
| 157 |
microsoft/mup
maximal update parametrization (µP) |
|
Established |
| 158 |
mistralai/mistral-inference
Official inference library for Mistral models |
|
Established |
| 159 |
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side... |
|
Established |
| 160 |
rickiepark/nlp-with-transformers
<트랜스포머를 활용한 자연어 처리> 예제 코드를 위한 저장소입니다. |
|
Established |
| 161 |
Picovoice/picollm
On-device LLM Inference Powered by X-Bit Quantization |
|
Established |
| 162 |
ManuelSLemos/RabbitLLM
Run 70B+ LLMs on a single 4GB GPU — no quantization required. |
|
Established |
| 163 |
explosion/spacy-llm
🦙 Integrating LLMs into structured NLP pipelines |
|
Established |
| 164 |
fashn-AI/fashn-human-parser
Human parsing model for fashion and virtual try-on applications |
|
Established |
| 165 |
b4rtaz/distributed-llama
Distributed LLM inference. Connect home devices into a powerful cluster to... |
|
Established |
| 166 |
Freed-Wu/translate-shell
Translate text by google, bing, youdaozhiyun, haici, stardict, openai, large... |
|
Established |
| 167 |
CrazyBoyM/llama3-Chinese-chat
Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。 |
|
Established |
| 168 |
BeastByteAI/scikit-llm
Seamlessly integrate LLMs into scikit-learn. |
|
Established |
| 169 |
NVIDIA/kvpress
LLM KV cache compression made easy |
|
Established |
| 170 |
jakobdylanc/llmcord
Make Discord your LLM frontend - Supports any OpenAI compatible API (Ollama,... |
|
Established |
| 171 |
GradientHQ/parallax
Parallax is a distributed model serving framework that lets you build your... |
|
Established |
| 172 |
TinyLLaVA/TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models |
|
Established |
| 173 |
mdsrqbl/omnihuman
AI model that understands text & humanoids. |
|
Established |
| 174 |
label-sleuth/label-sleuth
Open source no-code system for text annotation and building of text classifiers |
|
Established |
| 175 |
nrl-ai/llama-assistant
AI-powered assistant to help you with your daily tasks, powered by Llama 3,... |
|
Established |
| 176 |
cheahjs/free-llm-api-resources
A list of free LLM inference resources accessible via API. |
|
Established |
| 177 |
Tiiny-AI/PowerInfer
High-speed Large Language Model Serving for Local Deployment |
|
Established |
| 178 |
analyticalrohit/AI-ML-Cheatsheets
All Stanford Cheatsheets: Artificial Intelligence, Transformers, LLMs, Deep... |
|
Established |
| 179 |
OpenMachine-ai/transformer-tricks
A collection of tricks and tools to speed up transformer models |
|
Established |
| 180 |
quic/efficient-transformers
This library empowers users to seamlessly port pretrained models and... |
|
Established |
| 181 |
peremartra/Large-Language-Model-Notebooks-Course
Practical course about Large Language Models. |
|
Established |
| 182 |
ericmjl/llamabot
Pythonic class-based interface to LLMs |
|
Established |
| 183 |
albertan017/LLM4Decompile
Reverse Engineering: Decompiling Binary Code with Large Language Models |
|
Established |
| 184 |
Shivanandroy/simpleT5
simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets... |
|
Established |
| 185 |
huggingface/audio-transformers-course
The Hugging Face Course on Transformers for Audio |
|
Established |
| 186 |
MattyB95/Jabberjay
🦜 Synthetic Voice Detection |
|
Established |
| 187 |
sgl-project/ome
Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU... |
|
Established |
| 188 |
muxi-ai/onellm
Unified interface for interacting with various LLMs hundreds of models,... |
|
Established |
| 189 |
ServerlessLLM/ServerlessLLM
Serverless LLM Serving for Everyone. |
|
Established |
| 190 |
floneum/floneum
Instant, controllable, local pre-trained AI models in Rust |
|
Established |
| 191 |
underneathall/pinferencia
Python + Inference - Model Deployment library in Python. Simplest model... |
|
Established |
| 192 |
davidpirogov/toon-llm
Token-Oriented Object Notation (TOON) is an LLM-optimized data serialization... |
|
Established |
| 193 |
lucidrains/locoformer
LocoFormer - Generalist Locomotion via Long-Context Adaptation |
|
Established |
| 194 |
avilum/minrlm
Token-efficient Recursive Language Model. 3.6x fewer tokens than vanilla... |
|
Established |
| 195 |
PKU-Alignment/align-anything
Align Anything: Training All-modality Model with Feedback |
|
Established |
| 196 |
shibing624/textgen
TextGen: Implementation of Text Generation models, include LLaMA, BLOOM,... |
|
Established |
| 197 |
GeeeekExplorer/nano-vllm
Nano vLLM |
|
Established |
| 198 |
mlabonne/llm-datasets
Curated list of datasets and tools for post-training. |
|
Established |
| 199 |
HowieHwong/TrustLLM
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models |
|
Established |
| 200 |
Mobile-Artificial-Intelligence/llama_sdk
lcpp is a dart implementation of llama.cpp used by the mobile artificial... |
|
Established |