All Transformer Models

6,427 models ranked by quality score · Page 2 of 65

Showing 101–200 of 6,427
# Model Score Tier
101 NX-AI/xlstm

Official repository of the xLSTM.

63
Established
102 inseq-team/inseq

Interpretability for sequence generation models 🐛 🔍

63
Established
103 csinva/imodelsX

Interpret text data with LLMs (sklearn compatible).

63
Established
104 EricLBuehler/mistral.rs

Fast, flexible LLM inference

62
Established
105 sauravpanda/BrowserAI

Run local LLMs like llama, deepseek-distill, kokoro and more inside your browser

62
Established
106 NVlabs/MambaVision

[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid...

62
Established
107 lucidrains/dreamer4

Implementation of Danijar's latest iteration for his Dreamer line of work

62
Established
108 NVIDIA/sphinx-llm

LLM extensions for Sphinx Documentation

62
Established
109 RBLN-SW/optimum-rbln

⚡ A seamless integration of HuggingFace Transformers & Diffusers with RBLN...

61
Established
110 sintel-dev/sigllm

Using Large Language Models for Time Series Anomaly Detection

61
Established
111 cyberchitta/llm-context.py

Share code with LLMs via Model Context Protocol or clipboard. Rule-based...

61
Established
112 huggingface/alignment-handbook

Robust recipes to align language models with human and AI preferences

61
Established
113 hassancs91/SimplerLLM

Simplify interactions with Large Language Models

61
Established
114 deeppavlov/AutoIntent

Automated machine learning for text classification

60
Established
115 jncraton/languagemodels

Explore large language models in 512MB of RAM

60
Established
116 Michael-A-Kuykendall/shimmy

⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF +...

60
Established
117 UbiquitousLearning/mllm

Fast Multimodal LLM on Mobile Devices

60
Established
118 om-ai-lab/VLM-R1

Solve Visual Understanding with Reinforced VLMs

60
Established
119 skyzh/tiny-llm

A course of learning LLM inference serving on Apple Silicon for systems...

60
Established
120 kaito-project/aikit

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

60
Established
121 zjunlp/EasyEdit

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

60
Established
122 stas00/ml-engineering

Machine Learning Engineering Open Book

60
Established
123 mybigday/llama.rn

React Native binding of llama.cpp

60
Established
124 ModelTC/LightCompress

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models...

60
Established
125 poloclub/transformer-explainer

Transformer Explained Visually: Learn How LLM Transformer Models Work with...

59
Established
126 FastFlowLM/FastFlowLM

Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but...

59
Established
127 arcee-ai/mergekit

Tools for merging pretrained large language models.

59
Established
128 structuredllm/syncode

Efficient and general syntactical decoding for Large Language Models

59
Established
129 zhihu/ZhiLight

A highly optimized LLM inference acceleration engine for Llama and its variants.

59
Established
130 OptimalScale/LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation...

59
Established
131 changyeyu/LLM-RL-Visualized

🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps )

58
Established
132 peremartra/optipfair

Structured pruning and bias visualization for Large Language Models. Tools...

58
Established
133 eole-nlp/eole

Open language modeling toolkit based on PyTorch

58
Established
134 lucidrains/simple-hierarchical-transformer

Experiments around a simple idea for inducing multiple hierarchical...

58
Established
135 roboflow/maestro

streamline the fine-tuning process for multimodal models: PaliGemma 2,...

58
Established
136 argosopentech/argos-translate

Open-source offline translation library written in Python

58
Established
137 EleutherAI/gpt-neox

An implementation of model parallel autoregressive transformers on GPUs,...

58
Established
138 BlinkDL/RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can...

57
Established
139 azukds/tubular

Python package implementing ML feature engineering and pre-processing for...

57
Established
140 HandsOnLLM/Hands-On-Large-Language-Models

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

57
Established
141 kyegomez/BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language...

57
Established
142 huggingface/optimum-habana

Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)

57
Established
143 clusterzx/paperless-ai

An automated document analyzer for Paperless-ngx using OpenAI API, Ollama,...

57
Established
144 mlabonne/llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

57
Established
145 microsoft/unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

57
Established
146 xaviviro/python-toon

🐍 TOON for Python (Token-Oriented Object Notation) Encoder/Decoder - Reduce...

57
Established
147 NexaAI/nexa-sdk

Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and...

57
Established
148 thu-ml/SageAttention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves...

57
Established
149 sign-language-translator/sign-language-translator

Python library & framework to build custom translators for the...

57
Established
150 nyu-mll/jiant

jiant is an nlp toolkit

56
Established
151 scaleapi/llm-engine

Scale LLM Engine public repository

56
Established
152 kyegomez/LFM2

A simple and minimal open source implementation of "Introducing LFM2: The...

56
Established
153 NVIDIA-NeMo/Automodel

Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging...

56
Established
154 OpenNMT/CTranslate2

Fast inference engine for Transformer models

56
Established
155 BradyFU/Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

56
Established
156 niedev/RTranslator

Open source real-time translation app for Android that runs locally

56
Established
157 microsoft/mup

maximal update parametrization (µP)

56
Established
158 mistralai/mistral-inference

Official inference library for Mistral models

56
Established
159 labmlai/annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side...

56
Established
160 rickiepark/nlp-with-transformers

<트랜스포머를 활용한 자연어 처리> 예제 코드를 위한 저장소입니다.

56
Established
161 Picovoice/picollm

On-device LLM Inference Powered by X-Bit Quantization

55
Established
162 ManuelSLemos/RabbitLLM

Run 70B+ LLMs on a single 4GB GPU — no quantization required.

55
Established
163 explosion/spacy-llm

🦙 Integrating LLMs into structured NLP pipelines

55
Established
164 fashn-AI/fashn-human-parser

Human parsing model for fashion and virtual try-on applications

55
Established
165 b4rtaz/distributed-llama

Distributed LLM inference. Connect home devices into a powerful cluster to...

55
Established
166 Freed-Wu/translate-shell

Translate text by google, bing, youdaozhiyun, haici, stardict, openai, large...

55
Established
167 CrazyBoyM/llama3-Chinese-chat

Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。

55
Established
168 BeastByteAI/scikit-llm

Seamlessly integrate LLMs into scikit-learn.

55
Established
169 NVIDIA/kvpress

LLM KV cache compression made easy

55
Established
170 jakobdylanc/llmcord

Make Discord your LLM frontend - Supports any OpenAI compatible API (Ollama,...

54
Established
171 GradientHQ/parallax

Parallax is a distributed model serving framework that lets you build your...

54
Established
172 TinyLLaVA/TinyLLaVA_Factory

A Framework of Small-scale Large Multimodal Models

54
Established
173 mdsrqbl/omnihuman

AI model that understands text & humanoids.

54
Established
174 label-sleuth/label-sleuth

Open source no-code system for text annotation and building of text classifiers

54
Established
175 nrl-ai/llama-assistant

AI-powered assistant to help you with your daily tasks, powered by Llama 3,...

54
Established
176 cheahjs/free-llm-api-resources

A list of free LLM inference resources accessible via API.

54
Established
177 Tiiny-AI/PowerInfer

High-speed Large Language Model Serving for Local Deployment

54
Established
178 analyticalrohit/AI-ML-Cheatsheets

All Stanford Cheatsheets: Artificial Intelligence, Transformers, LLMs, Deep...

54
Established
179 OpenMachine-ai/transformer-tricks

A collection of tricks and tools to speed up transformer models

54
Established
180 quic/efficient-transformers

This library empowers users to seamlessly port pretrained models and...

54
Established
181 peremartra/Large-Language-Model-Notebooks-Course

Practical course about Large Language Models.

54
Established
182 ericmjl/llamabot

Pythonic class-based interface to LLMs

54
Established
183 albertan017/LLM4Decompile

Reverse Engineering: Decompiling Binary Code with Large Language Models

54
Established
184 Shivanandroy/simpleT5

simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets...

54
Established
185 huggingface/audio-transformers-course

The Hugging Face Course on Transformers for Audio

54
Established
186 MattyB95/Jabberjay

🦜 Synthetic Voice Detection

53
Established
187 sgl-project/ome

Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU...

53
Established
188 muxi-ai/onellm

Unified interface for interacting with various LLMs hundreds of models,...

53
Established
189 ServerlessLLM/ServerlessLLM

Serverless LLM Serving for Everyone.

53
Established
190 floneum/floneum

Instant, controllable, local pre-trained AI models in Rust

53
Established
191 underneathall/pinferencia

Python + Inference - Model Deployment library in Python. Simplest model...

53
Established
192 davidpirogov/toon-llm

Token-Oriented Object Notation (TOON) is an LLM-optimized data serialization...

53
Established
193 lucidrains/locoformer

LocoFormer - Generalist Locomotion via Long-Context Adaptation

53
Established
194 avilum/minrlm

Token-efficient Recursive Language Model. 3.6x fewer tokens than vanilla...

53
Established
195 PKU-Alignment/align-anything

Align Anything: Training All-modality Model with Feedback

53
Established
196 shibing624/textgen

TextGen: Implementation of Text Generation models, include LLaMA, BLOOM,...

53
Established
197 GeeeekExplorer/nano-vllm

Nano vLLM

53
Established
198 mlabonne/llm-datasets

Curated list of datasets and tools for post-training.

53
Established
199 HowieHwong/TrustLLM

[ICML 2024] TrustLLM: Trustworthiness in Large Language Models

52
Established
200 Mobile-Artificial-Intelligence/llama_sdk

lcpp is a dart implementation of llama.cpp used by the mobile artificial...

52
Established