All Transformer Models
6,427 models ranked by quality score · Page 3 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 201 |
OpenVoiceOS/ovos-audio-transformer-plugin-ggwave
data over sound plugin |
|
Established |
| 202 |
ScrapeGraphAI/toonify
Toonify: Compact data format reducing LLM token usage by 30-60% |
|
Established |
| 203 |
PRIME-RL/TTRL
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning |
|
Established |
| 204 |
nerdai/llms-from-scratch-rs
A comprehensive Rust translation of the code from Sebastian Raschka's Build... |
|
Established |
| 205 |
avikumart/LLM-GenAI-Transformers-Notebooks
An repository containing all the LLM notebooks with tutorial and projects |
|
Established |
| 206 |
mgonzs13/llama_ros
llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2 |
|
Established |
| 207 |
TharinduDR/TransQuest
Transformer based translation quality estimation |
|
Established |
| 208 |
jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need". |
|
Established |
| 209 |
PacktPublishing/Mastering-NLP-from-Foundations-to-LLMs
Mastering NLP from Foundations to LLMs, Published by Packt |
|
Established |
| 210 |
explosion/curated-transformers
🤖 A PyTorch library of curated Transformer models and their composable components |
|
Established |
| 211 |
ai-decentralized/BloomBee
Decentralized LLMs fine-tuning and inference with offloading |
|
Established |
| 212 |
SalesforceAIResearch/uni2ts
Unified Training of Universal Time Series Forecasting Transformers |
|
Established |
| 213 |
ServiceNow/TACTiS
TACTiS-2: Better, Faster, Simpler Attentional Copulas for Multivariate Time... |
|
Established |
| 214 |
fixie-ai/ultravox
A fast multimodal LLM for real-time voice |
|
Established |
| 215 |
helpmefindaname/transformer-smaller-training-vocab
Temporary remove unused tokens during training to save ram and speed. |
|
Established |
| 216 |
google/deepconsensus
DeepConsensus uses gap-aware sequence transformers to correct errors in... |
|
Established |
| 217 |
stanfordnlp/axbench
Stanford NLP Python library for benchmarking the utility of LLM... |
|
Established |
| 218 |
UKPLab/gpl
Powerful unsupervised domain adaptation method for dense retrieval. Requires... |
|
Established |
| 219 |
mindspore-lab/step_into_llm
MindSpore online courses: Step into LLM |
|
Established |
| 220 |
alesanfra/toons
A high-performance TOON (Token Oriented Object Notation) parser and... |
|
Established |
| 221 |
adithya-s-k/AI-Engineering.academy
Mastering Applied AI, One Concept at a Time |
|
Established |
| 222 |
jsksxs360/How-to-use-Transformers
Transformers 库快速入门教程 |
|
Established |
| 223 |
huggingface/transformers.js-examples
A collection of 🤗 Transformers.js demos and example applications |
|
Established |
| 224 |
dvgodoy/FineTuningLLMs
Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with... |
|
Established |
| 225 |
moment-timeseries-foundation-model/moment
MOMENT: A Family of Open Time-series Foundation Models, ICML'24 |
|
Established |
| 226 |
ridgerchu/matmulfreellm
Implementation for MatMul-free LM. |
|
Established |
| 227 |
Omid-Nejati/MedViTV2
MedViTV2: Medical Image Classification with KAN-Integrated Transformers and... |
|
Established |
| 228 |
minggnim/nlp-models
A repository for training transformer based models |
|
Established |
| 229 |
yjg30737/pyqt-openai
VividNode: Multi-purpose Text & Image Generation Desktop Chatbot (supporting... |
|
Established |
| 230 |
ruanchaves/hashformers
Accurate word segmentation for hashtags and text, powered by Transformers... |
|
Established |
| 231 |
serge-chat/serge
A web interface for chatting with Alpaca through llama.cpp. Fully... |
|
Established |
| 232 |
ggml-org/llama.vscode
VS Code extension for LLM-assisted code/text completion |
|
Established |
| 233 |
kyegomez/MambaTransformer
Integrating Mamba/SSMs with Transformer for Enhanced Long Context and... |
|
Established |
| 234 |
hyunwoongko/nanoRLHF
nanoRLHF: from-scratch journey into how LLMs and RLHF really work. |
|
Established |
| 235 |
SafeAILab/EAGLE
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and... |
|
Established |
| 236 |
tattn/LocalLLMClient
Swift package to run local LLMs on iOS, macOS, Linux |
|
Established |
| 237 |
Strvm/meta-ai-api
Llama 3 API 70B & 405B (MetaAI Reverse Engineered) |
|
Established |
| 238 |
higgsfield-ai/higgsfield
Fault-tolerant, highly scalable GPU orchestration, and a machine learning... |
|
Emerging |
| 239 |
iusztinpaul/hands-on-llms
🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training,... |
|
Emerging |
| 240 |
lucidrains/alphagenome
Implementation of AlphaGenome, Deepmind's updated genomic attention model |
|
Emerging |
| 241 |
IntelLabs/nlp-architect
A model library for exploring state-of-the-art deep learning topologies and... |
|
Emerging |
| 242 |
mukel/llama3.java
Practical Llama 3 inference in Java |
|
Emerging |
| 243 |
bodaay/HuggingFaceModelDownloader
Simple go utility to download HuggingFace Models and Datasets |
|
Emerging |
| 244 |
abelriboulot/onnxt5
Summarization, translation, sentiment-analysis, text-generation and more at... |
|
Emerging |
| 245 |
yuanzhoulvpi2017/zero_nlp
中文nlp解决方案(大模型、数据、模型、训练、推理) |
|
Emerging |
| 246 |
louisfb01/start-llms
A complete guide to start and improve your LLM skills in 2026 with little... |
|
Emerging |
| 247 |
intel/ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM,... |
|
Emerging |
| 248 |
KimMeen/Time-LLM
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting... |
|
Emerging |
| 249 |
sapientinc/HRM
Hierarchical Reasoning Model Official Release |
|
Emerging |
| 250 |
CLUEbenchmark/CLUE
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets,... |
|
Emerging |
| 251 |
galilai-group/stable-pretraining
Reliable, minimal and scalable library for pretraining foundation and world models |
|
Emerging |
| 252 |
kossisoroyce/timber
Ollama for classical ML models. AOT compiler that turns XGBoost, LightGBM,... |
|
Emerging |
| 253 |
kyegomez/Jamba
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model" |
|
Emerging |
| 254 |
kyegomez/MambaByte
Implementation of MambaByte in "MambaByte: Token-free Selective State Space... |
|
Emerging |
| 255 |
maziyarpanahi/openmed
open-source healthcare ai |
|
Emerging |
| 256 |
DashyDashOrg/pandas-llm
Pandas-LLM |
|
Emerging |
| 257 |
AXERA-TECH/ax-llm
Explore LLM model deployment based on AXera's AI chips |
|
Emerging |
| 258 |
jhkchan/translategemma-cli
Local CLI for Google's TranslateGemma translation models with multi-platform... |
|
Emerging |
| 259 |
TIGER-AI-Lab/MMLU-Pro
The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task... |
|
Emerging |
| 260 |
ZHZisZZ/dllm
dLLM: Simple Diffusion Language Modeling |
|
Emerging |
| 261 |
multimodal-art-projection/YuE
YuE: Open Full-song Music Generation Foundation Model, something similar to... |
|
Emerging |
| 262 |
telekom/mltb2
Machine Learning Toolbox 2 |
|
Emerging |
| 263 |
dbiir/UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo |
|
Emerging |
| 264 |
kyegomez/LFM
An open source implementation of LFMs from Liquid AI: Liquid Foundation Models |
|
Emerging |
| 265 |
eth-sri/matharena
Evaluation of LLMs on latest math competitions |
|
Emerging |
| 266 |
ddh0/easy-llama
Python package wrapping llama.cpp for on-device LLM inference |
|
Emerging |
| 267 |
TIGER-AI-Lab/VLM2Vec
This repo contains the code for "VLM2Vec: Training Vision-Language Models... |
|
Emerging |
| 268 |
edwko/OuteTTS
Interface for OuteTTS models. |
|
Emerging |
| 269 |
DadaNanjesha/AI-Text-Humanizer-App
Transform AI-generated text into formal, human-like, and academic writing... |
|
Emerging |
| 270 |
UdbhavPrasad072300/Transformer-Implementations
Library - Vanilla, ViT, DeiT, BERT, GPT |
|
Emerging |
| 271 |
ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs) |
|
Emerging |
| 272 |
Facico/Chinese-Vicuna
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model ——... |
|
Emerging |
| 273 |
ggml-org/llama.vim
Vim plugin for LLM-assisted code/text completion |
|
Emerging |
| 274 |
guinmoon/LLMFarm
llama and other large language models on iOS and MacOS offline using GGML library. |
|
Emerging |
| 275 |
lone-cloud/gerbil
A desktop app for running Large Language Models locally. |
|
Emerging |
| 276 |
tensorchord/modelz-llm
OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and... |
|
Emerging |
| 277 |
socialfoundations/folktexts
Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on... |
|
Emerging |
| 278 |
google-deepmind/long-form-factuality
Benchmarking long-form factuality in large language models. Original code... |
|
Emerging |
| 279 |
MadryLab/context-cite
Attribute (or cite) statements generated by LLMs back to in-context information. |
|
Emerging |
| 280 |
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and... |
|
Emerging |
| 281 |
megagonlabs/ginza-transformers
Use custom tokenizers in spacy-transformers |
|
Emerging |
| 282 |
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT |
|
Emerging |
| 283 |
AdityaNG/kan-gpt
The PyTorch implementation of Generative Pre-trained Transformers (GPTs)... |
|
Emerging |
| 284 |
datawhalechina/llms-from-scratch-cn
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理 |
|
Emerging |
| 285 |
CASE-Lab-UMD/LLM-Drop
The official implementation of the paper "Uncovering the Redundancy in... |
|
Emerging |
| 286 |
autonomousvision/transfuser
[PAMI'23] TransFuser: Imitation with Transformer-Based Sensor Fusion for... |
|
Emerging |
| 287 |
yotambraun/APDTFlow
APDTFlow is a modern and extensible forecasting framework for time series... |
|
Emerging |
| 288 |
AI-Hypercomputer/JetStream
JetStream is a throughput and memory optimized engine for LLM inference on... |
|
Emerging |
| 289 |
kyegomez/attn_res
A clean, single-file PyTorch implementation of Attention Residuals (Kimi... |
|
Emerging |
| 290 |
BiomedSciAI/biomed-multi-omic
Build foundation model for RNA or DNA data |
|
Emerging |
| 291 |
mirpo/fastapi-gen
Build LLM-enabled FastAPI applications without build configuration. |
|
Emerging |
| 292 |
beehive-lab/GPULlama3.java
GPU-accelerated Llama3.java inference in pure Java using TornadoVM. |
|
Emerging |
| 293 |
MiniMax-AI/MiniMax-01
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model... |
|
Emerging |
| 294 |
belladoreai/llama3-tokenizer-js
JS tokenizer for LLaMA 3 and LLaMA 3.1 |
|
Emerging |
| 295 |
NiuTrans/LaTeXTrans
A tool for translating the content of LaTeX documents into various other... |
|
Emerging |
| 296 |
LoicGrobol/zeldarose
Train transformer-based models. |
|
Emerging |
| 297 |
Kohulan/DECIMER-Image_Transformer
DECIMER Image Transformer is a deep-learning-based tool designed for... |
|
Emerging |
| 298 |
YerbaPage/LongCodeZip
LongCodeZip: Compress Long Context for Code Language Models [ASE2025] |
|
Emerging |
| 299 |
haizelabs/verdict
Inference-time scaling for LLMs-as-a-judge. |
|
Emerging |
| 300 |
zjunlp/EasyInstruct
[ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs. |
|
Emerging |