All Transformer Models
6,429 models ranked by quality score · Page 4 of 65
| # | Model | Score | Tier |
|---|---|---|---|
| 301 |
modelscope/easydistill
a toolkit on knowledge distillation for large language models |
|
Established |
| 302 |
potamides/DeTikZify
Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ. |
|
Established |
| 303 |
sovit-123/vision_transformers
Vision Transformers for image classification, image segmentation, and object... |
|
Established |
| 304 |
keith2018/TinyGPT
Tiny C++ LLM inference implementation from scratch |
|
Established |
| 305 |
Nicolepcx/Transformers-in-Action
This is the corresponding code for the book Transformers in Action |
|
Established |
| 306 |
guanwei49/LogLLM
LogLLM: Log-based Anomaly Detection Using Large Language Models (system log... |
|
Established |
| 307 |
tjake/Jlama
Jlama is a modern LLM inference engine for Java |
|
Established |
| 308 |
BioinfoMachineLearning/DeepInteract
A geometric deep learning framework (Geometric Transformers) for predicting... |
|
Established |
| 309 |
GeeeekExplorer/nano-vllm
Nano vLLM |
|
Established |
| 310 |
jax-ml/jax-llm-examples
Minimal yet performant LLM examples in pure JAX |
|
Established |
| 311 |
hscspring/hcgf
Humanable Chat Generative-model Fine-tuning | LLM微调 |
|
Established |
| 312 |
MattyB95/Jabberjay
🦜 Synthetic Voice Detection |
|
Established |
| 313 |
tue-mps/eomt
[CVPR 2025 Highlight] Official code and models for Encoder-only Mask... |
|
Established |
| 314 |
kyegomez/zeta
Build high-performance AI models with modular building blocks |
|
Established |
| 315 |
SKTBrain/KoBERT
Korean BERT pre-trained cased (KoBERT) |
|
Established |
| 316 |
IbrahimSobh/llms
Large Language Models: In this repository Language models are introduced... |
|
Established |
| 317 |
CASE-Lab-UMD/LLM-Drop
The official implementation of the paper "Uncovering the Redundancy in... |
|
Established |
| 318 |
microsoft/vidur
A large-scale simulation framework for LLM inference |
|
Established |
| 319 |
fla-org/flame
🔥 A minimal training framework for scaling FLA models |
|
Established |
| 320 |
sb-ai-lab/RePlay
A Comprehensive Framework for Building End-to-End Recommendation Systems... |
|
Established |
| 321 |
zhenye234/LLaSA_training
LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis |
|
Established |
| 322 |
xrsrke/toolformer
Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools |
|
Established |
| 323 |
IBM/TabFormer
Code & Data for "Tabular Transformers for Modeling Multivariate Time Series"... |
|
Established |
| 324 |
facebookresearch/LayerSkip
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative... |
|
Established |
| 325 |
sinanuozdemir/oreilly-hands-on-gpt-llm
Mastering the Art of Scalable and Efficient AI Model Deployment |
|
Established |
| 326 |
bobazooba/xllm
🦖 X—LLM: Cutting Edge & Easy LLM Finetuning |
|
Established |
| 327 |
VectorInstitute/vector-inference
Efficient LLM inference on Slurm clusters. |
|
Established |
| 328 |
oripress/AlgoTune
AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and... |
|
Established |
| 329 |
kyegomez/SingLoRA
This repository provides a minimal, single-file implementation of SingLoRA... |
|
Established |
| 330 |
alibaba/InferSim
A Lightweight LLM Inference Performance Simulator |
|
Established |
| 331 |
FareedKhan-dev/train-llm-from-scratch
A straightforward method for training your LLM, from downloading data to... |
|
Established |
| 332 |
foundation-model-stack/fms-fsdp
🚀 Efficiently (pre)training foundation models with native PyTorch features,... |
|
Established |
| 333 |
fluxions-ai/vui
100M parameter lightweight conversational text-to-speech model with breaths,... |
|
Established |
| 334 |
yoshoku/llama_cpp.rb
llama_cpp.rb provides Ruby bindings for llama.cpp |
|
Established |
| 335 |
TsinghuaC3I/MARTI
A Framework for LLM-based Multi-Agent Reinforced Training and Inference |
|
Established |
| 336 |
Leeroo-AI/mergoo
A library for easily merging multiple LLM experts, and efficiently train the... |
|
Established |
| 337 |
thu-nics/C2C
[ICLR'26] The official code implementation for "Cache-to-Cache: Direct... |
|
Established |
| 338 |
thammegowda/nllb-serve
Meta's "No Language Left Behind" models served as web app and REST API |
|
Established |
| 339 |
yuriwa/crewai-sheets-ui
Use google sheets as a gui for crewAI |
|
Established |
| 340 |
OpenVoiceOS/ovos-audio-transformer-plugin-ggwave
data over sound plugin |
|
Established |
| 341 |
young-geng/scalax
A simple library for scaling up JAX programs |
|
Established |
| 342 |
Denis2054/Transformers-for-NLP-2nd-Edition
Transformer models from BERT to GPT-4, environments from Hugging Face to... |
|
Established |
| 343 |
kyegomez/LongNet
Implementation of plug in and play Attention from "LongNet: Scaling... |
|
Established |
| 344 |
NLPOptimize/flash-tokenizer
EFFICIENT AND OPTIMIZED TOKENIZER ENGINE FOR LLM INFERENCE SERVING |
|
Established |
| 345 |
kossisoroyce/timber
Ollama for classical ML models. AOT compiler that turns XGBoost, LightGBM,... |
|
Established |
| 346 |
grammarly/gector
Official implementation of the papers "GECToR – Grammatical Error... |
|
Established |
| 347 |
tanyuqian/redco
NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A... |
|
Established |
| 348 |
Nicolepcx/transformers-the-definitive-guide
This is the official repository for the book Transformers - The Definitive Guide |
|
Established |
| 349 |
monologg/KoELECTRA
Pretrained ELECTRA Model for Korean |
|
Established |
| 350 |
ikergarcia1996/Easy-Translate
Easy-Translate is a script for translating large text files with a SINGLE... |
|
Established |
| 351 |
kyegomez/LIMoE
Implementation of the "the first large-scale multimodal mixture of experts... |
|
Established |
| 352 |
cdqa-suite/cdQA
⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System. |
|
Established |
| 353 |
rasbt/LLM-workshop-2024
A 4-hour coding workshop to understand how LLMs are implemented and used |
|
Established |
| 354 |
AI-Hypercomputer/JetStream
JetStream is a throughput and memory optimized engine for LLM inference on... |
|
Established |
| 355 |
ai-forever/ru-gpts
Russian GPT3 models. |
|
Established |
| 356 |
fixie-ai/ultravox
A fast multimodal LLM for real-time voice |
|
Established |
| 357 |
tylerelyt/LLM-Workshop
🌟 Learn Large Language Model development through hands-on projects and... |
|
Established |
| 358 |
jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need". |
|
Established |
| 359 |
abhimishra91/transformers-tutorials
Github repo with tutorials to fine tune transformers for diff NLP tasks |
|
Established |
| 360 |
bshao001/ChatLearner
A chatbot implemented in TensorFlow based on the seq2seq model, with certain... |
|
Established |
| 361 |
alephpi/Texo-web
The web application for Texo, a minimalist SOTA LaTeX OCR model which... |
|
Established |
| 362 |
tensorgi/TPA
[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6)... |
|
Established |
| 363 |
monologg/JointBERT
Pytorch implementation of JointBERT: "BERT for Joint Intent Classification... |
|
Established |
| 364 |
kennethleungty/Llama-2-Open-Source-LLM-CPU-Inference
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A |
|
Established |
| 365 |
cure-lab/LTSF-Linear
[AAAI-23 Oral] Official implementation of the paper "Are Transformers... |
|
Established |
| 366 |
vitoplantamura/OnnxStream
Lightweight inference library for ONNX files, written in C++. It can run... |
|
Established |
| 367 |
jeya-maria-jose/Medical-Transformer
Official Pytorch Code for "Medical Transformer: Gated Axial-Attention for... |
|
Established |
| 368 |
OscarKjell/text
Using Transformers from HuggingFace in R |
|
Established |
| 369 |
Rishit-dagli/Fast-Transformer
An implementation of Additive Attention |
|
Established |
| 370 |
kyegomez/PALI3
Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS:... |
|
Established |
| 371 |
GURPREETKAURJETHRA/END-TO-END-GENERATIVE-AI-PROJECTS
End to End Generative AI Industry Projects on LLM Models with... |
|
Established |
| 372 |
daviddaytw/react-native-transformers
Run local LLM from Huggingface in React-Native or Expo using onnxruntime. |
|
Established |
| 373 |
symfony/ai-platform
PHP library for interacting with AI platform provider. |
|
Established |
| 374 |
helpmefindaname/transformer-smaller-training-vocab
Temporary remove unused tokens during training to save ram and speed. |
|
Established |
| 375 |
pszemraj/textsum
CLI & Python API to easily summarize text-based files with transformers |
|
Established |
| 376 |
shreyansh26/Annotated-ML-Papers
Annotations of the interesting ML papers I read |
|
Established |
| 377 |
lonePatient/Bert-Multi-Label-Text-Classification
This repo contains a PyTorch implementation of a pretrained BERT model for... |
|
Established |
| 378 |
NVlabs/OmniVinci
OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and... |
|
Established |
| 379 |
kmeng01/rome
Locating and editing factual associations in GPT (NeurIPS 2022) |
|
Established |
| 380 |
showlab/Show-o
[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer... |
|
Established |
| 381 |
CVHub520/X-AnyLabeling-Server
A Simple, Lightweight, and Extensible Serving Framework for X-AnyLabeling |
|
Established |
| 382 |
kyegomez/RT-X
Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open... |
|
Established |
| 383 |
tensorops/TransformerX
Flexible Python library providing building blocks (layers) for reproducible... |
|
Established |
| 384 |
opendilab/LightRFT
LightRFT: Light, Efficient, Omni-modal & Reward-model Driven Reinforcement... |
|
Established |
| 385 |
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from... |
|
Established |
| 386 |
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models.... |
|
Established |
| 387 |
chanind/frame-semantic-transformer
Frame Semantic Parser based on T5 and FrameNet |
|
Established |
| 388 |
EleutherAI/knowledge-neurons
A library for finding knowledge neurons in pretrained transformer models. |
|
Established |
| 389 |
pengzhangzhi/Open-dLLM
Open diffusion language model for code generation — releasing pretraining,... |
|
Established |
| 390 |
cruiseresearchgroup/SensorLLM
[EMNLP 2025] Official implementation of "SensorLLM: Aligning Large Language... |
|
Established |
| 391 |
alesanfra/toons
A high-performance TOON (Token Oriented Object Notation) parser and... |
|
Established |
| 392 |
kyegomez/SwitchTransformers
Implementation of Switch Transformers from the paper: "Switch Transformers:... |
|
Established |
| 393 |
MDGrey33/pyvisionai
The PyVisionAI Official Repo |
|
Established |
| 394 |
qcri/LLMeBench
Benchmarking Large Language Models |
|
Established |
| 395 |
BeRo1985/pasllm
PasLLM - LLM inference engine in Object Pascal (synced from my private work... |
|
Established |
| 396 |
ridgerchu/matmulfreellm
Implementation for MatMul-free LM. |
|
Established |
| 397 |
EfficientMoE/MoE-Infinity
PyTorch library for cost-effective, fast and easy serving of MoE models. |
|
Established |
| 398 |
HPAI-BSC/TuRTLe
TuRTLe: A Unified Evaluation of LLMs for RTL Generation 🐢 (MLCAD 2025) |
|
Established |
| 399 |
jina-ai/rungpt
An open-source cloud-native of large multi-modal models (LMMs) serving framework. |
|
Established |
| 400 |
Rishit-dagli/Perceiver
Implementation of Perceiver, General Perception with Iterative Attention |
|
Established |