All Transformer Models

6,427 models ranked by quality score

Showing 1–100 of 6,427
# Model Score Tier
1 huggingface/transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine...

100
Verified
2 vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

100
Verified
3 sgl-project/sglang

SGLang is a high-performance serving framework for large language models and...

100
Verified
4 unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss,...

94
Verified
5 vllm-project/vllm-omni

A framework for efficient model inference with omni-modality models

93
Verified
6 huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

93
Verified
7 alibaba/MNN

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba,...

93
Verified
8 LMCache/LMCache

Supercharge Your LLM with the Fastest KV Cache Layer

92
Verified
9 AI-Hypercomputer/maxtext

A simple, performant and scalable Jax LLM!

92
Verified
10 modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5,...

91
Verified
11 linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

90
Verified
12 intel/neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity;...

90
Verified
13 bitsandbytes-foundation/bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

90
Verified
14 huggingface/optimum

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and...

90
Verified
15 huggingface/tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

90
Verified
16 xorbitsai/inference

Swap GPT for any LLM by changing a single line of code. Xinference lets you...

89
Verified
17 fla-org/flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models

89
Verified
18 SwanHubX/SwanLab

⚡️SwanLab - an open-source, modern-design AI training tracking and...

89
Verified
19 tensorzero/tensorzero

TensorZero is an open-source stack for industrial-grade LLM applications. It...

89
Verified
20 Blaizzy/mlx-vlm

MLX-VLM is a package for inference and fine-tuning of Vision Language Models...

89
Verified
21 oumi-ai/oumi

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any...

88
Verified
22 openvinotoolkit/nncf

Neural Network Compression Framework for enhanced OpenVINO™ inference

86
Verified
23 Dao-AILab/flash-attention

Fast and memory-efficient exact attention

86
Verified
24 qubvel-org/segmentation_models.pytorch

Semantic segmentation models with 500+ pretrained convolutional and...

84
Verified
25 lyogavin/airllm

AirLLM 70B inference with single 4GB GPU

83
Verified
26 adapter-hub/adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

82
Verified
27 AI4Finance-Foundation/FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We...

82
Verified
28 huggingface/text-generation-inference

Large Language Model Text Generation Inference

82
Verified
29 intel/auto-round

🎯An accuracy-first, highly efficient quantization toolkit for LLMs, designed...

81
Verified
30 microsoft/presidio

An open-source framework for detecting, redacting, masking, and anonymizing...

80
Verified
31 filipstrand/mflux

MLX native implementations of state-of-the-art generative image models

79
Verified
32 lucidrains/x-transformers

A concise but complete full-attention transformer with a set of promising...

79
Verified
33 ModelCloud/GPTQModel

LLM model quantization (compression) toolkit with hw acceleration support...

79
Verified
34 withcatai/node-llama-cpp

Run AI models locally on your machine with node.js bindings for llama.cpp....

79
Verified
35 PaddlePaddle/PaddleNLP

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

79
Verified
36 OpenNMT/OpenNMT-py

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

76
Verified
37 sgl-project/SpecForge

Train speculative decoding models effortlessly and port them smoothly to...

76
Verified
38 PaddlePaddle/FastDeploy

High-performance Inference and Deployment Toolkit for LLMs and VLMs based on...

76
Verified
39 NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

76
Verified
40 ARahim3/mlx-tune

Bringing the Unsloth experience to Mac users via Apple's MLX framework

75
Verified
41 ludwig-ai/ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

75
Verified
42 ThilinaRajapakse/simpletransformers

Transformers for Information Retrieval, Text Classification, NER, QA,...

75
Verified
43 cvs-health/uqlm

UQLM: Uncertainty Quantification for Language Models, is a Python package...

75
Verified
44 ExtensityAI/symbolicai

A neurosymbolic perspective on LLMs

75
Verified
45 pytorch/ao

PyTorch native quantization and sparsity for training and inference

74
Verified
46 mishushakov/llm-scraper

Turn any webpage into structured data using LLMs

74
Verified
47 bentoml/OpenLLM

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible...

74
Verified
48 NVIDIA-Merlin/Transformers4Rec

Transformers4Rec is a flexible and efficient library for sequential and...

73
Verified
49 meta-llama/llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with...

73
Verified
50 cubist38/mlx-openai-server

A high-performance API server that provides OpenAI-compatible endpoints for...

72
Verified
51 agentscope-ai/Trinity-RFT

Trinity-RFT is a general-purpose, flexible and scalable framework designed...

72
Verified
52 amazon-science/chronos-forecasting

Chronos: Pretrained Models for Time Series Forecasting

71
Verified
53 mosaicml/llm-foundry

LLM training code for Databricks foundation models

71
Verified
54 transformerlab/transformerlab-app

The open source research environment for AI researchers to seamlessly train,...

71
Verified
55 MaartenGr/BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

71
Verified
56 rasbt/reasoning-from-scratch

Implement a reasoning LLM in PyTorch from scratch, step by step

71
Verified
57 gpustack/gpustack

Performance-optimized AI inference on your GPUs. Unlock superior throughput...

71
Verified
58 hiyouga/LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

70
Verified
59 webis-de/small-text

Active Learning for Text Classification in Python

70
Verified
60 mudler/LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others....

70
Verified
61 rasbt/LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

69
Established
62 VainF/Torch-Pruning

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision...

69
Established
63 tenstorrent/tt-metal

:metal: TT-NN operator library, and TT-Metalium low level kernel programming model.

69
Established
64 OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on...

69
Established
65 InternLM/lagent

A lightweight framework for building LLM-based agents

69
Established
66 mindspore-lab/mindnlp

MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore...

69
Established
67 THU-BPM/MarkLLM

MarkLLM: An Open-Source Toolkit for LLM Watermarking.(EMNLP 2024 System...

68
Established
68 InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

68
Established
69 leondgarse/keras_cv_attention_models

Keras...

68
Established
70 ModelTC/LightLLM

LightLLM is a Python-based LLM (Large Language Model) inference and serving...

68
Established
71 stochasticai/xTuring

Build, personalize and control your own LLMs. From data pre-processing to...

68
Established
72 SciSharp/LLamaSharp

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

68
Established
73 huggingface/transformers.js

State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly...

68
Established
74 tabularis-ai/be_great

A novel approach for synthesizing tabular data using pretrained large language models

67
Established
75 huggingface/course

The Hugging Face course on Transformers

67
Established
76 zhudotexe/kani

kani (カニ) is a highly hackable microframework for tool-calling language...

67
Established
77 Tongjilibo/bert4torch

An elegent pytorch implement of transformers

67
Established
78 microsoft/LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large...

67
Established
79 Goekdeniz-Guelmez/mlx-lm-lora

Train Large Language Models on MLX.

66
Established
80 jd-opensource/xllm

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

66
Established
81 alibaba/rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

66
Established
82 h2oai/h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs....

66
Established
83 ScalaConsultants/Aspect-Based-Sentiment-Analysis

💭 Aspect-Based-Sentiment-Analysis: Transformer & Explainable ML (TensorFlow)

65
Established
84 mlc-ai/mlc-llm

Universal LLM Deployment Engine with ML Compilation

65
Established
85 p-e-w/heretic

Fully automatic censorship removal for language models

65
Established
86 allenai/dolma

Data and tools for generating and inspecting OLMo pre-training data.

65
Established
87 microsoft/torchscale

Foundation Architecture for (M)LLMs

65
Established
88 shibing624/MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training...

65
Established
89 mostlygeek/llama-swap

Reliable model swapping for any local OpenAI/Anthropic compatible server -...

65
Established
90 jessevig/bertviz

BertViz: Visualize Attention in Transformer Models

65
Established
91 run-llama/LlamaIndexTS

Data framework for your LLM applications. Focus on server side solution

65
Established
92 huggingface/optimum-intel

🤗 Optimum Intel: Accelerate inference with Intel optimization tools

64
Established
93 cel-ai/celai

Open source framework designed to accelerate the development of omnichannel...

64
Established
94 NielsRogge/Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

64
Established
95 lightonai/pylate

Late Interaction Models Training & Retrieval

64
Established
96 Mobile-Artificial-Intelligence/maid

Maid is a free and open source application for interfacing with llama.cpp...

64
Established
97 kha-white/manga-ocr

Optical character recognition for Japanese text, with the main focus being...

64
Established
98 jingyaogong/minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

64
Established
99 kanishkamisra/minicons

Utility for behavioral and representational analyses of Language Models

63
Established
100 bigscience-workshop/petals

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x...

63
Established
1 2 3 63 64 65 Next »