The Transformer Directory

Quality-scored directory of 6,427 transformer models, updated daily. Every model scored on maintenance, adoption, maturity, and community signals.

Transformer models and tools for fine-tuning, quantisation, inference optimisation, and deployment of attention-based architectures.

Verified

60

70–100

Established

177

50–69

Emerging

1,626

30–49

Experimental

4,564

10–29

Top models by quality score

# Model Score
1 huggingface/transformers

πŸ€— Transformers: the model-definition framework for state-of-the-art machine...

100
2 vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

100
3 sgl-project/sglang

SGLang is a high-performance serving framework for large language models and...

100
4 unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. πŸ¦₯ Train OpenAI gpt-oss,...

94
5 vllm-project/vllm-omni

A framework for efficient model inference with omni-modality models

93
6 huggingface/peft

πŸ€— PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

93
7 alibaba/MNN

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba,...

93
8 LMCache/LMCache

Supercharge Your LLM with the Fastest KV Cache Layer

92
9 AI-Hypercomputer/maxtext

A simple, performant and scalable Jax LLM!

92
10 modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5,...

91
11 linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

90
12 intel/neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity;...

90
13 bitsandbytes-foundation/bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

90
14 huggingface/optimum

πŸš€ Accelerate inference and training of πŸ€— Transformers, Diffusers, TIMM and...

90
15 huggingface/tokenizers

πŸ’₯ Fast State-of-the-Art Tokenizers optimized for Research and Production

90
16 xorbitsai/inference

Swap GPT for any LLM by changing a single line of code. Xinference lets you...

89
17 fla-org/flash-linear-attention

πŸš€ Efficient implementations of state-of-the-art linear attention models

89
18 SwanHubX/SwanLab

⚑️SwanLab - an open-source, modern-design AI training tracking and...

89
19 tensorzero/tensorzero

TensorZero is an open-source stack for industrial-grade LLM applications. It...

89
20 Blaizzy/mlx-vlm

MLX-VLM is a package for inference and fine-tuning of Vision Language Models...

89

Browse by category

Transformer Architecture Tutorials

267 models

Local LLM Deployment

245 models

LoRA QLoRA Fine-tuning

206 models

ML Foundations Curricula

183 models

Review Sentiment Classification

171 models

Interactive AI Chat UIs

170 models

LLM Inference Engines

153 models

LLM Training Experimentation

151 models

GPT2 Pretraining Fine-tuning

128 models

RLHF Alignment Training

106 models

Text Summarization Transformers

100 models

Conversational Chatbot Applications

96 models

Multilingual LLM Adaptation

91 models

Multimodal Vision Language

89 models

Mathematical Reasoning Transformers

84 models

3D Vision Transformers

83 models

Transformer Frameworks Wrappers

80 models

AI-Powered Business Analytics

80 models

Llm Fine Tuning

77 models

Messaging Platform Chatbots

75 models

LLM Quantization Methods

71 models

Text Classification Transformers

71 models

BERT Model Implementations

68 models

Multi-agent Orchestration

66 models

HuggingFace Learning Resources

65 models

Question Answering Systems

64 models

Time Series Forecasting Transformers

61 models

LLM Terminal Automation

61 models

NLP Learning Coursework

60 models

Transformer Interpretability Mechanistic

57 models

Llm Scaling Architecture

56 models

Vision Language Models

56 models

Medical Image Segmentation Transformers

53 models

Hate Speech Detection

52 models

Text to Image Generation

50 models

Llm Frameworks Libraries

50 models

Emotion Detection Transformers

50 models

Neural Machine Translation

49 models

Prompt Engineering Security

49 models

Power Transformer Design

48 models

Model Evaluation Diagnostics

48 models

LLM Implementation From Scratch

44 models

OCR Document Extraction

44 models

Named Entity Recognition

44 models

Streamlit LLM Interfaces

44 models

Llm Implementation Tutorials

43 models

Browser-Based ML Inference

43 models

Transformer Training Optimization

42 models

Vision Transformer Implementations

41 models

Fake News Detection

40 models

Llm Reasoning Research

39 models

LLM Benchmark Leaderboards

39 models

Llm Learning Resources

38 models

Therapeutic Chatbot Applications

38 models

Math Reasoning Datasets

37 models

Multimodal Fusion Transformers

37 models

Protein Transformers ML

36 models

Text to Speech TTS

35 models

Llama Model Implementations

35 models

Vision Language Instruction Tuning

34 models

ViT Image Classification

34 models

Financial Return Prediction

34 models

AI-Powered SaaS Startups

34 models

ML API Deployment

34 models

Music Generation Transformers

33 models

Korean Language Models

33 models

Medical Image Diagnosis Transformers

32 models

Transformer Architecture Education

31 models

Resume Job Matching

31 models

Semantic Textual Similarity

31 models

Llm Finetuning Frameworks

30 models

Academic Thesis Repositories

30 models

Multi-provider LLM Interfaces

29 models

Llm Interpretability Explainability

29 models

Retrieval Augmented Generation

29 models

Llm Compression Optimization

28 models

Llm Knowledge Distillation

28 models

Image Captioning Transformers

28 models

Domain Specific Benchmarks

27 models

Diffusion Language Models

27 models

Text Clustering Topic Modeling

26 models

BLIP Image Captioning

25 models

T5 mT5 Fine-tuning

24 models

Graph Transformers

24 models

Creative Text Generation

24 models

CLIP Image Embeddings

23 models

Instruction Tuning Datasets

23 models

Llm Knowledge Editing

21 models

Whisper Speech Transcription

21 models

Vision Transformer Classification

21 models

Audio Classification Transformers

21 models

Semantic Search Retrieval

21 models

Tokenizer Libraries

20 models

Sparse Attention Optimization

20 models

Evaluation Frameworks Metrics

19 models

Molecular Generation Transformers

19 models

Mixture Of Experts Llms

19 models

Study Aid Generators

19 models

Object Detection Transformers

19 models

Financial Sentiment Analysis

19 models

Essay Scoring Grading

19 models

Parameter Efficient Adapters

18 models

Llm Inference Serving

18 models

Multimodal Vision Language Models

18 models

AI Content Detection

18 models

Recommendation Systems Transformers

18 models

LLM Pruning Compression

17 models

Llm Research Curation

17 models

Bias Detection Transformers

17 models

Llm Domain Datasets

16 models

Machine Translation Transformers

16 models

Llm Cuda Optimization

15 models

Disaster Tweet Classification

15 models

Cybersecurity Threat Detection

15 models

Graph Language Models

14 models

Llm Quantization Techniques

13 models

Attention Mechanism Implementations

13 models

Wav2Vec2 Speech Recognition

13 models

Speculative Decoding Algorithms

12 models

Llm Hallucination Mitigation

12 models

PHP AI SDKs

12 models

Direct Preference Optimization

12 models

PII Redaction Anonymization

11 models

Llm Framework Abstractions

11 models

Code Completion Copilots

11 models

Llm Docker Deployments

11 models

Indic Language Translation

11 models

Llm Recommendation Systems

11 models

Clinical Text Classification

11 models

Gpt Model Fine Tuning

10 models

Spam Detection Transformers

10 models

Llm Robot Planning

9 models

YouTube Video Summarization

9 models

Apple Silicon Llm Inference

8 models

Code Model Training

8 models

Kv Cache Optimization

7 models

Llm Bias Evaluation

7 models

Mistral Ai Tools

6 models

Safety Robustness Evaluation

6 models

Gpt Multilingual Training

6 models

Clinical Llm Tools

6 models

Chain Of Thought Reasoning

6 models

Llm Evaluation Benchmarking

6 models

Vision Transformer Optimization

6 models

Llm Knowledge Graph Generation

6 models

Jailbreak Attacks Analysis

6 models

Llm Orchestration Platforms

5 models

Mixup Augmentation Frameworks

5 models

Bert Model Frameworks

5 models

Compositional Reasoning Embeddings

5 models

Gpt Implementation Tutorials

4 models

Rust Llm Infrastructure

4 models

Llm Function Calling

4 models

Text Classification

4 models

Ai Music Generation

4 models

Llm Data Labeling

3 models

Multimodal Rag Systems

3 models

Prompt Engineering Techniques

3 models

Llm Serialization Formats

3 models

Ml Inference Benchmarking

3 models

State Space Model Architectures

3 models

Nlp Learning Resources

3 models

Llm Translation Tools

3 models

Competitive Agent Games

3 models

Julia Ml Frameworks

3 models

Llm Agent Training Gyms

3 models

Chatgpt Api Tutorials

3 models

Protein Design Llms

3 models

Clip Vision Language

3 models

Llm Fine Tuning Optimization

3 models

Multimodal Visual Grounding

3 models

Synthetic Data Generation

3 models

Structured Output Enforcement

3 models

Ai Generated Text Detection

3 models

Explainability Interpretability Frameworks

2 models

Llm Fine Tuning Frameworks

2 models

Nlp Fundamentals Tutorials

2 models

Distributed Training Frameworks

2 models

Text Tokenization Libraries

2 models

Graph Neural Networks

2 models

Gpt2 Language Models

2 models

Text Summarization Tools

2 models

End To End Asr Frameworks

2 models

Semantic Segmentation Techniques

2 models

Protein Language Models

2 models

Llm Chat Interfaces

2 models

Agent Memory Systems

2 models

Transformer Implementation Education

2 models

Image Caption Generation

2 models

Neural Data Compression

2 models

Rust Agent Frameworks

2 models

Langchain Integration Patterns

2 models

Ollama Chat Interfaces

2 models

Ai Stock Analysis

2 models

Defect Detection Quality Forensics

2 models

Generative Ai Learning

2 models

Variational Autoencoders Nlp

2 models

Llm Chatbot Interfaces

2 models

Vulnerability Detection Llm

2 models

Llm Thesis Research

2 models

Jax Ml Frameworks

2 models

Trajectory Prediction Ml

2 models

Ml Benchmarking Frameworks

2 models

Peptide Property Prediction

2 models

Knowledge Distillation Compression

2 models

Model Fine Tuning Methods

2 models

Task Oriented Dialogue Systems

2 models

Llm Request Routing

2 models

Llm Pentest Automation

2 models

Image Captioning Tools

2 models

Hybrid Retrieval Optimization

2 models

Video Editing Diffusion

1 models

Uncategorized

1 models

Financial Ai Agents

1 models

Content Based Recommendation

1 models

Ai Image Generation Platforms

1 models

Computer Vision Learning

1 models

Lightweight Training Utilities

1 models

Llm Orchestration Routing

1 models

Loss Function Implementations

1 models

Chatglm Fine Tuning

1 models

Machine Translation Systems

1 models

Agent Memory Infrastructure

1 models

Speech Ai Coursework

1 models

Chatbot Nlp Frameworks

1 models

Time Series Forecasting

1 models

Energy Sector Forecasting

1 models

Local Voice Assistants

1 models

Character Motion Animation

1 models

Session Context Memory

1 models

Ai Powered Search Engines

1 models

Ai Presentation Generation

1 models

Feature Selection Frameworks

1 models

Ios Nlp Frameworks

1 models

Sign Language Recognition

1 models

Compositional T2I Generation

1 models

Legal Document Analysis

1 models

Lottery Number Prediction

1 models

Kaggle Competition Solutions

1 models

Speaker Diarization Embedding

1 models

Text Translation Tools

1 models

Generative Ai Learning Projects

1 models

Rna Structure Learning

1 models

Nlp Education Courses

1 models

Lora Training Tools

1 models

Mcp Demo Examples

1 models

Causal Inference Nlp

1 models

Self Supervised Learning

1 models

Black Box Optimization

1 models

Healthcare Ai Diagnostics

1 models

Ai Video Generation

1 models

Multi Agent Debate Systems

1 models

Generative Ai Platforms

1 models

Qwen Llm Ecosystem

1 models

Llm Provider Sdks

1 models

Ollama Go Clients

1 models

Go Ml Bindings

1 models

Medical Image Segmentation

1 models

Game Playing Agents

1 models

Llm Evaluation Frameworks

1 models

Adversarial Nlp Robustness

1 models

World Models Frameworks

1 models

Paper Implementation Collections

1 models

Fact Checking Systems

1 models

Memory Augmented Architectures

1 models

Pdf Qa Systems

1 models

Spiking Neural Networks

1 models

Speech Synthesis Diffusion

1 models

Image Generation Mcp

1 models

Hugging Face Tutorials

1 models

Music Similarity Embeddings

1 models

Domain Adaptation Frameworks

1 models

Nano Gpt Variants

1 models

Model Compression Optimization

1 models

Text To Speech Frameworks

1 models

Text To Sql Rag

1 models

Kubernetes Llm Serving

1 models

Multimodal Search Engines

1 models

Prompt Engineering Optimization

1 models

Local Rag Frameworks

1 models

Rag Qa Systems

1 models

Hate Speech Content Moderation

1 models

Ml Project Portfolios

1 models

Membership Inference Attacks

1 models

Ml Project Collections

1 models

Rust Onnx Runtime

1 models

Variational Autoencoder Implementations

1 models

Llm Experimentation Labs

1 models

Javascript Ml Libraries

1 models

Diffusion Web Interfaces

1 models

Edge Device Ml Frameworks

1 models

Reading Comprehension Qa

1 models

Keyword Speech Recognition

1 models

Stable Diffusion Tools

1 models

Advanced Summarization Methods

1 models

Mental Health Chatbots

1 models

Youtube Transcript Summarization

1 models