The Transformer Directory
Quality-scored directory of 6,427 transformer models, updated daily. Every model scored on maintenance, adoption, maturity, and community signals.
Transformer models and tools for fine-tuning, quantisation, inference optimisation, and deployment of attention-based architectures.
60
70β100
177
50β69
1,626
30β49
4,564
10β29
Top models by quality score
| # | Model | Score |
|---|---|---|
| 1 |
huggingface/transformers
π€ Transformers: the model-definition framework for state-of-the-art machine... |
|
| 2 |
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs |
|
| 3 |
sgl-project/sglang
SGLang is a high-performance serving framework for large language models and... |
|
| 4 |
unslothai/unsloth
Fine-tuning & Reinforcement Learning for LLMs. π¦₯ Train OpenAI gpt-oss,... |
|
| 5 |
vllm-project/vllm-omni
A framework for efficient model inference with omni-modality models |
|
| 6 |
huggingface/peft
π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. |
|
| 7 |
alibaba/MNN
MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba,... |
|
| 8 |
LMCache/LMCache
Supercharge Your LLM with the Fastest KV Cache Layer |
|
| 9 |
AI-Hypercomputer/maxtext
A simple, performant and scalable Jax LLM! |
|
| 10 |
modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5,... |
|
| 11 |
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training |
|
| 12 |
intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity;... |
|
| 13 |
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch. |
|
| 14 |
huggingface/optimum
π Accelerate inference and training of π€ Transformers, Diffusers, TIMM and... |
|
| 15 |
huggingface/tokenizers
π₯ Fast State-of-the-Art Tokenizers optimized for Research and Production |
|
| 16 |
xorbitsai/inference
Swap GPT for any LLM by changing a single line of code. Xinference lets you... |
|
| 17 |
fla-org/flash-linear-attention
π Efficient implementations of state-of-the-art linear attention models |
|
| 18 |
SwanHubX/SwanLab
β‘οΈSwanLab - an open-source, modern-design AI training tracking and... |
|
| 19 |
tensorzero/tensorzero
TensorZero is an open-source stack for industrial-grade LLM applications. It... |
|
| 20 |
Blaizzy/mlx-vlm
MLX-VLM is a package for inference and fine-tuning of Vision Language Models... |
|
Browse by category
Transformer Architecture Tutorials
267 models
Local LLM Deployment
245 models
LoRA QLoRA Fine-tuning
206 models
ML Foundations Curricula
183 models
Review Sentiment Classification
171 models
Interactive AI Chat UIs
170 models
LLM Inference Engines
153 models
LLM Training Experimentation
151 models
GPT2 Pretraining Fine-tuning
128 models
RLHF Alignment Training
106 models
Text Summarization Transformers
100 models
Conversational Chatbot Applications
96 models
Multilingual LLM Adaptation
91 models
Multimodal Vision Language
89 models
Mathematical Reasoning Transformers
84 models
3D Vision Transformers
83 models
Transformer Frameworks Wrappers
80 models
AI-Powered Business Analytics
80 models
Llm Fine Tuning
77 models
Messaging Platform Chatbots
75 models
LLM Quantization Methods
71 models
Text Classification Transformers
71 models
BERT Model Implementations
68 models
Multi-agent Orchestration
66 models
HuggingFace Learning Resources
65 models
Question Answering Systems
64 models
Time Series Forecasting Transformers
61 models
LLM Terminal Automation
61 models
NLP Learning Coursework
60 models
Transformer Interpretability Mechanistic
57 models
Llm Scaling Architecture
56 models
Vision Language Models
56 models
Medical Image Segmentation Transformers
53 models
Hate Speech Detection
52 models
Text to Image Generation
50 models
Llm Frameworks Libraries
50 models
Emotion Detection Transformers
50 models
Neural Machine Translation
49 models
Prompt Engineering Security
49 models
Power Transformer Design
48 models
Model Evaluation Diagnostics
48 models
LLM Implementation From Scratch
44 models
OCR Document Extraction
44 models
Named Entity Recognition
44 models
Streamlit LLM Interfaces
44 models
Llm Implementation Tutorials
43 models
Browser-Based ML Inference
43 models
Transformer Training Optimization
42 models
Vision Transformer Implementations
41 models
Fake News Detection
40 models
Llm Reasoning Research
39 models
LLM Benchmark Leaderboards
39 models
Llm Learning Resources
38 models
Therapeutic Chatbot Applications
38 models
Math Reasoning Datasets
37 models
Multimodal Fusion Transformers
37 models
Protein Transformers ML
36 models
Text to Speech TTS
35 models
Llama Model Implementations
35 models
Vision Language Instruction Tuning
34 models
ViT Image Classification
34 models
Financial Return Prediction
34 models
AI-Powered SaaS Startups
34 models
ML API Deployment
34 models
Music Generation Transformers
33 models
Korean Language Models
33 models
Medical Image Diagnosis Transformers
32 models
Transformer Architecture Education
31 models
Resume Job Matching
31 models
Semantic Textual Similarity
31 models
Llm Finetuning Frameworks
30 models
Academic Thesis Repositories
30 models
Multi-provider LLM Interfaces
29 models
Llm Interpretability Explainability
29 models
Retrieval Augmented Generation
29 models
Llm Compression Optimization
28 models
Llm Knowledge Distillation
28 models
Image Captioning Transformers
28 models
Domain Specific Benchmarks
27 models
Diffusion Language Models
27 models
Text Clustering Topic Modeling
26 models
BLIP Image Captioning
25 models
T5 mT5 Fine-tuning
24 models
Graph Transformers
24 models
Creative Text Generation
24 models
CLIP Image Embeddings
23 models
Instruction Tuning Datasets
23 models
Llm Knowledge Editing
21 models
Whisper Speech Transcription
21 models
Vision Transformer Classification
21 models
Audio Classification Transformers
21 models
Semantic Search Retrieval
21 models
Tokenizer Libraries
20 models
Sparse Attention Optimization
20 models
Evaluation Frameworks Metrics
19 models
Molecular Generation Transformers
19 models
Mixture Of Experts Llms
19 models
Study Aid Generators
19 models
Object Detection Transformers
19 models
Financial Sentiment Analysis
19 models
Essay Scoring Grading
19 models
Parameter Efficient Adapters
18 models
Llm Inference Serving
18 models
Multimodal Vision Language Models
18 models
AI Content Detection
18 models
Recommendation Systems Transformers
18 models
LLM Pruning Compression
17 models
Llm Research Curation
17 models
Bias Detection Transformers
17 models
Llm Domain Datasets
16 models
Machine Translation Transformers
16 models
Llm Cuda Optimization
15 models
Disaster Tweet Classification
15 models
Cybersecurity Threat Detection
15 models
Graph Language Models
14 models
Llm Quantization Techniques
13 models
Attention Mechanism Implementations
13 models
Wav2Vec2 Speech Recognition
13 models
Speculative Decoding Algorithms
12 models
Llm Hallucination Mitigation
12 models
PHP AI SDKs
12 models
Direct Preference Optimization
12 models
PII Redaction Anonymization
11 models
Llm Framework Abstractions
11 models
Code Completion Copilots
11 models
Llm Docker Deployments
11 models
Indic Language Translation
11 models
Llm Recommendation Systems
11 models
Clinical Text Classification
11 models
Gpt Model Fine Tuning
10 models
Spam Detection Transformers
10 models
Llm Robot Planning
9 models
YouTube Video Summarization
9 models
Apple Silicon Llm Inference
8 models
Code Model Training
8 models
Kv Cache Optimization
7 models
Llm Bias Evaluation
7 models
Mistral Ai Tools
6 models
Safety Robustness Evaluation
6 models
Gpt Multilingual Training
6 models
Clinical Llm Tools
6 models
Chain Of Thought Reasoning
6 models
Llm Evaluation Benchmarking
6 models
Vision Transformer Optimization
6 models
Llm Knowledge Graph Generation
6 models
Jailbreak Attacks Analysis
6 models
Llm Orchestration Platforms
5 models
Mixup Augmentation Frameworks
5 models
Bert Model Frameworks
5 models
Compositional Reasoning Embeddings
5 models
Gpt Implementation Tutorials
4 models
Rust Llm Infrastructure
4 models
Llm Function Calling
4 models
Text Classification
4 models
Ai Music Generation
4 models
Llm Data Labeling
3 models
Multimodal Rag Systems
3 models
Prompt Engineering Techniques
3 models
Llm Serialization Formats
3 models
Ml Inference Benchmarking
3 models
State Space Model Architectures
3 models
Nlp Learning Resources
3 models
Llm Translation Tools
3 models
Competitive Agent Games
3 models
Julia Ml Frameworks
3 models
Llm Agent Training Gyms
3 models
Chatgpt Api Tutorials
3 models
Protein Design Llms
3 models
Clip Vision Language
3 models
Llm Fine Tuning Optimization
3 models
Multimodal Visual Grounding
3 models
Synthetic Data Generation
3 models
Structured Output Enforcement
3 models
Ai Generated Text Detection
3 models
Explainability Interpretability Frameworks
2 models
Llm Fine Tuning Frameworks
2 models
Nlp Fundamentals Tutorials
2 models
Distributed Training Frameworks
2 models
Text Tokenization Libraries
2 models
Graph Neural Networks
2 models
Gpt2 Language Models
2 models
Text Summarization Tools
2 models
End To End Asr Frameworks
2 models
Semantic Segmentation Techniques
2 models
Protein Language Models
2 models
Llm Chat Interfaces
2 models
Agent Memory Systems
2 models
Transformer Implementation Education
2 models
Image Caption Generation
2 models
Neural Data Compression
2 models
Rust Agent Frameworks
2 models
Langchain Integration Patterns
2 models
Ollama Chat Interfaces
2 models
Ai Stock Analysis
2 models
Defect Detection Quality Forensics
2 models
Generative Ai Learning
2 models
Variational Autoencoders Nlp
2 models
Llm Chatbot Interfaces
2 models
Vulnerability Detection Llm
2 models
Llm Thesis Research
2 models
Jax Ml Frameworks
2 models
Trajectory Prediction Ml
2 models
Ml Benchmarking Frameworks
2 models
Peptide Property Prediction
2 models
Knowledge Distillation Compression
2 models
Model Fine Tuning Methods
2 models
Task Oriented Dialogue Systems
2 models
Llm Request Routing
2 models
Llm Pentest Automation
2 models
Image Captioning Tools
2 models
Hybrid Retrieval Optimization
2 models
Video Editing Diffusion
1 models
Uncategorized
1 models
Financial Ai Agents
1 models
Content Based Recommendation
1 models
Ai Image Generation Platforms
1 models
Computer Vision Learning
1 models
Lightweight Training Utilities
1 models
Llm Orchestration Routing
1 models
Loss Function Implementations
1 models
Chatglm Fine Tuning
1 models
Machine Translation Systems
1 models
Agent Memory Infrastructure
1 models
Speech Ai Coursework
1 models
Chatbot Nlp Frameworks
1 models
Time Series Forecasting
1 models
Energy Sector Forecasting
1 models
Local Voice Assistants
1 models
Character Motion Animation
1 models
Session Context Memory
1 models
Ai Powered Search Engines
1 models
Ai Presentation Generation
1 models
Feature Selection Frameworks
1 models
Ios Nlp Frameworks
1 models
Sign Language Recognition
1 models
Compositional T2I Generation
1 models
Legal Document Analysis
1 models
Lottery Number Prediction
1 models
Kaggle Competition Solutions
1 models
Speaker Diarization Embedding
1 models
Text Translation Tools
1 models
Generative Ai Learning Projects
1 models
Rna Structure Learning
1 models
Nlp Education Courses
1 models
Lora Training Tools
1 models
Mcp Demo Examples
1 models
Causal Inference Nlp
1 models
Self Supervised Learning
1 models
Black Box Optimization
1 models
Healthcare Ai Diagnostics
1 models
Ai Video Generation
1 models
Multi Agent Debate Systems
1 models
Generative Ai Platforms
1 models
Qwen Llm Ecosystem
1 models
Llm Provider Sdks
1 models
Ollama Go Clients
1 models
Go Ml Bindings
1 models
Medical Image Segmentation
1 models
Game Playing Agents
1 models
Llm Evaluation Frameworks
1 models
Adversarial Nlp Robustness
1 models
World Models Frameworks
1 models
Paper Implementation Collections
1 models
Fact Checking Systems
1 models
Memory Augmented Architectures
1 models
Pdf Qa Systems
1 models
Spiking Neural Networks
1 models
Speech Synthesis Diffusion
1 models
Image Generation Mcp
1 models
Hugging Face Tutorials
1 models
Music Similarity Embeddings
1 models
Domain Adaptation Frameworks
1 models
Nano Gpt Variants
1 models
Model Compression Optimization
1 models
Text To Speech Frameworks
1 models
Text To Sql Rag
1 models
Kubernetes Llm Serving
1 models
Multimodal Search Engines
1 models
Prompt Engineering Optimization
1 models
Local Rag Frameworks
1 models
Rag Qa Systems
1 models
Hate Speech Content Moderation
1 models
Ml Project Portfolios
1 models
Membership Inference Attacks
1 models
Ml Project Collections
1 models
Rust Onnx Runtime
1 models
Variational Autoencoder Implementations
1 models
Llm Experimentation Labs
1 models
Javascript Ml Libraries
1 models
Diffusion Web Interfaces
1 models
Edge Device Ml Frameworks
1 models
Reading Comprehension Qa
1 models
Keyword Speech Recognition
1 models
Stable Diffusion Tools
1 models
Advanced Summarization Methods
1 models
Mental Health Chatbots
1 models
Youtube Transcript Summarization
1 models