hiyouga/LlamaFactory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Supports modular fine-tuning approaches including supervised fine-tuning, reward modeling, and reinforcement learning methods (PPO, DPO, KTO, ORPO), with optimizations like Flash Attention, quantized LoRA, and advanced optimizers (GaLore, BAdam, Muon). Provides both CLI and Gradio web interface for model training and inference, integrating with vLLM/SGLang for OpenAI-compatible API deployment.
68,347 stars. Actively maintained with 24 commits in the last 30 days.
Stars
68,347
Forks
8,346
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 10, 2026
Commits (30d)
24
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/hiyouga/LlamaFactory"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related models
unslothai/unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama,...
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5,...
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
oumi-ai/oumi
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!