hiyouga/LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

/ 100

Verified

Supports modular fine-tuning approaches including supervised fine-tuning, reward modeling, and reinforcement learning methods (PPO, DPO, KTO, ORPO), with optimizations like Flash Attention, quantized LoRA, and advanced optimizers (GaLore, BAdam, Muon). Provides both CLI and Gradio web interface for model training and inference, integrating with vLLM/SGLang for OpenAI-compatible API deployment.

68,347 stars. Actively maintained with 24 commits in the last 30 days.

No Package No Dependents

Maintenance 23 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 21 / 25

How are scores calculated?

Stars

68,347

Forks

8,346

Language

Python

License

Apache-2.0

Compare

LlamaFactory and unsloth LlamaFactory and ms-swift LlamaFactory and oumi LlamaFactory and xTuring LlamaFactory and h2o-llmstudio LlamaFactory and FineTuningLLMs LlamaFactory and lorax LlamaFactory and LLM-Finetuning LlamaFactory and Finetune_LLMs LlamaFactory and training-custom-llama

Related models

unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama,...

huggingface/peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5,...

linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

oumi-ai/oumi

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Explore Transformer Models

All categories Trending Transformer directory Insights