huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Implements a full post-training pipeline spanning continued pretraining, supervised fine-tuning, and preference alignment techniques including DPO, ORPO, and Constitutional AI. Training scripts support distributed training via DeepSpeed ZeRO-3 and parameter-efficient approaches (LoRA/QLoRA), with reproducible YAML-based recipes for models like Zephyr and SmolLM. Integrates with Hugging Face Hub for dataset and model management, supporting both human feedback and AI preference signals.
5,523 stars and 151 monthly downloads. No commits in the last 6 months. Available on PyPI.
Stars: 5,523
Forks: 474
Language: Python
License: Apache-2.0
Last pushed: Sep 08, 2025
Monthly downloads: 151
Commits (30d): 0
Dependencies: 21
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/huggingface/alignment-handbook"
Open to everyone: 100 requests/day with no API key. A free key raises the limit to 1,000 requests/day.
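The curl call above can also be scripted. Below is a minimal Python sketch using only the standard library; the response shape is not documented here, so the result is returned as a plain dict (an assumption that the endpoint returns JSON):

```python
import json
import urllib.request

API_BASE = "https://pt-edge.onrender.com/api/v1/quality"

def build_url(ecosystem: str, repo: str) -> str:
    """Build the metrics URL for a repo, e.g. 'huggingface/alignment-handbook'."""
    return f"{API_BASE}/{ecosystem}/{repo}"

def fetch_quality(ecosystem: str, repo: str, timeout: float = 10.0) -> dict:
    """Fetch quality metrics for a repo (assumes a JSON response body)."""
    with urllib.request.urlopen(build_url(ecosystem, repo), timeout=timeout) as resp:
        return json.load(resp)

# Example (makes a live request, subject to the 100 requests/day limit):
# metrics = fetch_quality("transformers", "huggingface/alignment-handbook")
```

With a free key, you would attach it to the request; the exact header name is not stated on this page, so check the API docs before adding authentication.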
Related models
agentscope-ai/Trinity-RFT
Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement...
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO &...
zjunlp/EasyEdit
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
hyunwoongko/nanoRLHF
nanoRLHF: from-scratch journey into how LLMs and RLHF really work.
PKU-Alignment/align-anything
Align Anything: Training All-modality Model with Feedback