EvolvingLMMs-Lab/lmms-engine
A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.
Supports 20+ model architectures including vision-language (Qwen3-VL, LLaVA), diffusion (WanVideo, SiT), and MoE variants with unified training abstractions. Built on PyTorch FSDP2 with composition-friendly distributed training, featuring production optimizations like Ulysses sequence parallelism for long contexts, Liger fused kernels (30% memory reduction), and bin-packing for 35-40% MFU gains. Integrates with torchrun/Accelerate launchers and provides Docker environments for rapid deployment across single-GPU to multi-node distributed setups.
740 stars. Actively maintained with 7 commits in the last 30 days.
Stars
740
Forks
32
Language
Python
License
—
Category
Last pushed
Mar 12, 2026
Commits (30d)
7
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/EvolvingLMMs-Lab/lmms-engine"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
aalok-sathe/surprisal
A unified interface for computing surprisal (log probabilities) from language models! Supports...
FunnySaltyFish/Better-Ruozhiba
【逐条处理完成】人为审核+修改每一条的弱智吧精选问题QA数据集
reasoning-machines/pal
PaL: Program-Aided Language Models (ICML 2023)
microsoft/monitors4codegen
Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static...
YutongWang1216/DocMTAgent
Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based...