Tencent-Hunyuan/HunyuanImage-3.0

HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation

/ 100

Established

HunyuanImage-3.0 employs an autoregressive Transformer architecture, departing from conventional diffusion-based approaches, and unifies text and image understanding within a single framework. It uses a Mixture-of-Experts design with 80 billion total parameters, of which 13 billion are active per token, and supports both text-to-image and image-to-image generation with integrated reasoning capabilities. It integrates with HuggingFace Transformers and with vLLM for accelerated inference, and provides distilled checkpoints optimized for efficient deployment with few sampling steps.
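To make "13 billion active per token" concrete, here is a minimal sketch of top-k expert routing, the general mechanism by which a Mixture-of-Experts layer runs only a subset of its experts for each token. Only the 80 billion and 13 billion figures come from the description above; the expert count, the `top_k` value, the router logits, and the `route_token` helper are hypothetical and chosen for illustration, not taken from the model's actual configuration or code.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def route_token(router_logits, top_k):
    """Pick the top_k experts for one token.

    Returns (expert_index, weight) pairs, with the weights
    renormalized so that they sum to 1 over the chosen experts.
    """
    probs = softmax(router_logits)
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]
    norm = sum(probs[i] for i in chosen)
    return [(i, probs[i] / norm) for i in chosen]


# Parameter counts quoted in the description (in billions).
TOTAL_PARAMS_B = 80
ACTIVE_PARAMS_B = 13
active_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B

# Hypothetical router scores for one token over 8 experts (illustrative only).
example_logits = [0.1, 2.0, -1.0, 0.5, 1.5, -0.3, 0.0, 0.9]
selected = route_token(example_logits, top_k=2)

print(f"active fraction per token: {active_fraction:.2%}")
print("selected experts:", [index for index, _ in selected])
```

Because only the selected experts run for a given token, per-token compute tracks the active parameter count rather than the total, which is how a model of this design can hold 80 billion parameters while using roughly a sixth of them per token.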

2,921 stars.

No package. No dependents.

Maintenance 10 / 25

Adoption 10 / 25

Maturity 15 / 25

Community 18 / 25


Stars: 2,921

Forks: 149

Language: Python

License: not listed

Related models

hao-ai-lab/FastVideo

A unified inference and post-training framework for accelerated video generation.

thu-ml/TurboDiffusion

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

ModelTC/LightX2V

Light Image Video Generation Inference Framework

PKU-YuanGroup/Helios

Helios: Real Real-Time Long Video Generation Model

PKU-YuanGroup/MagicTime

[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
