Tencent-Hunyuan/HunyuanImage-3.0
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
Employs an autoregressive Transformer architecture departing from conventional diffusion-based approaches, unifying text and image understanding within a single framework. Features a Mixture-of-Experts design with 80 billion total parameters (13 billion active per token) for enhanced capacity, supporting both text-to-image and image-to-image generation with integrated reasoning capabilities. Integrates with HuggingFace Transformers, vLLM for accelerated inference, and provides distilled checkpoints optimized for efficient deployment with minimal sampling steps.
2,921 stars.
Stars
2,921
Forks
149
Language
Python
License
—
Category
Last pushed
Feb 03, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/Tencent-Hunyuan/HunyuanImage-3.0"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related models
hao-ai-lab/FastVideo
A unified inference and post-training framework for accelerated video generation.
thu-ml/TurboDiffusion
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
ModelTC/LightX2V
Light Image Video Generation Inference Framework
PKU-YuanGroup/Helios
Helios: Real Real-Time Long Video Generation Model
PKU-YuanGroup/MagicTime
[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators