SkyworkAI/UniPic
Open-source SOTA multi-image editing model
Unifies image editing, generation, and understanding in a single multimodal architecture across three model variants: UniPic-3 handles 1–6 image composition and editing with 8-step inference via consistency model and DMD distillation; UniPic-2 combines efficient diffusion post-training for text-to-image and fine-grained editing; UniPic-1 uses autoregressive transformer modeling for joint perception and synthesis. All variants available on HuggingFace with official PyTorch implementations.
863 stars.
Stars
863
Forks
43
Language
Python
License
MIT
Category
Last pushed
Jan 24, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/SkyworkAI/UniPic"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Vchitect/VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
VectorSpaceLab/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
EndlessSora/focal-frequency-loss
[ICCV 2021] Focal Frequency Loss for Image Reconstruction and Synthesis
JIA-Lab-research/DreamOmni2
This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing...
PKU-YuanGroup/ChronoMagic-Bench
[NeurIPS 2024 D&B Spotlight🔥] ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of...