Image-to-3D Generation Diffusion Models
Tools for generating 3D models, scenes, and reconstructions from 2D images using diffusion models. Includes single-image to 3D object generation, multi-view synthesis, scene reconstruction, and related 3D synthesis tasks. Does NOT include general 3D modeling, 2D image generation, or video generation.
There are 129 image-to-3d generation models tracked. 2 score above 50 (established tier). The highest-rated is jayin92/Skyfall-GS at 53/100 with 762 stars. 1 of the top 10 are actively maintained.
Get all 129 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=diffusion&subcategory=image-to-3d-generation&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
jayin92/Skyfall-GS
Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery |
|
Established |
| 2 |
Tencent-Hunyuan/Hunyuan3D-2
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models. |
|
Established |
| 3 |
deepseek-ai/DreamCraft3D
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D... |
|
Emerging |
| 4 |
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion. |
|
Emerging |
| 5 |
ActiveVisionLab/gaussctrl
[ECCV 2024] GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian... |
|
Emerging |
| 6 |
cvlab-columbia/zero123
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023) |
|
Emerging |
| 7 |
caiyuanhao1998/Open-DiffusionGS
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable... |
|
Emerging |
| 8 |
Visionary-Laboratory/visionary
Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian... |
|
Emerging |
| 9 |
manycore-research/SpatialGen
[3DV 2026] SpatialGen: Layout-guided 3D Indoor Scene Generation |
|
Emerging |
| 10 |
aminshabani/house_diffusion
The implementation of "HouseDiffusion: Vector Floorplan Generation via a... |
|
Emerging |
| 11 |
ai4ce/GARF
[ICCV2025] GARF: Learning Generalizable 3D Reassembly for Real-World Fractures |
|
Emerging |
| 12 |
mohammadasim98/scenetok
[CVPR '26] SceneTok: A Compressed, Diffusable Token Space for 3D Scenes |
|
Emerging |
| 13 |
HKU-MedAI/GEM-3D
[IJCV'2026] Generative Enhancement for 3D Medical Images |
|
Emerging |
| 14 |
mohammadasim98/met3r
MEt3R: Measuring Multi-View Consistency in Generated Images |
|
Emerging |
| 15 |
nv-tlabs/Difix3D
[CVPR 2025 Oral & Best Paper Finalist] Difix3D+: Improving 3D... |
|
Emerging |
| 16 |
liuyuan-pal/SyncDreamer
[ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images... |
|
Emerging |
| 17 |
MECLabTUDA/SurGrID
SurGrID: Controllable Surgical Simulation via Scene Graph to Image Diffusion... |
|
Emerging |
| 18 |
duanyiqun/DiffusionDepth
PyTorch Implementation of introducing diffusion approach to 3D depth... |
|
Emerging |
| 19 |
huanngzh/MV-Adapter
[ICCV 2025] Official impl. of "MV-Adapter: Multi-view Consistent Image... |
|
Emerging |
| 20 |
SadilKhan/MARVEL-FX3D
[CVPR 2025] Official Implementation of Marvel-FX3D from MARVEL-40M+:... |
|
Emerging |
| 21 |
nv-tlabs/SCube
[NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats |
|
Emerging |
| 22 |
SUDO-AI-3D/zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view... |
|
Emerging |
| 23 |
guochengqian/Magic123
[ICLR24] Official PyTorch Implementation of Magic123: One Image to... |
|
Emerging |
| 24 |
TQTQliu/Free4D
[ICCV 2025] Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency |
|
Emerging |
| 25 |
NIRVANALAN/LN3Diff
[ECCV-2024] LN3Diff creates high-quality 3D object mesh from text within 8... |
|
Emerging |
| 26 |
Luo-Yihang/3DEnhancer
[CVPR 2025] 3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement |
|
Emerging |
| 27 |
thu-ml/CRM
[ECCV 2024] Single Image to 3D Textured Mesh in 10 seconds with... |
|
Emerging |
| 28 |
lukasHoel/text2room
Text2Room generates textured 3D meshes from a given text prompt using 2D... |
|
Emerging |
| 29 |
tangjiapeng/DiffuScene
[CVPR 2024] DiffuScene: Denoising Diffusion Models for Generative Indoor... |
|
Emerging |
| 30 |
3DTopia/3DTopia-XL
[CVPR 2025 Highlight] 3DTopia-XL: High-Quality 3D PBR Asset Generation via... |
|
Emerging |
| 31 |
hustvl/GaussianDreamer
[CVPR 2024] GaussianDreamer: Fast Generation from Text to 3D Gaussians by... |
|
Emerging |
| 32 |
ForeverFancy/GVFDiffusion
[ICCV 2025] Gaussian Variation Field Diffusion for High-fidelity Video-to-4D... |
|
Emerging |
| 33 |
CVL-UESTC/PerLDiff
ICCV 2025-PerLDiff: Controllable Street View Synthesis Using... |
|
Emerging |
| 34 |
KupynOrest/epipolar-dpo
Official repo for: Epipolar Geometry Improves Video Generation Models |
|
Emerging |
| 35 |
WHU-USI3DV/VistaDream
[ICCV 2025] VistaDream: Sampling multiview consistent images for single-view... |
|
Emerging |
| 36 |
Masoudjafaripour/Vibe_CADing
Vibe-CADing: Conditional CAD Generation and Retrieval for a Text-to-CAD... |
|
Emerging |
| 37 |
iamNCJ/DiLightNet
Official Code Release for [SIGGRAPH 2024] DiLightNet: Fine-grained Lighting... |
|
Emerging |
| 38 |
theEricMa/ScaleDreamer
[ECCV2024] ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous... |
|
Emerging |
| 39 |
byeongjun-park/HarmonyView
[CVPR 2024] Official pytorch implementation of "HarmonyView: Harmonizing... |
|
Emerging |
| 40 |
Lakonik/SSDNeRF
[ICCV 2023] Single-Stage Diffusion NeRF |
|
Emerging |
| 41 |
cvlab-columbia/pix2gestalt
Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes"... |
|
Emerging |
| 42 |
kxhit/zero123-hf
A diffuser implementation of Zero123. Zero-1-to-3: Zero-shot One Image to 3D... |
|
Emerging |
| 43 |
Yukun-Huang/DreamCube
[ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama... |
|
Emerging |
| 44 |
PKU-YuanGroup/HoloTime
[ACM MM 2025] HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene... |
|
Emerging |
| 45 |
hancyran/LiDAR-Diffusion
[CVPR 2024] Official implementation of "Towards Realistic Scene Generation... |
|
Emerging |
| 46 |
3DTopia/Imagine360
Official code of "Imagine360: Immersive 360 Video Generation from Perspective Anchor" |
|
Emerging |
| 47 |
River-Zhang/SIFU
[CVPR 2024 Highlight] Official repository for paper "SIFU: Side-view... |
|
Emerging |
| 48 |
yuhengliu02/pyramid-discrete-diffusion
Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene... |
|
Emerging |
| 49 |
vzyrianov/lidargen
Official implementation of "Learning to Generate Realistic LiDAR Point... |
|
Emerging |
| 50 |
flymin/MagicDrive3D
Official implementation of the paper “MagicDrive3D: Controllable 3D... |
|
Emerging |
| 51 |
eric-zqwang/puzzlefusion-plusplus
Code for paper "PuzzleFusion++: Auto-agglomerative 3D Fracture Assembly by... |
|
Emerging |
| 52 |
Shen-Lab/LDM-3DG
[ICLR 2024] "Latent 3D Graph Diffusion" by Yuning You, Ruida Zhou, Jiwoong... |
|
Emerging |
| 53 |
Fsoft-AIC/Language-Conditioned-Affordance-Pose-Detection-in-3D-Point-Clouds
[ICRA 2024] Language-Conditioned Affordance-Pose Detection in 3D Point Clouds |
|
Emerging |
| 54 |
wenyuqing/panacea
[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable... |
|
Emerging |
| 55 |
spin-me-round/SpinMeRound
[ICCV 2025] SpinMeRound: Consistent Multi-View Identity Generation Using... |
|
Emerging |
| 56 |
GONGJIA0208/Diffpose
[CVPR 2023] DiffPose: Toward More Reliable 3D Pose Estimation |
|
Emerging |
| 57 |
huanngzh/EpiDiff
[CVPR 2024] EpiDiff: Enhancing Multi-View Synthesis via Localized... |
|
Emerging |
| 58 |
xiyichen/morphablediffusion
[CVPR 2024] Official implementation of Morphable Diffusion: 3D-Consistent... |
|
Emerging |
| 59 |
WinKawaks/DreamWire
[CVPR 2024] Wired Perspectives: Multi-View Wire Art Embraces Generative AI |
|
Emerging |
| 60 |
kazuto1011/r2dm
LiDAR Data Synthesis with Denoising Diffusion Probabilistic Models (ICRA 2024) |
|
Emerging |
| 61 |
YangLing0818/SGDiff
Official implementation for "Diffusion-Based Scene Graph to Image Generation... |
|
Experimental |
| 62 |
ubc-vision/vivid123
[CVPR 2024 Highlight] ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models |
|
Experimental |
| 63 |
Sin3DM/Sin3DM
single 3D shape diffusion model |
|
Experimental |
| 64 |
donydchen/sem2nerf
😺 [ECCV'22] Sem2NeRF: Converting Single-View Semantic Masks to NeRFs |
|
Experimental |
| 65 |
JiejiangWu/FaceG2E
Official code for CVPR2024 paper "text-guided 3d face synthesis - from... |
|
Experimental |
| 66 |
astra-vision/LiDPM
[IV 2025, Oral] Official code of "LiDPM: Rethinking Point Diffusion for... |
|
Experimental |
| 67 |
VamosC/CapHuman
[CVPR2024] CapHuman: Capture Your Moments in Parallel Universes |
|
Experimental |
| 68 |
pals-ttic/sjc
Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D... |
|
Experimental |
| 69 |
CUHK-AIM-Group/Polyp-Gen
[ICRA 2025] Polyp-Gen: Realistic and Diverse Polyp Image Generation for... |
|
Experimental |
| 70 |
iCVTEAM/IPSM
How to Use Diffusion Priors under Sparse Views? (NeurIPS 2024) |
|
Experimental |
| 71 |
lukasuz/MotionDreamer
[3DV 2025] MotionDreamer: Exploring Semantic Video Diffusion features for... |
|
Experimental |
| 72 |
boqian-li/GarmentDreamer
[3DV 2025] GarmentDreamer: 3DGS Guided Garment Synthesis with Diverse... |
|
Experimental |
| 73 |
zhizdev/mvdfusion
[CVPR 2024] MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation |
|
Experimental |
| 74 |
EdwardFerdian/diff-3
Synthetic 3D echo with paired labels using Latent Diffusion Model |
|
Experimental |
| 75 |
Fsoft-AIC/LGD
[CVPR 2024] Dataset and Code for "Language-driven Grasp Detection." |
|
Experimental |
| 76 |
nnanhuang/Customize-it-3D
[ICRA 2025] Official implementation of Customize-It-3D: High-Quality 3D... |
|
Experimental |
| 77 |
agneet42/robustness_depth_lang
[CVPR 2024] "On the Robustness of Language Guidance for Low-Level Vision... |
|
Experimental |
| 78 |
RodinHD/RodinHD
[ECCV 2024] RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models |
|
Experimental |
| 79 |
lyndonzheng/Free3D
[CVPR'24] Consistent Novel View Synthesis without 3D Representation |
|
Experimental |
| 80 |
IDEA-Research/HumanArt
[CVPR 2023] The official implementation of CVPR 2023 paper "Human-Art: A... |
|
Experimental |
| 81 |
MECLabTUDA/SG2VID
SG2VID: Scene Graphs Enable Fine-Grained Control for Video Synthesis (MICCAI... |
|
Experimental |
| 82 |
MKnoche/warp3d_reposing
Reposing Humans by Warping 3D Features |
|
Experimental |
| 83 |
Karbo123/RGBD-Diffusion
RGBD2: Generative Scene Synthesis via Incremental View Inpainting using RGBD... |
|
Experimental |
| 84 |
Westlake-AI/SEMA
Switch EMA: A Free Lunch for Better Flatness and Sharpness |
|
Experimental |
| 85 |
RahulSajnani/GeoDiffuser
[WACV 2025, Best Student Paper, Oral] GeoDiffuser: Geometry-Based Image... |
|
Experimental |
| 86 |
wrk226/DiffProxy
Official repo of "DiffProxy: Multi-View Human Mesh Recovery via... |
|
Experimental |
| 87 |
VITA-Group/NeuralLift-360
[CVPR 2023, Highlight] "NeuralLift-360: Lifting An In-the-wild 2D Photo to A... |
|
Experimental |
| 88 |
aidayang/MV-Adapter-OneClick
MV-Adapter一键启动整合包 |
|
Experimental |
| 89 |
pansanity666/Awesome-Avatars
List of recent advances for human avatars, including generation,... |
|
Experimental |
| 90 |
silence-tang/GaussianIP
[CVPR 2025] Official Implementation of GaussianIP: Identity-Preserving... |
|
Experimental |
| 91 |
sair-lab/SuperPC
[CVPR 2025] SuperPC: A Single Diffusion Model for Point Cloud Completion,... |
|
Experimental |
| 92 |
zhizdev/sparsefusion
[CVPR 2023] SparseFusion: Distilling View-conditioned Diffusion for 3D Reconstruction |
|
Experimental |
| 93 |
rese1f/CityGen
🏙️🌆🌃 Try Infinite and Controllable 3D City Layout Generation! |
|
Experimental |
| 94 |
songlin/d3roma
A diffusion model-based stereo depth estimation framework that can predict... |
|
Experimental |
| 95 |
atakan-topaloglu/OracleGS
[WACV 2026 Oral] Reference Implementation of the paper "OracleGS: Grounding... |
|
Experimental |
| 96 |
BenzinaN/360-head-removal
Automated AI solution to remove tripods/cameramen from 360° videos... |
|
Experimental |
| 97 |
SheldonTsui/Matlaber
MATLABER: Material-Aware Text-to-3D via LAtent BRDF auto-EncodeR |
|
Experimental |
| 98 |
WHU-USI3DV/FreeReg
[ICLR 2024] FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained... |
|
Experimental |
| 99 |
fabiotosi92/Diffusion4RobustDepth
[ECCV 2024] Diffusion Models for Monocular Depth Estimation: Overcoming... |
|
Experimental |
| 100 |
Adamdad/hash3D
Hash3D: Training-free Acceleration for 3D Generation |
|
Experimental |
| 101 |
xmed-lab/MuTri
CVPR 2025: MuTri: Multi-view Tri-alignment for OCT to OCTA 3D Image Translation |
|
Experimental |
| 102 |
divyakraman/HawkI2024
Codebase for the paper HawkI: HawkI: Homography & Mutual Information... |
|
Experimental |
| 103 |
minfenli/GenRC
[ECCV 2024] GenRC: 3D Indoor Scene Generation from Sparse Image Collections |
|
Experimental |
| 104 |
YouDream3D/YouDream
YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals |
|
Experimental |
| 105 |
stalhabukhari/vsigd
Code for RA-L'25 paper: "Variational Shape Inference for Grasp Diffusion on SE(3)" |
|
Experimental |
| 106 |
BrandonHanx/HeadSculpt
[NeurIPS 2023] HeadSculpt: Crafting 3D Head Avatars with Text |
|
Experimental |
| 107 |
humansensinglab/fabric-diffusion
[SIGGRAPH Asia 2024] FabricDiffusion: High-Fidelity Texture Transfer for 3D... |
|
Experimental |
| 108 |
jeongho9413/lidargsu
Official implementation of LidarGSU |
|
Experimental |
| 109 |
VinAIResearch/EFHQ
Code and data for the CVPR24 paper "EFHQ: Multi-purpose ExtremePose-Face-HQ... |
|
Experimental |
| 110 |
jagennath-hari/Singularity3D
Single-image world synthesis using a generative panorama prior,... |
|
Experimental |
| 111 |
SOTAMak1r/GVGEN
[ECCV 2024] GVGEN: Text-to-3D Generation with Volumetric Representation |
|
Experimental |
| 112 |
zhaorw02/FlexiDreamer
An official implementation of FlexiDreamer: Single Image-to-3D Generation... |
|
Experimental |
| 113 |
VinAIResearch/DiverseDream
DiverseDream: A Technique to Generate Diverse 3D Objects from the Same Text... |
|
Experimental |
| 114 |
ntaquan0125/pointmap-conditioned-diffusion
[WACV 2026] Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis |
|
Experimental |
| 115 |
GxKirin/GaussianIP
[CVPR 2025] Official Implementation of GaussianIP: Identity-Preserving... |
|
Experimental |
| 116 |
DiET-GS/DiET-GS
[CVPR 2025] Official code of "DiET-GS: Diffusion Prior and Event... |
|
Experimental |
| 117 |
yuanze-lin/IllumiCraft
[NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and... |
|
Experimental |
| 118 |
Fsoft-AIC/Language-Driven-6-DoF-Grasp-Detection-Using-Negative-Prompt-Guidance
[ECCV 2024] Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance |
|
Experimental |
| 119 |
Sense-GVT/Hi3D
From Geometry to Texture: A Hierarchical Framework for Efficient Text-to-3D... |
|
Experimental |
| 120 |
DavideGrecoGit/3D-reconstruction-under-occlusion
Explore the impact of occlusion on Pix2Vox, a multi-view 3D reconstruction... |
|
Experimental |
| 121 |
J-F-Cheng/G-FARS-3DPartGrouping
[CVPR 2024] G-FARS: Gradient-Field-based Auto-Regressive Sampling for 3D... |
|
Experimental |
| 122 |
hamnaanaa/3D-Scene-Diffusion-Guidance-using-Scene-Graphs
The implementation for "3D Scene Diffusion Guidance using Scene Graphs"... |
|
Experimental |
| 123 |
cederican/FR3-D
FR3-D: A Regressor-Guided SE(3)-Equivariant conditioned Diffusion Model for... |
|
Experimental |
| 124 |
THU-LYJ-Lab/O2-Recon
[AAAI 2024] O2-Recon: Completing 3D Reconstruction of Occluded Objects in... |
|
Experimental |
| 125 |
PhilipSanM/Homecraft
TT 2025-A038 |
|
Experimental |
| 126 |
desaixie/carve3d
Code for Carve3D: Improving Multi-view Reconstruction Consistency for... |
|
Experimental |
| 127 |
imyaash/ImaginFusion
A text guided 3D model generation model & application. An implemntation of... |
|
Experimental |
| 128 |
zju3dv/UniVerse
[ICCV 2025] UniVerse: Unleashing the Scene Prior of Video Diffusion Models... |
|
Experimental |
| 129 |
zcdliuwei/liuwei
[CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting |
|
Experimental |