pengsongyou/openscene
[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
Combines vision-language models (OpenSeg/LSeg) with 3D scene geometry through multi-view feature fusion and distillation into lightweight 3D neural networks. Supports arbitrary open-vocabulary queries on indoor and outdoor scenes (ScanNet, Matterport3D, nuScenes, Replica) without requiring retraining for new concepts. Provides both a real-time interactive demo and pre-trained models for semantic segmentation, instance segmentation, and property prediction tasks.
800 stars. No commits in the last 6 months.
Stars
800
Forks
65
Language
Python
License
Apache-2.0
Category
Last pushed
Oct 27, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/pengsongyou/openscene"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
col14m/cadrille
[ICLR2026] cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning
filaPro/cad-recode
[ICCV2025] CAD-Recode: Reverse Engineering CAD Code from Point Clouds
cambrian-mllm/cambrian-s
Cambrian-S: Towards Spatial Supersensing in Video
worldbench/3EED
[NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D
Gorilla-Lab-SCUT/PaDT
[ICLR 2026] Official implementation of "Patch-as-Decodable-Token: Towards Unified Multi-Modal...