intelligolabs/Le-RNR-Map
[ICCV 23] Official repository for Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language
Combines CLIP embeddings with neural radiance field maps so that 3D scenes can be queried spatially in natural language, supporting both single-object localization and multi-object discovery. The architecture uses a pretrained autoencoder (GSN) to encode rendered views into compact spatial representations, which are then indexed by CLIP embeddings for semantic matching. Built on the Habitat simulator and the Gibson 3D scene dataset, it generates queryable maps through navigation-based scene exploration and produces heatmaps indicating object locations.
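The querying idea described above can be illustrated with a minimal sketch. It is not the repository's code: random vectors stand in for CLIP embeddings, the 4x4 grid stands in for the map, and the names `similarity_heatmap`, `best_cell`, and `cells_above` are hypothetical. It only shows how scoring each map cell against a query embedding gives a heatmap, where the maximum serves single-object localization and a threshold serves multi-object discovery.

```python
import math
import random


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def similarity_heatmap(map_embeddings, query_embedding):
    """Score every map cell against the query; returns a 2D list of floats."""
    return [[cosine(cell, query_embedding) for cell in row]
            for row in map_embeddings]


def best_cell(heatmap):
    """Single-object localization: the (row, col) with the highest score."""
    cells = ((r, c) for r, row in enumerate(heatmap) for c in range(len(row)))
    return max(cells, key=lambda rc: heatmap[rc[0]][rc[1]])


def cells_above(heatmap, threshold):
    """Multi-object discovery: every (row, col) scoring at least `threshold`."""
    return [(r, c) for r, row in enumerate(heatmap)
            for c, score in enumerate(row) if score >= threshold]


# Toy data (assumption): random vectors replace real CLIP embeddings.
rng = random.Random(0)
DIM = 32
grid = [[[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(4)]
        for _ in range(4)]
query = [rng.gauss(0, 1) for _ in range(DIM)]

# Plant two noisy copies of the query so the map contains two "objects".
for r, c in [(1, 2), (3, 0)]:
    grid[r][c] = [q + 0.05 * rng.gauss(0, 1) for q in query]

heatmap = similarity_heatmap(grid, query)
print(best_cell(heatmap))
print(cells_above(heatmap, 0.9))
```

In a real pipeline the query vector would come from a CLIP text encoder and the per-cell vectors from the map-building stage; the threshold of 0.9 is an arbitrary value chosen for this toy data.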
No commits in the last 6 months.
Stars: 17
Forks: 1
Language: Python
License: —
Category:
Last pushed: Dec 03, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/intelligolabs/Le-RNR-Map"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
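For scripted access, the same request can be sketched in Python using only the standard library. The URL is the one shown in the curl command; the response format is not documented here, so this sketch returns the body as text rather than assuming JSON. The function name `fetch_listing` and the 10-second timeout are illustrative choices, not part of the API.

```python
import urllib.request

# URL copied from the curl command on this page.
API_URL = (
    "https://pt-edge.onrender.com/api/v1/quality/embeddings/"
    "intelligolabs/Le-RNR-Map"
)


def fetch_listing(url=API_URL, timeout=10.0):
    """Fetch the listing and return the raw response body as text.

    The body is decoded but not parsed, because the response schema is
    not stated on this page (assumption: it is text-based).
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset)


# Example call (performs a network request, so it is left commented out):
# print(fetch_listing())
```

Unauthenticated requests count against the 100 requests/day limit mentioned above, so a script that polls this endpoint would need to cache results.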
Higher-rated alternatives
holisticon/multimodal-rag-demo
🧠🖼️📄 Multimodal RAG Demo based on Qwen3-VL Embedding and Reranker models
debanjan06/geospatial-rag
AI Framework for Remote Sensing Image Analysis using RAG - 88%+ accuracy, multi-modal queries,...
aimagelab/ReT
[CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval
berntpopp/phentrieve
AI-powered system for mapping clinical text to Human Phenotype Ontology (HPO) terms using...
hadil1999-creator/RAG_Hack_team
Our AI Financial Advisor is designed to revolutionize how users interact with financial and...