eric-ai-lab/R2H
Official implementation of the EMNLP 2023 paper "R2H: Building Multimodal Navigation Helpers that Respond to Help Requests"
Introduces two task formulations—Respond to Dialog History (RDH) for single-turn response generation and Respond during Interaction (RdI) for real-time cooperative navigation—converting three existing vision-and-dialog datasets (CVDN, AVDN, DialFRED) across photo-realistic and synthetic environments. Proposes SeeRee, a multimodal response generation model combining dialog history and language inquiries with visual observations plus oracle trajectory imagery, deployable both offline for RDH evaluation and as a live API in Matterport3D simulators for RdI interactions. Provides baseline comparisons with zero-shot multimodal LLM approaches and includes human evaluation across distinct environment types.
No commits in the last 6 months.
Stars
5
Forks
1
Language
Python
License
—
Category
Last pushed
Jun 19, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/agents/eric-ai-lab/R2H"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Benaah/amaniquery
A Retrieval-Augmented Generation (RAG) system for Kenyan legal, parliamentary, and news...
haiha-ux/raganything-agno-wrapper
Production-ready wrapper bringing RAG-Anything's advanced multimodal capabilities to Agno...
dev-infinity101/MAYA-AI
A RAG based multi-Agent chatbot for govt scheme navigation and Personalized guidance
yashdew3/ragbot
AI-Powered RAG Chatbot using LangChain, FAISS & Streamlit | Smart Q&A from CSV | Open-Source!
OrlandContreras/rag-agent-crew
🎓 Sistema Multi-Agente RAG 100% Local Educativo - CrewAI + Ollama + Qdrant para aprendizaje y...