eric-ai-lab/R2H

Official implementation of the EMNLP 2023 paper "R2H: Building Multimodal Navigation Helpers that Respond to Help Requests"

Score: 17 / 100 (Experimental)

Introduces two task formulations, Respond to Dialog History (RDH) for single-turn response generation and Respond during Interaction (RdI) for real-time cooperative navigation, built by converting three existing vision-and-dialog datasets (CVDN, AVDN, DialFRED) that span photo-realistic and synthetic environments. Proposes SeeRee, a multimodal response generation model that takes the dialog history and language inquiry together with visual observations and oracle trajectory imagery; it can be run offline for RDH evaluation or served as a live API in Matterport3D simulators for RdI interactions. Provides baseline comparisons with zero-shot multimodal LLM approaches and includes human evaluation across distinct environment types.

No commits in the last 6 months.

No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 4 / 25
Maturity 1 / 25
Community 12 / 25


Stars: 5
Forks: 1
Language: Python
License: None
Last pushed: Jun 19, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/eric-ai-lab/R2H"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
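The same request can be made from Python. The following is a minimal sketch using only the standard library; the helper names `quality_url` and `fetch_quality` are hypothetical, and decoding the response body as JSON is an assumption, since this page does not document the response format.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/agents"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and decode the body.

    Assumption: the body is JSON. The page does not state the format,
    so adjust the decoding if the service returns something else.
    """
    request = urllib.request.Request(
        quality_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request, counts against the daily limit):
# data = fetch_quality("eric-ai-lab", "R2H")
```

With the keyless limit of 100 requests/day, it may be worth caching the result locally rather than calling `fetch_quality` on every run.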