Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.07954
Cited By
Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense Spatiotemporal Grounding
15 October 2020
Alexander Ku
Peter Anderson
Roma Patel
Eugene Ie
Jason Baldridge
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense Spatiotemporal Grounding"
50 / 223 papers shown
Title
Into the Unknown: Generating Geospatial Descriptions for New Environments
Tzuf Paz-Argaman
John Palowitch
Sayali Kulkarni
Reut Tsarfaty
Jason Baldridge
34
1
0
28 Jun 2024
Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions
Minghan Li
Heng Li
Zhi-Qi Cheng
Yifei Dong
Yuxuan Zhou
Jun-Yan He
Qi Dai
Teruko Mitamura
Alexander G. Hauptmann
LM&Ro
43
4
0
27 Jun 2024
MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation for Effective-and-Efficient Vision-and-Language Navigation
Liuyi Wang
Zongtao He
Mengjiao Shen
Jingwei Yang
Chengju Liu
Qijun Chen
VLM
33
2
0
25 Jun 2024
VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of Thought
Gabriel H. Sarch
Lawrence Jang
Michael J. Tarr
William W. Cohen
Kenneth Marino
Katerina Fragkiadaki
LLMAG
50
0
0
20 Jun 2024
CityNav: Language-Goal Aerial Navigation Dataset with Geographic Information
Jungdae Lee
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
Daichi Azuma
Yutaka Matsuo
Nakamasa Inoue
47
14
0
20 Jun 2024
Contrast Sets for Evaluating Language-Guided Robot Policies
Abrar Anwar
Rohan Gupta
Jesse Thomason
32
3
0
19 Jun 2024
Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation
Zihan Wang
Xiangyang Li
Jiahao Yang
Yeqi Liu
Shuqiang Jiang
LM&Ro
40
7
0
14 Jun 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
47
1
0
09 Jun 2024
InstructNav: Zero-shot System for Generic Instruction Navigation in Unexplored Environment
Yuxing Long
Wenzhe Cai
Hongcheng Wang
Guanqi Zhan
Hao Dong
35
23
0
07 Jun 2024
CoNav: A Benchmark for Human-Centered Collaborative Navigation
Changhao Li
Xinyu Sun
Peihao Chen
Jugang Fan
Zixu Wang
Yanxia Liu
Jinhui Zhu
Chuang Gan
Mingkui Tan
56
1
0
04 Jun 2024
Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts
Haodong Hong
Sen Wang
Zi Huang
Qi Wu
Jiajun Liu
38
3
0
04 Jun 2024
Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Bingqian Lin
Yunshuang Nie
Ziming Wei
Yi Zhu
Hang Xu
Shikui Ma
Jianzhuang Liu
Xiaodan Liang
LM&Ro
37
6
0
29 May 2024
MC-GPT: Empowering Vision-and-Language Navigation with Memory Map and Reasoning Chains
Zhaohuan Zhan
Lisha Yu
Sijie Yu
Guang Tan
LLMAG
LM&Ro
56
10
0
17 May 2024
HELPER-X: A Unified Instructable Embodied Agent to Tackle Four Interactive Vision-Language Domains with Memory-Augmented Language Models
Gabriel H. Sarch
Sahil Somani
Raghav Kapoor
Michael J. Tarr
Katerina Fragkiadaki
LM&Ro
LLMAG
37
3
0
29 Apr 2024
Vision-and-Language Navigation via Causal Learning
Liuyi Wang
Zongtao He
Ronghao Dang
Mengjiao Shen
Chengju Liu
Qijun Chen
CML
52
14
0
16 Apr 2024
GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation
Mukul Khanna
Ram Ramrakhya
Gunjan Chhablani
Sriram Yenamandra
Théophile Gervet
Matthew Chang
Z. Kira
Devendra Singh Chaplot
Dhruv Batra
Roozbeh Mottaghi
LM&Ro
59
23
0
09 Apr 2024
DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning
Mengfei Du
Binhao Wu
Jiwen Zhang
Zhihao Fan
Zejun Li
Ruipu Luo
Xuanjing Huang
Zhongyu Wei
33
3
0
02 Apr 2024
Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation
Zihan Wang
Xiangyang Li
Jiahao Yang
Yeqi Liu
Junjie Hu
Ming Jiang
Shuqiang Jiang
47
16
0
02 Apr 2024
Semantic Map-based Generation of Navigation Instructions
Chengzu Li
Chao Zhang
Simone Teufel
R. Doddipatla
Svetlana Stoyanchev
34
2
0
28 Mar 2024
Scaling Vision-and-Language Navigation With Offline RL
Valay Bundele
Mahesh Bhupati
Biplab Banerjee
Aditya Grover
OffRL
29
1
0
27 Mar 2024
OVER-NAV: Elevating Iterative Vision-and-Language Navigation with Open-Vocabulary Detection and StructurEd Representation
Ganlong Zhao
Guanbin Li
Weikai Chen
Yizhou Yu
37
4
0
26 Mar 2024
Continual Vision-and-Language Navigation
Seongjun Jeong
Gi-Cheon Kang
Seongho Choi
Joochan Kim
Byoung-Tak Zhang
41
2
0
22 Mar 2024
Prioritized Semantic Learning for Zero-shot Instance Navigation
Xander Sun
Louis Lau
Hoyard Zhi
Ronghe Qiu
Junwei Liang
40
8
0
18 Mar 2024
Hierarchical Spatial Proximity Reasoning for Vision-and-Language Navigation
Ming Xu
Zilong Xie
35
2
0
18 Mar 2024
Can LLMs Generate Human-Like Wayfinding Instructions? Towards Platform-Agnostic Embodied Instruction Synthesis
Vishnu Sashank Dorbala
Sanjoy Chowdhury
Dinesh Manocha
LM&Ro
38
0
0
18 Mar 2024
Tur[k]ingBench: A Challenge Benchmark for Web Agents
Kevin Xu
Yeganeh Kordi
Kate Sanders
Yizhong Wang
Adam Byerly
Kate Sanders
Adam Byerly
Jingyu Zhang
Benjamin Van Durme
Daniel Khashabi
LLMAG
75
6
0
18 Mar 2024
Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation
Francesco Taioli
Stefano Rosa
A. Castellini
Lorenzo Natale
Alessio Del Bue
Alessandro Farinelli
Marco Cristani
Yiming Wang
41
5
0
15 Mar 2024
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning
Bingqian Lin
Yunshuang Nie
Ziming Wei
Jiaqi Chen
Shikui Ma
Jianhua Han
Hang Xu
Xiaojun Chang
Xiaodan Liang
LM&Ro
LRM
62
20
0
12 Mar 2024
Causality-based Cross-Modal Representation Learning for Vision-and-Language Navigation
Liuyi Wang
Zongtao He
Ronghao Dang
Huiyi Chen
Chengju Liu
Qi Chen
41
1
0
06 Mar 2024
Where Do We Go from Here? Multi-scale Allocentric Relational Inference from Natural Spatial Descriptions
Tzuf Paz-Argaman
Sayali Kulkarni
John Palowitch
Jason Baldridge
Reut Tsarfaty
24
3
0
26 Feb 2024
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
Jiazhao Zhang
Kunyu Wang
Rongtao Xu
Gengze Zhou
Yicong Hong
Xiaomeng Fang
Qi Wu
Zhizheng Zhang
Wang He
LM&Ro
40
45
0
24 Feb 2024
Vision-Language Navigation with Embodied Intelligence: A Survey
Peng Gao
Peng Wang
Feng Gao
Fei-Yue Wang
Ruyue Yuan
LM&Ro
40
2
0
22 Feb 2024
OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models
Yuxuan Kuang
Hai Lin
Meng Jiang
LM&Ro
44
26
0
16 Feb 2024
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
Jialu Li
Aishwarya Padmakumar
Gaurav Sukhatme
Mohit Bansal
29
6
0
05 Feb 2024
NavHint: Vision and Language Navigation Agent with a Hint Generator
Yue Zhang
Quan Guo
Parisa Kordjamshidi
LLMAG
32
9
0
04 Feb 2024
Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World
Kiana Ehsani
Tanmay Gupta
Rose Hendrix
Jordi Salvador
Luca Weihs
...
Alvaro Herrasti
Ranjay Krishna
Dustin Schwenk
Eli VanderBilt
Aniruddha Kembhavi
30
12
0
05 Dec 2023
Towards Learning a Generalist Model for Embodied Navigation
Duo Zheng
Shijia Huang
Lin Zhao
Yiwu Zhong
Liwei Wang
LM&Ro
41
41
0
04 Dec 2023
Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty
Cheng-Fu Yang
Haoyang Xu
Te-Lin Wu
Xiaofeng Gao
Kai-Wei Chang
Feng Gao
DiffM
38
8
0
02 Dec 2023
Which way is `right'?: Uncovering limitations of Vision-and-Language Navigation model
Meera Hahn
Amit Raj
James M. Rehg
30
3
0
30 Nov 2023
Does VLN Pretraining Work with Nonsensical or Irrelevant Instructions?
Wang Zhu
Ishika Singh
Yuan Huang
Robin Jia
Jesse Thomason
34
2
0
28 Nov 2023
Fast-Slow Test-Time Adaptation for Online Vision-and-Language Navigation
Junyu Gao
Xuan Yao
Changsheng Xu
TTA
30
3
0
22 Nov 2023
A Survey on Multimodal Large Language Models for Autonomous Driving
Can Cui
Yunsheng Ma
Xu Cao
Wenqian Ye
Yang Zhou
...
Xinrui Yan
Shuqi Mei
Jianguo Cao
Ziran Wang
Chao Zheng
41
251
0
21 Nov 2023
Advances in Embodied Navigation Using Large Language Models: A Survey
Jinzhou Lin
Han Gao
Xuxiang Feng
Rongtao Xu
Changwei Wang
Man Zhang
Li Guo
Shibiao Xu
LM&Ro
LLMAG
68
9
0
01 Nov 2023
Apollo: Zero-shot MultiModal Reasoning with Multiple Experts
Daniela Ben-David
Tzuf Paz-Argaman
Reut Tsarfaty
MoE
26
0
0
25 Oct 2023
Open-Ended Instructable Embodied Agents with Memory-Augmented Large Language Models
Gabriel H. Sarch
Yue Wu
Michael J. Tarr
Katerina Fragkiadaki
LM&Ro
LLMAG
27
19
0
23 Oct 2023
Semantic and Expressive Variation in Image Captions Across Languages
Andre Ye
Sebastin Santy
Jena D. Hwang
Amy X. Zhang
Ranjay Krishna
VLM
58
3
0
22 Oct 2023
Habitat 3.0: A Co-Habitat for Humans, Avatars and Robots
Xavi Puig
Eric Undersander
Andrew Szot
Mikael Dallaire Cote
Tsung-Yen Yang
...
Singh Chaplot
Unnat Jain
Dhruv Batra
†. AksharaRai
†. RoozbehMottaghi
16
115
0
19 Oct 2023
LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following
Cheng Yang
Yen-Chun Chen
Jianwei Yang
Xiyang Dai
Lu Yuan
Yu-Chiang Frank Wang
Kai-Wei Chang
LM&Ro
25
9
0
18 Oct 2023
Vision and Language Navigation in the Real World via Online Visual Language Mapping
Chengguang Xu
Hieu T. Nguyen
Christopher Amato
Lawson L. S. Wong
32
9
0
16 Oct 2023
Evaluating Explanation Methods for Vision-and-Language Navigation
Guanqi Chen
Lei Yang
Guanhua Chen
Jia Pan
XAI
23
0
0
10 Oct 2023
Previous
1
2
3
4
5
Next