1905.12255
Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation
29 May 2019
Vihan Jain
Gabriel Ilharco
Alexander Ku
Ashish Vaswani
Eugene Ie
Jason Baldridge
LM&Ro
Papers citing
"Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation"
30 / 30 papers shown
Title
HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard
Yifei Dong
Fengyi Wu
Qi He
Heng Li
Minghan Li
...
Yuxuan Zhou
Jingdong Sun
Qi Dai
Zhi-Qi Cheng
Alexander G. Hauptmann
LM&Ro
81
0
0
18 Mar 2025
EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments
Dongping Li
Tielong Cai
Tianci Tang
Wenhao Chai
Katherine Rose Driggs-Campbell
Gaoang Wang
LM&Ro
214
0
0
11 Mar 2025
TRAVEL: Training-Free Retrieval and Alignment for Vision-and-Language Navigation
Navid Rajabi
Jana Kosecka
LM&Ro
3DV
133
0
0
11 Feb 2025
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
Xinshuai Song
Weixing Chen
Yang Liu
Weikai Chen
Guanbin Li
Liang Lin
194
7
0
12 Dec 2024
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning
Bingqian Lin
Yunshuang Nie
Ziming Wei
Jiaqi Chen
Shikui Ma
Jianhua Han
Hang Xu
Xiaojun Chang
Xiaodan Liang
LM&Ro
LRM
123
28
0
12 Mar 2024
Counterfactual Vision-and-Language Navigation via Adversarial Path Sampling
Tsu-Jui Fu
Xinze Wang
Matthew F. Peterson
Scott T. Grafton
Miguel P. Eckstein
William Yang Wang
105
43
0
17 Nov 2019
Multi-modal Discriminative Model for Vision-and-Language Navigation
Haoshuo Huang
Vihan Jain
Harsh Mehta
Jason Baldridge
Eugene Ie
LM&Ro
68
26
0
31 May 2019
Self-Monitoring Navigation Agent via Auxiliary Progress Estimation
Chih-Yao Ma
Jiasen Lu
Zuxuan Wu
G. Al-Regib
Z. Kira
R. Socher
Caiming Xiong
LM&Ro
92
278
0
10 Jan 2019
Touchdown: Natural Language Navigation and Spatial Reasoning in Visual Street Environments
Howard Chen
Alane Suhr
Dipendra Kumar Misra
Noah Snavely
Yoav Artzi
86
390
0
29 Nov 2018
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
Xin Eric Wang
Qiuyuan Huang
Asli Celikyilmaz
Jianfeng Gao
Dinghan Shen
Yuan-fang Wang
William Yang Wang
Lei Zhang
LM&Ro
SSL
117
541
0
25 Nov 2018
Mapping Navigation Instructions to Continuous Control Actions with Position-Visitation Prediction
Valts Blukis
Dipendra Kumar Misra
Ross A. Knepper
Yoav Artzi
69
82
0
10 Nov 2018
Shifting the Baseline: Single Modality Performance on Visual Navigation & QA
Jesse Thomason
Daniel Gordon
Yonatan Bisk
89
75
0
01 Nov 2018
Mapping Instructions to Actions in 3D Environments with Visual Goal Prediction
Dipendra Kumar Misra
Andrew Bennett
Valts Blukis
Eyvind Niklasson
Max Shatkhin
Yoav Artzi
LM&Ro
84
188
0
04 Sep 2018
On Evaluation of Embodied Navigation Agents
Peter Anderson
Angel X. Chang
Devendra Singh Chaplot
Alexey Dosovitskiy
Saurabh Gupta
...
Jana Kosecka
Jitendra Malik
Roozbeh Mottaghi
Manolis Savva
Amir Zamir
120
805
0
18 Jul 2018
Talk the Walk: Navigating New York City through Grounded Dialogue
H. D. Vries
Kurt Shuster
Dhruv Batra
Devi Parikh
Jason Weston
Douwe Kiela
70
124
0
09 Jul 2018
Speaker-Follower Models for Vision-and-Language Navigation
Daniel Fried
Ronghang Hu
Volkan Cirik
Anna Rohrbach
Jacob Andreas
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
Kate Saenko
Dan Klein
Trevor Darrell
LM&Ro
LRM
319
505
0
07 Jun 2018
FollowNet: Robot Navigation by Following Natural Language Directions with Deep Reinforcement Learning
Pararth Shah
Marek Fiser
Aleksandra Faust
J. Kew
Dilek Z. Hakkani-Tür
83
52
0
16 May 2018
CHALET: Cornell House Agent Learning Environment
Claudia Yan
Dipendra Kumar Misra
Andrew Bennett
Aaron Walsman
Yonatan Bisk
Yoav Artzi
LM&Ro
76
93
0
23 Jan 2018
Learning Interpretable Spatial Operations in a Rich 3D Blocks World
Yonatan Bisk
Kevin J. Shih
Yejin Choi
D. Marcu
65
63
0
10 Dec 2017
Video Captioning via Hierarchical Reinforcement Learning
Xin Eric Wang
Wenhu Chen
Jiawei Wu
Yuan-fang Wang
William Yang Wang
101
229
0
29 Nov 2017
Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments
Peter Anderson
Qi Wu
Damien Teney
Jake Bruce
Mark Johnson
Niko Sünderhauf
Ian Reid
Stephen Gould
Anton Van Den Hengel
LM&Ro
118
1,324
0
20 Nov 2017
Visual Dialog
Abhishek Das
Satwik Kottur
Khushi Gupta
Avi Singh
Deshraj Yadav
José M. F. Moura
Devi Parikh
Dhruv Batra
157
1,004
0
26 Nov 2016
Learning Transferable Policies for Monocular Reactive MAV Control
S. Daftry
J. Andrew Bagnell
Martial Hebert
68
85
0
01 Aug 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
207
8,881
0
04 Feb 2016
Stacked Attention Networks for Image Question Answering
Zichao Yang
Xiaodong He
Jianfeng Gao
Li Deng
Alex Smola
BDL
118
1,884
0
07 Nov 2015
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks
Haonan Yu
Jiang Wang
Zhiheng Huang
Yi Yang
Wenyuan Xu
90
560
0
26 Oct 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
233
5,509
0
03 May 2015
From Captions to Visual Concepts and Back
Hao Fang
Saurabh Gupta
F. Iandola
R. Srivastava
Li Deng
...
Xiaodong He
Margaret Mitchell
John C. Platt
C. L. Zitnick
Geoffrey Zweig
VLM
122
1,312
0
18 Nov 2014
Show and Tell: A Neural Image Caption Generator
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
262
6,036
0
17 Nov 2014
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
VLM
173
6,057
0
17 Nov 2014