1903.00401
Cited By
Learning To Follow Directions in Street View
1 March 2019
Karl Moritz Hermann
Mateusz Malinowski
Piotr Wojciech Mirowski
Andras Banki-Horvath
Keith Anderson
R. Hadsell
SSL
Papers citing "Learning To Follow Directions in Street View"
44 / 44 papers shown
Title
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang
Ziqiao Ma
Jialu Li
Yanyuan Qiao
Zun Wang
J. Chai
Qi Wu
Joey Tianyi Zhou
Parisa Kordjamshidi
LRM
63
19
0
31 Dec 2024
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
Xinhao Liu
Jiyang Li
Yichen Jiang
Niranjan Sujay
Zhiyong Yang
Juexiao Zhang
John Abanes
Jing Zhang
Chen Feng
114
1
0
26 Nov 2024
Perceive, Reflect, and Plan: Designing LLM Agent for Goal-Directed City Navigation without Instructions
Qingbin Zeng
Qinglong Yang
Shunan Dong
Heming Du
Liang Zheng
Fengli Xu
Yong Li
LLMAG
LM&Ro
45
9
0
08 Aug 2024
Can ChatGPT assist visually impaired people with micro-navigation?
Junxian He
Shrinivas J. Pundlik
Gang Luo
27
0
0
31 Jul 2024
CityNav: Language-Goal Aerial Navigation Dataset with Geographic Information
Jungdae Lee
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
Daichi Azuma
Yutaka Matsuo
Nakamasa Inoue
47
14
0
20 Jun 2024
Semantic Map-based Generation of Navigation Instructions
Chengzu Li
Chao Zhang
Simone Teufel
R. Doddipatla
Svetlana Stoyanchev
34
2
0
28 Mar 2024
Vision-Language Navigation with Embodied Intelligence: A Survey
Peng Gao
Peng Wang
Feng Gao
Fei Wang
Ruyue Yuan
LM&Ro
43
2
0
22 Feb 2024
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
Jialu Li
Aishwarya Padmakumar
Gaurav Sukhatme
Mohit Bansal
29
6
0
05 Feb 2024
Multi-model fusion for Aerial Vision and Dialog Navigation based on human attention aids
Xinyi Wang
Xuan Cui
Danxu Li
Fang Liu
Licheng Jiao
18
0
0
27 Aug 2023
Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation
Yibo Cui
Liang Xie
Yakun Zhang
Meishan Zhang
Ye Yan
Erwei Yin
LM&Ro
34
16
0
24 Aug 2023
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View
Raphael Schumann
Wanrong Zhu
Weixi Feng
Tsu-jui Fu
Stefan Riezler
William Yang Wang
LM&Ro
29
63
0
12 Jul 2023
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding
Minyoung Hwang
Jaeyeon Jeong
Minsoo Kim
Yoonseon Oh
Songhwai Oh
38
19
0
07 Mar 2023
Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation
Chia-Wen Kuo
Chih-Yao Ma
Judy Hoffman
Z. Kira
35
10
0
20 Nov 2022
DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents
Ziqiao Ma
B. VanDerPloeg
Cristian-Paul Bara
Yidong Huang
Eui-In Kim
Felix Gervits
M. Marge
J. Chai
63
7
0
22 Oct 2022
ViLPAct: A Benchmark for Compositional Generalization on Multimodal Human Activities
Terry Yue Zhuo
Yaqing Liao
Yuecheng Lei
Lizhen Qu
Gerard de Melo
Xiaojun Chang
Yazhou Ren
Zenglin Xu
42
2
0
11 Oct 2022
A Priority Map for Vision-and-Language Navigation with Trajectory Plans and Feature-Location Cues
Jason Armitage
L. Impett
Rico Sennrich
36
5
0
24 Jul 2022
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
158
437
0
10 Jul 2022
Learning Local Implicit Fourier Representation for Image Warping
Jae-Won Lee
K. Choi
Kyong Hwan Jin
14
16
0
05 Jul 2022
EnvEdit: Environment Editing for Vision-and-Language Navigation
Jialu Li
Hao Tan
Joey Tianyi Zhou
31
80
0
29 Mar 2022
Analyzing Generalization of Vision and Language Navigation to Unseen Outdoor Areas
Raphael Schumann
Stefan Riezler
21
26
0
25 Mar 2022
Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions
Jing Gu
Eliana Stefani
Qi Wu
Jesse Thomason
Qing Guo
LM&Ro
30
104
0
22 Mar 2022
Are you doing what I say? On modalities alignment in ALFRED
Ting-Rui Chiang
Yi-Ting Yeh
Ta-Chung Chi
Yau-Shian Wang
27
1
0
12 Oct 2021
Waypoint Models for Instruction-guided Navigation in Continuous Environments
Jacob Krantz
Aaron Gokaslan
Dhruv Batra
Stefan Lee
Oleksandr Maksymets
LM&Ro
137
76
0
05 Oct 2021
Vision-Language Navigation: A Survey and Taxonomy
Wansen Wu
Tao Chang
Xinmeng Li
LM&Ro
17
19
0
26 Aug 2021
Core Challenges in Embodied Vision-Language Planning
Jonathan M Francis
Nariaki Kitamura
Felix Labelle
Xiaopeng Lu
Ingrid Navarro
Jean Oh
LM&Ro
47
45
0
26 Jun 2021
Look Wide and Interpret Twice: Improving Performance on Interactive Instruction-following Tasks
Van-Quang Nguyen
Masanori Suganuma
Takayuki Okatani
LM&Ro
18
35
0
01 Jun 2021
Generating Landmark Navigation Instructions from Maps as a Graph-to-Text Problem
Raphael Schumann
Stefan Riezler
22
27
0
30 Dec 2020
Visually Grounding Language Instruction for History-Dependent Manipulation
Hyemin Ahn
Obin Kwon
Kyungdo Kim
Jaeyeon Jeong
Howoong Jun
Hongjung Lee
Dongheui Lee
Songhwai Oh
LM&Ro
21
6
0
16 Dec 2020
ArraMon: A Joint Navigation-Assembly Instruction Interpretation Task in Dynamic Environments
Hyounghun Kim
Abhaysinh Zala
Graham Burri
Hao Tan
Joey Tianyi Zhou
LM&Ro
27
16
0
15 Nov 2020
Safe Reinforcement Learning with Natural Language Constraints
Tsung-Yen Yang
Michael Y. Hu
Yinlam Chow
Peter J. Ramadge
Karthik Narasimhan
19
29
0
11 Oct 2020
Zero-Shot Compositional Policy Learning via Language Grounding
Tianshi Cao
Jingkang Wang
Yining Zhang
S. Manivasagam
LM&Ro
34
1
0
15 Apr 2020
Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments
Jacob Krantz
Erik Wijmans
Arjun Majumdar
Dhruv Batra
Stefan Lee
24
266
0
06 Apr 2020
Visual Grounding in Video for Unsupervised Word Translation
Gunnar A. Sigurdsson
Jean-Baptiste Alayrac
Aida Nematzadeh
Lucas Smaira
Mateusz Malinowski
João Carreira
Phil Blunsom
Andrew Zisserman
VGen
11
49
0
11 Mar 2020
MVP: Unified Motion and Visual Self-Supervised Learning for Large-Scale Robotic Navigation
Marvin Chancán
Michael Milford
SSL
33
8
0
02 Mar 2020
Retouchdown: Adding Touchdown to StreetLearn as a Shareable Resource for Language Grounding Tasks in Street View
Harsh Mehta
Yoav Artzi
Jason Baldridge
Eugene Ie
Piotr Wojciech Mirowski
14
25
0
10 Jan 2020
Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments
Martin Weiss
Simon Chamorro
Roger Girgis
Margaux Luck
Samira Ebrahimi Kahou
Joseph Paul Cohen
Derek Nowrouzezahrai
Doina Precup
Florian Golemo
C. Pal
42
10
0
29 Oct 2019
HIGhER : Improving instruction following with Hindsight Generation for Experience Replay
Geoffrey Cideron
Mathieu Seurin
Florian Strub
Olivier Pietquin
24
37
0
21 Oct 2019
CityLearn: Diverse Real-World Environments for Sample-Efficient Navigation Policy Learning
Marvin Chancán
Michael Milford
SSL
27
5
0
10 Oct 2019
Talk2Nav: Long-Range Vision-and-Language Navigation with Dual Attention and Spatial Memory
A. Vasudevan
Ahmed K. Farahat
Chetan Gupta
LM&Ro
33
2
0
04 Oct 2019
Transferable Representation Learning in Vision-and-Language Navigation
Haoshuo Huang
Vihan Jain
Harsh Mehta
Alexander Ku
Gabriel Ilharco
Jason Baldridge
Eugene Ie
LM&Ro
25
85
0
09 Aug 2019
Cross-View Policy Learning for Street Navigation
Ang Li
Huiyi Hu
Piotr Wojciech Mirowski
Mehrdad Farajtabar
30
27
0
13 Jun 2019
Multi-modal Discriminative Model for Vision-and-Language Navigation
Haoshuo Huang
Vihan Jain
Harsh Mehta
Jason Baldridge
Eugene Ie
LM&Ro
25
26
0
31 May 2019
Speaker-Follower Models for Vision-and-Language Navigation
Daniel Fried
Ronghang Hu
Volkan Cirik
Anna Rohrbach
Jacob Andreas
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
Kate Saenko
Dan Klein
Trevor Darrell
LM&Ro
LRM
260
498
0
07 Jun 2018
Learning Deep Representations of Fine-grained Visual Descriptions
Scott E. Reed
Zeynep Akata
Bernt Schiele
Honglak Lee
OCL
VLM
176
840
0
17 May 2016