2104.10674
Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation
21 April 2021
Muhammad Zubair Irshad
Chih-Yao Ma
Z. Kira
LM&Ro
Papers citing
"Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"
36 / 36 papers shown
Title
A Survey of Robotic Navigation and Manipulation with Physics Simulators in the Era of Embodied AI
Lik Hang Kenny Wong
Xueyang Kang
Kaixin Bai
Jianwei Zhang
56
0
0
01 May 2025
Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments
Zerui Li
Gengze Zhou
Haodong Hong
Yanyan Shao
Wenqi Lyu
Yanyuan Qiao
Qi Wu
68
1
0
26 Feb 2025
Beyond Confidence: Adaptive Abstention in Dual-Threshold Conformal Prediction for Autonomous System Perception
Divake Kumar
Nastaran Darabi
Sina Tayebati
A. R. Trivedi
74
0
0
11 Feb 2025
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang
Ziqiao Ma
Jialu Li
Yanyuan Qiao
Zun Wang
J. Chai
Qi Wu
Joey Tianyi Zhou
Parisa Kordjamshidi
LRM
63
18
0
31 Dec 2024
Neural Fields in Robotics: A Survey
Muhammad Zubair Irshad
Mauro Comi
Yen-Chen Lin
Nick Heppert
Abhinav Valada
Rares Ambrus
Z. Kira
Jonathan Tremblay
AI4CE
50
3
0
26 Oct 2024
Language-guided Robust Navigation for Mobile Robots in Dynamically-changing Environments
Cody Simons
Zhichao Liu
Brandon Marcus
A. Roy-Chowdhury
Konstantinos Karydis
23
0
0
28 Sep 2024
StratXplore: Strategic Novelty-seeking and Instruction-aligned Exploration for Vision and Language Navigation
Muraleekrishna Gopinathan
Jumana Abu-Khalaf
David Suter
Martin Masek
29
0
0
09 Sep 2024
Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation
Jiaqi Chen
Bingqian Lin
Xinmin Liu
Lin Ma
Xiaodan Liang
Kwan-Yee K. Wong
LM&Ro
44
9
0
08 Jul 2024
Vision-Language Navigation with Embodied Intelligence: A Survey
Peng Gao
Peng Wang
Feng Gao
Fei-Yue Wang
Ruyue Yuan
LM&Ro
37
2
0
22 Feb 2024
Safe-VLN: Collision Avoidance for Vision-and-Language Navigation of Autonomous Robots Operating in Continuous Environments
Lu Yue
Dongliang Zhou
Liang Xie
Feitian Zhang
Ye Yan
Erwei Yin
34
9
0
06 Nov 2023
FSD: Fast Self-Supervised Single RGB-D to Categorical 3D Objects
Mayank Lunayach
Sergey Zakharov
Dian Chen
Rares Ambrus
Z. Kira
Muhammad Zubair Irshad
3DPC
33
12
0
19 Oct 2023
Vision and Language Navigation in the Real World via Online Visual Language Mapping
Chengguang Xu
Hieu T. Nguyen
Christopher Amato
Lawson L. S. Wong
32
9
0
16 Oct 2023
Meta-Optimization for Higher Model Generalizability in Single-Image Depth Prediction
Cho-Ying Wu
Yiqi Zhong
Junying Wang
Ulrich Neumann
MDE
30
5
0
12 May 2023
Accessible Instruction-Following Agent
Kairui Zhou
34
1
0
08 May 2023
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding
Minyoung Hwang
Jaeyeon Jeong
Minsoo Kim
Yoonseon Oh
Songhwai Oh
22
19
0
07 Mar 2023
MLANet: Multi-Level Attention Network with Sub-instruction for Continuous Vision-and-Language Navigation
Zongtao He
Liuyi Wang
Shu Li
Qingqing Yan
Chengju Liu
Qi Chen
19
7
0
02 Mar 2023
Learning Bidirectional Action-Language Translation with Limited Supervision and Incongruent Input
Ozan Ozdemir
Matthias Kerzel
C. Weber
Jae Hee Lee
Muhammad Burhan Hafez
P. Bruns
S. Wermter
21
1
0
09 Jan 2023
Knowledge-driven Scene Priors for Semantic Audio-Visual Embodied Navigation
Gyan Tatiya
Jonathan M Francis
Luca Bondi
Ingrid Navarro
Eric Nyberg
Jivko Sinapov
Jean Oh
27
8
0
21 Dec 2022
DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents
Ziqiao Ma
B. VanDerPloeg
Cristian-Paul Bara
Yidong Huang
Eui-In Kim
Felix Gervits
M. Marge
J. Chai
60
7
0
22 Oct 2022
Iterative Vision-and-Language Navigation
Jacob Krantz
Shurjo Banerjee
Wang Zhu
Jason J. Corso
Peter Anderson
Stefan Lee
Jesse Thomason
LM&Ro
40
18
0
06 Oct 2022
Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Kanishk Jain
Varun Chhangani
Amogh Tiwari
K. M. Krishna
Vineet Gandhi
LM&Ro
18
27
0
24 Sep 2022
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
55
46
0
24 Aug 2022
ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and Pose Optimization
Muhammad Zubair Irshad
Sergey Zakharov
Rares Ambrus
Thomas Kollar
Z. Kira
Adrien Gaidon
3DH
19
63
0
27 Jul 2022
Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments
Jacob Krantz
Stefan Lee
17
36
0
20 Apr 2022
Embodied Navigation at the Art Gallery
Roberto Bigazzi
Federico Landi
S. Cascianelli
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
LM&Ro
21
3
0
19 Apr 2022
Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions
Jing Gu
Eliana Stefani
Qi Wu
Jesse Thomason
Qing Guo
LM&Ro
30
104
0
22 Mar 2022
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
Yicong Hong
Zun Wang
Qi Wu
Stephen Gould
3DV
26
64
0
05 Mar 2022
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
LM&Ro
28
139
0
23 Feb 2022
A Dataset for Interactive Vision-Language Navigation with Unknown Command Feasibility
Andrea Burns
Deniz Arsan
Sanjna Agrawal
Ranjitha Kumar
Kate Saenko
Bryan A. Plummer
39
59
0
04 Feb 2022
Waypoint Models for Instruction-guided Navigation in Continuous Environments
Jacob Krantz
Aaron Gokaslan
Dhruv Batra
Stefan Lee
Oleksandr Maksymets
LM&Ro
137
76
0
05 Oct 2021
Focus on Impact: Indoor Exploration with Intrinsic Motivation
Roberto Bigazzi
Federico Landi
S. Cascianelli
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
OffRL
29
13
0
14 Sep 2021
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments
Muhammad Zubair Irshad
Niluthpol Chowdhury Mithun
Zachary Seymour
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
LM&Ro
23
49
0
26 Aug 2021
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
Junha Roh
Karthik Desingh
Ali Farhadi
D. Fox
22
95
0
07 Jul 2021
Core Challenges in Embodied Vision-Language Planning
Jonathan M Francis
Nariaki Kitamura
Felix Labelle
Xiaopeng Lu
Ingrid Navarro
Jean Oh
LM&Ro
47
45
0
26 Jun 2021
Neural Modular Control for Embodied Question Answering
Abhishek Das
Georgia Gkioxari
Stefan Lee
Devi Parikh
Dhruv Batra
LM&Ro
132
127
0
26 Oct 2018
Speaker-Follower Models for Vision-and-Language Navigation
Daniel Fried
Ronghang Hu
Volkan Cirik
Anna Rohrbach
Jacob Andreas
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
Kate Saenko
Dan Klein
Trevor Darrell
LM&Ro
LRM
260
496
0
07 Jun 2018