ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.10674
  4. Cited By
Hierarchical Cross-Modal Agent for Robotics Vision-and-Language
  Navigation

Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation

21 April 2021
Muhammad Zubair Irshad
Chih-Yao Ma
Z. Kira
    LM&Ro
ArXivPDFHTML

Papers citing "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

36 / 36 papers shown
Title
A Survey of Robotic Navigation and Manipulation with Physics Simulators in the Era of Embodied AI
A Survey of Robotic Navigation and Manipulation with Physics Simulators in the Era of Embodied AI
Lik Hang Kenny Wong
Xueyang Kang
Kaixin Bai
Jianwei Zhang
56
0
0
01 May 2025
Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments
Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments
Zerui Li
Gengze Zhou
Haodong Hong
Yanyan Shao
Wenqi Lyu
Yanyuan Qiao
Qi Wu
68
1
0
26 Feb 2025
Beyond Confidence: Adaptive Abstention in Dual-Threshold Conformal Prediction for Autonomous System Perception
Beyond Confidence: Adaptive Abstention in Dual-Threshold Conformal Prediction for Autonomous System Perception
Divake Kumar
Nastaran Darabi
Sina Tayebati
A. R. Trivedi
74
0
0
11 Feb 2025
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang
Ziqiao Ma
Jialu Li
Yanyuan Qiao
Zun Wang
J. Chai
Qi Wu
Joey Tianyi Zhou
Parisa Kordjamshidi
LRM
63
18
0
31 Dec 2024
Neural Fields in Robotics: A Survey
Neural Fields in Robotics: A Survey
Muhammad Zubair Irshad
Mauro Comi
Yen-Chen Lin
Nick Heppert
Abhinav Valada
Rares Ambrus
Z. Kira
Jonathan Tremblay
AI4CE
50
3
0
26 Oct 2024
Language-guided Robust Navigation for Mobile Robots in
  Dynamically-changing Environments
Language-guided Robust Navigation for Mobile Robots in Dynamically-changing Environments
Cody Simons
Zhichao Liu
Brandon Marcus
A. Roy-Chowdhury
Konstantinos Karydis
23
0
0
28 Sep 2024
StratXplore: Strategic Novelty-seeking and Instruction-aligned
  Exploration for Vision and Language Navigation
StratXplore: Strategic Novelty-seeking and Instruction-aligned Exploration for Vision and Language Navigation
Muraleekrishna Gopinathan
Jumana Abu-Khalaf
David Suter
Martin Masek
29
0
0
09 Sep 2024
Affordances-Oriented Planning using Foundation Models for Continuous
  Vision-Language Navigation
Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation
Jiaqi Chen
Bingqian Lin
Xinmin Liu
Lin Ma
Xiaodan Liang
Kwan-Yee K. Wong
LM&Ro
44
9
0
08 Jul 2024
Vision-Language Navigation with Embodied Intelligence: A Survey
Peng Gao
Peng Wang
Feng Gao
Fei-Yue Wang
Ruyue Yuan
LM&Ro
37
2
0
22 Feb 2024
Safe-VLN: Collision Avoidance for Vision-and-Language Navigation of
  Autonomous Robots Operating in Continuous Environments
Safe-VLN: Collision Avoidance for Vision-and-Language Navigation of Autonomous Robots Operating in Continuous Environments
Lu Yue
Dongliang Zhou
Liang Xie
Feitian Zhang
Ye Yan
Erwei Yin
34
9
0
06 Nov 2023
FSD: Fast Self-Supervised Single RGB-D to Categorical 3D Objects
FSD: Fast Self-Supervised Single RGB-D to Categorical 3D Objects
Mayank Lunayach
Sergey Zakharov
Dian Chen
Rares Ambrus
Z. Kira
Muhammad Zubair Irshad
3DPC
33
12
0
19 Oct 2023
Vision and Language Navigation in the Real World via Online Visual
  Language Mapping
Vision and Language Navigation in the Real World via Online Visual Language Mapping
Chengguang Xu
Hieu T. Nguyen
Christopher Amato
Lawson L. S. Wong
32
9
0
16 Oct 2023
Meta-Optimization for Higher Model Generalizability in Single-Image
  Depth Prediction
Meta-Optimization for Higher Model Generalizability in Single-Image Depth Prediction
Cho-Ying Wu
Yiqi Zhong
Junying Wang
Ulrich Neumann
MDE
30
5
0
12 May 2023
Accessible Instruction-Following Agent
Accessible Instruction-Following Agent
Kairui Zhou
34
1
0
08 May 2023
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation
  Using Scene Object Spectrum Grounding
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding
Minyoung Hwang
Jaeyeon Jeong
Minsoo Kim
Yoonseon Oh
Songhwai Oh
22
19
0
07 Mar 2023
MLANet: Multi-Level Attention Network with Sub-instruction for
  Continuous Vision-and-Language Navigation
MLANet: Multi-Level Attention Network with Sub-instruction for Continuous Vision-and-Language Navigation
Zongtao He
Liuyi Wang
Shu Li
Qingqing Yan
Chengju Liu
Qi Chen
19
7
0
02 Mar 2023
Learning Bidirectional Action-Language Translation with Limited
  Supervision and Incongruent Input
Learning Bidirectional Action-Language Translation with Limited Supervision and Incongruent Input
Ozan Ozdemir
Matthias Kerzel
C. Weber
Jae Hee Lee
Muhammad Burhan Hafez
P. Bruns
S. Wermter
21
1
0
09 Jan 2023
Knowledge-driven Scene Priors for Semantic Audio-Visual Embodied
  Navigation
Knowledge-driven Scene Priors for Semantic Audio-Visual Embodied Navigation
Gyan Tatiya
Jonathan M Francis
Luca Bondi
Ingrid Navarro
Eric Nyberg
Jivko Sinapov
Jean Oh
27
8
0
21 Dec 2022
DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in
  Interactive Autonomous Driving Agents
DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents
Ziqiao Ma
B. VanDerPloeg
Cristian-Paul Bara
Yidong Huang
Eui-In Kim
Felix Gervits
M. Marge
J. Chai
60
7
0
22 Oct 2022
Iterative Vision-and-Language Navigation
Iterative Vision-and-Language Navigation
Jacob Krantz
Shurjo Banerjee
Wang Zhu
Jason J. Corso
Peter Anderson
Stefan Lee
Jesse Thomason
LM&Ro
40
18
0
06 Oct 2022
Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Kanishk Jain
Varun Chhangani
Amogh Tiwari
K. M. Krishna
Vineet Gandhi
LM&Ro
18
27
0
24 Sep 2022
Learning from Unlabeled 3D Environments for Vision-and-Language
  Navigation
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
55
46
0
24 Aug 2022
ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and
  Pose Optimization
ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and Pose Optimization
Muhammad Zubair Irshad
Sergey Zakharov
Rares Ambrus
Thomas Kollar
Z. Kira
Adrien Gaidon
3DH
19
63
0
27 Jul 2022
Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous
  Environments
Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments
Jacob Krantz
Stefan Lee
17
36
0
20 Apr 2022
Embodied Navigation at the Art Gallery
Embodied Navigation at the Art Gallery
Roberto Bigazzi
Federico Landi
S. Cascianelli
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
LM&Ro
21
3
0
19 Apr 2022
Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future
  Directions
Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions
Jing Gu
Eliana Stefani
Qi Wu
Jesse Thomason
Qing Guo
LM&Ro
30
104
0
22 Mar 2022
Bridging the Gap Between Learning in Discrete and Continuous
  Environments for Vision-and-Language Navigation
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
Yicong Hong
Zun Wang
Qi Wu
Stephen Gould
3DV
26
64
0
05 Mar 2022
Think Global, Act Local: Dual-scale Graph Transformer for
  Vision-and-Language Navigation
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
LM&Ro
28
139
0
23 Feb 2022
A Dataset for Interactive Vision-Language Navigation with Unknown
  Command Feasibility
A Dataset for Interactive Vision-Language Navigation with Unknown Command Feasibility
Andrea Burns
Deniz Arsan
Sanjna Agrawal
Ranjitha Kumar
Kate Saenko
Bryan A. Plummer
39
59
0
04 Feb 2022
Waypoint Models for Instruction-guided Navigation in Continuous
  Environments
Waypoint Models for Instruction-guided Navigation in Continuous Environments
Jacob Krantz
Aaron Gokaslan
Dhruv Batra
Stefan Lee
Oleksandr Maksymets
LM&Ro
137
76
0
05 Oct 2021
Focus on Impact: Indoor Exploration with Intrinsic Motivation
Focus on Impact: Indoor Exploration with Intrinsic Motivation
Roberto Bigazzi
Federico Landi
S. Cascianelli
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
OffRL
29
13
0
14 Sep 2021
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for
  Vision-and-Language Navigation in Continuous Environments
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments
Muhammad Zubair Irshad
Niluthpol Chowdhury Mithun
Zachary Seymour
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
LM&Ro
23
49
0
26 Aug 2021
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
Junha Roh
Karthik Desingh
Ali Farhadi
D. Fox
22
95
0
07 Jul 2021
Core Challenges in Embodied Vision-Language Planning
Core Challenges in Embodied Vision-Language Planning
Jonathan M Francis
Nariaki Kitamura
Felix Labelle
Xiaopeng Lu
Ingrid Navarro
Jean Oh
LM&Ro
47
45
0
26 Jun 2021
Neural Modular Control for Embodied Question Answering
Neural Modular Control for Embodied Question Answering
Abhishek Das
Georgia Gkioxari
Stefan Lee
Devi Parikh
Dhruv Batra
LM&Ro
132
127
0
26 Oct 2018
Speaker-Follower Models for Vision-and-Language Navigation
Speaker-Follower Models for Vision-and-Language Navigation
Daniel Fried
Ronghang Hu
Volkan Cirik
Anna Rohrbach
Jacob Andreas
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
Kate Saenko
Dan Klein
Trevor Darrell
LM&Ro
LRM
260
496
0
07 Jun 2018
1