ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.04957
  4. Cited By
Vision-and-Dialog Navigation

Vision-and-Dialog Navigation

10 July 2019
Jesse Thomason
Michael Murray
Maya Cakmak
Luke Zettlemoyer
    LM&Ro
ArXivPDFHTML

Papers citing "Vision-and-Dialog Navigation"

50 / 230 papers shown
Title
Continual Vision-and-Language Navigation
Continual Vision-and-Language Navigation
Seongjun Jeong
Gi-Cheon Kang
Seongho Choi
Joochan Kim
Byoung-Tak Zhang
49
2
0
22 Mar 2024
Prioritized Semantic Learning for Zero-shot Instance Navigation
Prioritized Semantic Learning for Zero-shot Instance Navigation
Xander Sun
Louis Lau
Hoyard Zhi
Ronghe Qiu
Junwei Liang
45
8
0
18 Mar 2024
DiaLoc: An Iterative Approach to Embodied Dialog Localization
DiaLoc: An Iterative Approach to Embodied Dialog Localization
Chao Zhang
Mohan Li
Ignas Budvytis
Stephan Liwicki
52
2
0
11 Mar 2024
Towards Deviation-Robust Agent Navigation via Perturbation-Aware
  Contrastive Learning
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning
Bingqian Lin
Yanxin Long
Yi Zhu
Fengda Zhu
Xiaodan Liang
QiXiang Ye
Liang Lin
37
5
0
09 Mar 2024
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language
  Navigation
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
Jiazhao Zhang
Kunyu Wang
Rongtao Xu
Gengze Zhou
Yicong Hong
Xiaomeng Fang
Qi Wu
Zhizheng Zhang
Wang He
LM&Ro
42
45
0
24 Feb 2024
Vision-Language Navigation with Embodied Intelligence: A Survey
Peng Gao
Peng Wang
Feng Gao
Fei-Yue Wang
Ruyue Yuan
LM&Ro
43
2
0
22 Feb 2024
A Landmark-Aware Visual Navigation Dataset
A Landmark-Aware Visual Navigation Dataset
Faith Johnson
Bryan Bo Cao
Kristin J. Dana
Shubham Jain
Ashwin Ashok
3DV
34
0
0
22 Feb 2024
Improving Agent Interactions in Virtual Environments with Language
  Models
Improving Agent Interactions in Virtual Environments with Language Models
Jack Zhang
LLMAG
37
0
0
08 Feb 2024
Learning Communication Policies for Different Follower Behaviors in a
  Collaborative Reference Game
Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game
P. Sadler
Sherzod Hakimov
David Schlangen
29
1
0
07 Feb 2024
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language
  Navigation
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
Jialu Li
Aishwarya Padmakumar
Gaurav Sukhatme
Mohit Bansal
29
6
0
05 Feb 2024
Taking Action Towards Graceful Interaction: The Effects of Performing
  Actions on Modelling Policies for Instruction Clarification Requests
Taking Action Towards Graceful Interaction: The Effects of Performing Actions on Modelling Policies for Instruction Clarification Requests
Brielen Madureira
David Schlangen
50
2
0
30 Jan 2024
Towards Learning a Generalist Model for Embodied Navigation
Towards Learning a Generalist Model for Embodied Navigation
Duo Zheng
Shijia Huang
Lin Zhao
Yiwu Zhong
Liwei Wang
LM&Ro
41
41
0
04 Dec 2023
Which way is `right'?: Uncovering limitations of Vision-and-Language
  Navigation model
Which way is `right'?: Uncovering limitations of Vision-and-Language Navigation model
Meera Hahn
Amit Raj
James M. Rehg
32
3
0
30 Nov 2023
Does VLN Pretraining Work with Nonsensical or Irrelevant Instructions?
Does VLN Pretraining Work with Nonsensical or Irrelevant Instructions?
Wang Zhu
Ishika Singh
Yuan Huang
Robin Jia
Jesse Thomason
39
2
0
28 Nov 2023
Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with
  Spatial Relation Matching
Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching
Meng Chu
Zhedong Zheng
Wei Ji
Tingyu Wang
Tat-Seng Chua
23
10
0
21 Nov 2023
Advances in Embodied Navigation Using Large Language Models: A Survey
Advances in Embodied Navigation Using Large Language Models: A Survey
Jinzhou Lin
Han Gao
Xuxiang Feng
Rongtao Xu
Changwei Wang
Man Zhang
Li Guo
Shibiao Xu
LM&Ro
LLMAG
73
9
0
01 Nov 2023
Habitat 3.0: A Co-Habitat for Humans, Avatars and Robots
Habitat 3.0: A Co-Habitat for Humans, Avatars and Robots
Xavi Puig
Eric Undersander
Andrew Szot
Mikael Dallaire Cote
Tsung-Yen Yang
...
Singh Chaplot
Unnat Jain
Dhruv Batra
†. AksharaRai
†. RoozbehMottaghi
16
116
0
19 Oct 2023
Think, Act, and Ask: Open-World Interactive Personalized Robot
  Navigation
Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation
Yinpei Dai
Run Peng
Sikai Li
Joyce Chai
LM&Ro
40
24
0
12 Oct 2023
LangNav: Language as a Perceptual Representation for Navigation
LangNav: Language as a Perceptual Representation for Navigation
Bowen Pan
Yikang Shen
SouYoung Jin
Rogerio Feris
Aude Oliva
Phillip Isola
Yoon Kim
LM&Ro
28
18
0
11 Oct 2023
GRID: A Platform for General Robot Intelligence Development
GRID: A Platform for General Robot Intelligence Development
Sai H. Vemprala
Shuhang Chen
Abhinav Shukla
Dinesh Narayanan
Ashish Kapoor
32
10
0
02 Oct 2023
HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI
  Assistants in the Real World
HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World
Linghao Yang
Taein Kwon
Mahdi Rad
Bowen Pan
Ishani Chakraborty
...
Ashley Feniello
Rui Tian
Felipe Vieira Frujeri
Neel Joshi
Marc Pollefeys
EgoV
46
49
0
29 Sep 2023
Discuss Before Moving: Visual Language Navigation via Multi-expert
  Discussions
Discuss Before Moving: Visual Language Navigation via Multi-expert Discussions
Yuxing Long
Xiaoqi Li
Wenzhe Cai
Hao Dong
LLMAG
LM&Ro
32
45
0
20 Sep 2023
Cognitive Architectures for Language Agents
Cognitive Architectures for Language Agents
T. Sumers
Shunyu Yao
Karthik Narasimhan
Thomas Griffiths
LLMAG
LM&Ro
56
154
0
05 Sep 2023
Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language
  Navigation
Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation
Yibo Cui
Liang Xie
Yakun Zhang
Meishan Zhang
Ye Yan
Erwei Yin
LM&Ro
39
16
0
24 Aug 2023
Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog
  Navigation
Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation
Yi-Chiao Su
Dongyan An
Yuan Xu
Kehan Chen
Yan Huang
52
2
0
22 Aug 2023
ROSGPT_Vision: Commanding Robots Using Only Language Models' Prompts
ROSGPT_Vision: Commanding Robots Using Only Language Models' Prompts
Bilel Benjdira
Anis Koubaa
Anas M. Ali
LM&Ro
32
3
0
22 Aug 2023
VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language
  Navigation
VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language Navigation
Yanyuan Qiao
Zheng Yu
Qi Wu
VLM
22
16
0
20 Aug 2023
March in Chat: Interactive Prompting for Remote Embodied Referring
  Expression
March in Chat: Interactive Prompting for Remote Embodied Referring Expression
Yanyuan Qiao
Yuankai Qi
Zheng Yu
Jiaheng Liu
Qi Wu
LM&Ro
41
30
0
20 Aug 2023
AerialVLN: Vision-and-Language Navigation for UAVs
AerialVLN: Vision-and-Language Navigation for UAVs
Shubo Liu
Hongsheng Zhang
Yuankai Qi
Peifeng Wang
Yaning Zhang
Qi Wu
CoGe
34
42
0
13 Aug 2023
Learning to Model the World with Language
Learning to Model the World with Language
Jessy Lin
Yuqing Du
Olivia Watkins
Danijar Hafner
Pieter Abbeel
Dan Klein
Anca Dragan
LM&Ro
SyDa
44
51
0
31 Jul 2023
Scaling Data Generation in Vision-and-Language Navigation
Scaling Data Generation in Vision-and-Language Navigation
Zun Wang
Jialu Li
Yicong Hong
Yi Wang
Qi Wu
Joey Tianyi Zhou
Stephen Gould
Hao Tan
Yu Qiao
LM&Ro
43
56
0
28 Jul 2023
GridMM: Grid Memory Map for Vision-and-Language Navigation
GridMM: Grid Memory Map for Vision-and-Language Navigation
Zihan Wang
Xiangyang Li
Jiahao Yang
Yeqi Liu
Shuqiang Jiang
33
52
0
24 Jul 2023
Breaking Down the Task: A Unit-Grained Hybrid Training Framework for
  Vision and Language Decision Making
Breaking Down the Task: A Unit-Grained Hybrid Training Framework for Vision and Language Decision Making
Ruipu Luo
Jiwen Zhang
Zhongyu Wei
VLM
16
0
0
16 Jul 2023
Robots That Ask For Help: Uncertainty Alignment for Large Language Model
  Planners
Robots That Ask For Help: Uncertainty Alignment for Large Language Model Planners
Allen Z. Ren
Anushri Dixit
Alexandra Bodrova
Sumeet Singh
Stephen Tu
...
Jacob Varley
Zhenjia Xu
Dorsa Sadigh
Andy Zeng
Anirudha Majumdar
LM&Ro
64
220
0
04 Jul 2023
HeGeL: A Novel Dataset for Geo-Location from Hebrew Text
HeGeL: A Novel Dataset for Geo-Location from Hebrew Text
Tzuf Paz-Argaman
Tal Bauman
Itai Mondshine
Itzhak Omer
S. Dalyot
Reut Tsarfaty
22
3
0
02 Jul 2023
Solving Dialogue Grounding Embodied Task in a Simulated Environment
  using Further Masked Language Modeling
Solving Dialogue Grounding Embodied Task in a Simulated Environment using Further Masked Language Modeling
Weijie Zhang
40
0
0
21 Jun 2023
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot
  Vision-and-Language Navigation
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation
Xiwen Liang
Liang Ma
Shanshan Guo
Jianhua Han
Hang Xu
Shikui Ma
Xiaodan Liang
LM&Ro
LLMAG
90
4
0
17 Jun 2023
CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual
  Navigation in Noisy Environments
CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments
Xiulong Liu
Sudipta Paul
Moitreya Chatterjee
A. Cherian
31
8
0
06 Jun 2023
Active Vision Reinforcement Learning under Limited Visual Observability
Active Vision Reinforcement Learning under Limited Visual Observability
Jinghuan Shang
Michael S. Ryoo
40
0
0
01 Jun 2023
Adaptive Coordination in Social Embodied Rearrangement
Adaptive Coordination in Social Embodied Rearrangement
Andrew Szot
Unnat Jain
Dhruv Batra
Z. Kira
Ruta Desai
Akshara Rai
44
13
0
31 May 2023
PanoGen: Text-Conditioned Panoramic Environment Generation for
  Vision-and-Language Navigation
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
Jialu Li
Joey Tianyi Zhou
DiffM
31
49
0
30 May 2023
R2H: Building Multimodal Navigation Helpers that Respond to Help
  Requests
R2H: Building Multimodal Navigation Helpers that Respond to Help Requests
Yue Fan
Jing Gu
Kaizhi Zheng
Xin Wang
49
4
0
23 May 2023
Yes, this Way! Learning to Ground Referring Expressions into Actions
  with Intra-episodic Feedback from Supportive Teachers
Yes, this Way! Learning to Ground Referring Expressions into Actions with Intra-episodic Feedback from Supportive Teachers
P. Sadler
Sherzod Hakimov
David Schlangen
41
1
0
22 May 2023
Multimodal Contextualized Plan Prediction for Embodied Task Completion
Multimodal Contextualized Plan Prediction for Embodied Task Completion
Mert Inan
Aishwarya Padmakumar
Spandana Gella
P. Lange
Dilek Z. Hakkani-Tür
LM&Ro
55
0
0
10 May 2023
Improving Vision-and-Language Navigation by Generating Future-View Image
  Semantics
Improving Vision-and-Language Navigation by Generating Future-View Image Semantics
Jialu Li
Joey Tianyi Zhou
29
34
0
11 Apr 2023
ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous
  States in Realistic 3D Scenes
ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes
Ran Gong
Jiangyong Huang
Yizhou Zhao
Haoran Geng
Xiaofeng Gao
...
Ziheng Zhou
D. Terzopoulos
Song-Chun Zhu
Baoxiong Jia
Siyuan Huang
LM&Ro
45
45
0
09 Apr 2023
ETPNav: Evolving Topological Planning for Vision-Language Navigation in
  Continuous Environments
ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments
Dongyan An
H. Wang
Wenguan Wang
Zun Wang
Yan Huang
Keji He
Liang Wang
72
63
0
06 Apr 2023
KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation
KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation
Xiangyang Li
Zihan Wang
Jiahao Yang
Yaowei Wang
Shuqiang Jiang
LM&Ro
21
38
0
28 Mar 2023
Lana: A Language-Capable Navigator for Instruction Following and
  Generation
Lana: A Language-Capable Navigator for Instruction Following and Generation
Xiaohan Wang
Wenguan Wang
Jiayi Shao
Yi Yang
LLMAG
LM&Ro
43
38
0
15 Mar 2023
Accountable Textual-Visual Chat Learns to Reject Human Instructions in
  Image Re-creation
Accountable Textual-Visual Chat Learns to Reject Human Instructions in Image Re-creation
Zhiwei Zhang
Yuliang Liu
MLLM
30
0
0
10 Mar 2023
Previous
12345
Next