ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2404.06609
  4. Cited By
GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation

GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation

9 April 2024
Mukul Khanna
Ram Ramrakhya
Gunjan Chhablani
Sriram Yenamandra
Théophile Gervet
Matthew Chang
Z. Kira
Devendra Singh Chaplot
Dhruv Batra
Roozbeh Mottaghi
    LM&Ro
ArXivPDFHTML

Papers citing "GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation"

26 / 26 papers shown
Title
UAV-VLN: End-to-End Vision Language guided Navigation for UAVs
UAV-VLN: End-to-End Vision Language guided Navigation for UAVs
Pranav Saxena
Nishant Raghuvanshi
Neena Goveas
74
0
0
30 Apr 2025
Multimodal Perception for Goal-oriented Navigation: A Survey
Multimodal Perception for Goal-oriented Navigation: A Survey
I-Tak Ieong
Hao Tang
LM&Ro
LRM
33
0
0
22 Apr 2025
Dexterous Manipulation through Imitation Learning: A Survey
Dexterous Manipulation through Imitation Learning: A Survey
Shan An
Ziyu Meng
Chao Tang
Y. Zhou
Tengyu Liu
...
Yao Mu
Ran Song
Wei Zhang
Zeng-Guang Hou
H. Zhang
51
0
0
04 Apr 2025
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks
W. Zhang
Mengna Wang
Gangao Liu
Xu Huixin
Yiwei Jiang
...
Hang Zhang
Xin Li
Weiming Lu
Peng Li
Y. Zhuang
LM&Ro
LRM
65
3
0
27 Mar 2025
CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models
CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models
Yiqi Zhu
Z. Wang
C. Zhang
Peng Li
Yang Liu
CoGe
VLM
68
0
0
18 Mar 2025
Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach
Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach
Steeven Janny
Hervé Poirier
L. Antsfeld
G. Bono
G. Monaci
Boris Chidlovskii
Francesco Giuliari
Alessio Del Bue
Christian Wolf
LM&Ro
55
0
0
11 Mar 2025
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments
Luca Barsellotti
Roberto Bigazzi
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
93
1
0
20 Feb 2025
LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View
  Images to Third-Person-View BEV Maps for Universal Point Goal Navigation
LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation
Riku Uemura
Kanji Tanaka
Kenta Tsukahara
Daiki Iwata
26
1
0
23 Dec 2024
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
Xinshuai Song
Weixing Chen
Y. Liu
Weikai Chen
Guanbin Li
Liang Lin
123
3
0
12 Dec 2024
TANGO: Training-free Embodied AI Agents for Open-world Tasks
TANGO: Training-free Embodied AI Agents for Open-world Tasks
Filippo Ziliotto
Tommaso Campari
Luciano Serafini
Lamberto Ballan
LLMAG
LM&Ro
MLLM
LRM
69
1
0
05 Dec 2024
3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning
3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning
Yuncong Yang
Han Yang
Jiachen Zhou
Peihao Chen
Hongxin Zhang
Yilun Du
Chuang Gan
64
1
0
23 Nov 2024
VLN-Game: Vision-Language Equilibrium Search for Zero-Shot Semantic Navigation
Bangguo Yu
Yuzhen Liu
Lei Han
H. Kasaei
Tingguang Li
M. Cao
LM&Ro
69
2
0
18 Nov 2024
Embodied-RAG: General Non-parametric Embodied Memory for Retrieval and Generation
Embodied-RAG: General Non-parametric Embodied Memory for Retrieval and Generation
Quanting Xie
So Yeon Min
Tianyi Zhang
Kedi Xu
Aarav Bajaj
Ruslan Salakhutdinov
Matthew Johnson-Roberson
Yonatan Bisk
Matthew Johnson-Roberson
Yonatan Bisk
LM&Ro
55
7
0
26 Sep 2024
Navigation with VLM framework: Go to Any Language
Navigation with VLM framework: Go to Any Language
Zecheng Yin
Chonghao Cheng
Lizhen
LM&Ro
32
0
0
18 Sep 2024
OVExp: Open Vocabulary Exploration for Object-Oriented Navigation
OVExp: Open Vocabulary Exploration for Object-Oriented Navigation
Meng Wei
Tai Wang
Yilun Chen
Hanqing Wang
Jiangmiao Pang
Xihui Liu
VLM
49
3
0
12 Jul 2024
Towards Probing Speech-Specific Risks in Large Multimodal Models: A
  Taxonomy, Benchmark, and Insights
Towards Probing Speech-Specific Risks in Large Multimodal Models: A Taxonomy, Benchmark, and Insights
Hao Yang
Lizhen Qu
Ehsan Shareghi
Gholamreza Haffari
28
0
0
25 Jun 2024
CityNav: Language-Goal Aerial Navigation Dataset with Geographic
  Information
CityNav: Language-Goal Aerial Navigation Dataset with Geographic Information
Jungdae Lee
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
Daichi Azuma
Yutaka Matsuo
Nakamasa Inoue
47
12
0
20 Jun 2024
Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language
  Navigation
Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation
Zihan Wang
Xiangyang Li
Jiahao Yang
Yeqi Liu
Shuqiang Jiang
LM&Ro
37
6
0
14 Jun 2024
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image
  Encoders and Large Language Models
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
270
4,244
0
30 Jan 2023
Habitat-Matterport 3D Semantics Dataset
Habitat-Matterport 3D Semantics Dataset
Karmesh Yadav
Ram Ramrakhya
Santhosh Kumar Ramakrishnan
Theo Gervet
John Turner
...
Angel X. Chang
Dhruv Batra
Manolis Savva
Alexander William Clegg
Devendra Singh Chaplot
3DV
MDE
84
83
0
11 Oct 2022
VER: Scaling On-Policy RL Leads to the Emergence of Navigation in
  Embodied Rearrangement
VER: Scaling On-Policy RL Leads to the Emergence of Navigation in Embodied Rearrangement
Erik Wijmans
Irfan Essa
Dhruv Batra
OffRL
31
13
0
11 Oct 2022
ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings
ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings
Arjun Majumdar
Gunjan Aggarwal
Bhavika Devnani
Judy Hoffman
Dhruv Batra
LM&Ro
147
149
0
24 Jun 2022
AdaptFormer: Adapting Vision Transformers for Scalable Visual
  Recognition
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Shoufa Chen
Chongjian Ge
Zhan Tong
Jiangliu Wang
Yibing Song
Jue Wang
Ping Luo
146
638
0
26 May 2022
No RL, No Simulation: Learning to Navigate without Navigating
No RL, No Simulation: Learning to Navigate without Navigating
Meera Hahn
Devendra Singh Chaplot
Shubham Tulsiani
Mustafa Mukadam
James M. Rehg
Abhinav Gupta
75
71
0
18 Oct 2021
FILM: Following Instructions in Language with Modular Methods
FILM: Following Instructions in Language with Modular Methods
So Yeon Min
Devendra Singh Chaplot
Pradeep Ravikumar
Yonatan Bisk
Ruslan Salakhutdinov
LM&Ro
214
159
0
12 Oct 2021
Waypoint Models for Instruction-guided Navigation in Continuous
  Environments
Waypoint Models for Instruction-guided Navigation in Continuous Environments
Jacob Krantz
Aaron Gokaslan
Dhruv Batra
Stefan Lee
Oleksandr Maksymets
LM&Ro
137
76
0
05 Oct 2021
1