ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.14158
  4. Cited By
Volumetric Environment Representation for Vision-Language Navigation

Volumetric Environment Representation for Vision-Language Navigation

21 March 2024
Rui Liu
Wenguan Wang
Yi Yang
ArXiv (abs)PDFHTML

Papers citing "Volumetric Environment Representation for Vision-Language Navigation"

50 / 56 papers shown
Title
Temporal Triplane Transformers as Occupancy World Models
Temporal Triplane Transformers as Occupancy World Models
Haoran Xu
Peixi Peng
Guang Tan
Yiqian Chang
Yisen Zhao
Yonghong Tian
158
0
0
10 Mar 2025
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
Xinshuai Song
Weixing Chen
Yang Liu
Weikai Chen
Guanbin Li
Liang Lin
175
5
0
12 Dec 2024
A Survey on Vision-Language-Action Models for Embodied AI
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
287
52
0
23 May 2024
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards
  Embodied AI
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
Tai Wang
Xiaohan Mao
Chenming Zhu
Runsen Xu
Ruiyuan Lyu
...
Tianfan Xue
Xihui Liu
Cewu Lu
Dahua Lin
Jiangmiao Pang
LM&Ro
70
73
0
26 Dec 2023
Omnidirectional Information Gathering for Knowledge Transfer-based
  Audio-Visual Navigation
Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation
Jinyu Chen
Wenguan Wang
Siying Liu
Hongsheng Li
Yi Yang
82
8
0
20 Aug 2023
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
Hanqing Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
74
31
0
14 Aug 2023
Learning Navigational Visual Representations with Semantic Map
  Supervision
Learning Navigational Visual Representations with Semantic Map Supervision
Yicong Hong
Yang Zhou
Ruiyi Zhang
Franck Dernoncourt
Trung Bui
Stephen Gould
Hao Tan
SSL
56
22
0
23 Jul 2023
PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic
  Segmentation
PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation
Yu-Quan Wang
Yuntao Chen
Xingyu Liao
Lue Fan
Zhaoxiang Zhang
122
77
0
16 Jun 2023
PanoGen: Text-Conditioned Panoramic Environment Generation for
  Vision-and-Language Navigation
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
Jialu Li
Joey Tianyi Zhou
DiffM
84
53
0
30 May 2023
SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving
SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving
Yi Wei
Linqing Zhao
Wenzhao Zheng
Zhengbiao Zhu
Jie Zhou
Jiwen Lu
3DPC
75
230
0
16 Mar 2023
Renderable Neural Radiance Map for Visual Navigation
Renderable Neural Radiance Map for Visual Navigation
Obin Kwon
Jeongho Park
Songhwai Oh
67
55
0
01 Mar 2023
Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction
Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction
Yuan-Ko Huang
Wenzhao Zheng
Yunpeng Zhang
Jie Zhou
Jiwen Lu
3DPC
74
300
0
15 Feb 2023
BEVBert: Multimodal Map Pre-training for Language-guided Navigation
BEVBert: Multimodal Map Pre-training for Language-guided Navigation
Dongyan An
Yuankai Qi
Yangguang Li
Yan Huang
Liangsheng Wang
Tieniu Tan
Jing Shao
74
62
0
08 Dec 2022
A New Path: Scaling Vision-and-Language Navigation with Synthetic
  Instructions and Imitation Learning
A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Aishwarya Kamath
Peter Anderson
Su Wang
Jing Yu Koh
Alexander Ku
Austin Waters
Yinfei Yang
Jason Baldridge
Zarana Parekh
LM&Ro
93
48
0
06 Oct 2022
Delving into the Devils of Bird's-eye-view Perception: A Review,
  Evaluation and Recipe
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe
Hongyang Li
Chonghao Sima
Jifeng Dai
Wenhai Wang
Lewei Lu
...
Xiaosong Jia
Siqian Liu
Jianping Shi
Dahua Lin
Yu Qiao
145
145
0
12 Sep 2022
Target-Driven Structured Transformer Planner for Vision-Language
  Navigation
Target-Driven Structured Transformer Planner for Vision-Language Navigation
Yusheng Zhao
Jinyu Chen
Chen Gao
Wenguan Wang
Lirong Yang
Haibing Ren
Huaxia Xia
Si Liu
LM&Ro
75
59
0
19 Jul 2022
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera
  Images via Spatiotemporal Transformers
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
Zhiqi Li
Wenhai Wang
Hongyang Li
Enze Xie
Chonghao Sima
Tong Lu
Qiao Yu
Jifeng Dai
125
1,289
0
31 Mar 2022
Counterfactual Cycle-Consistent Learning for Instruction Following and
  Generation in Vision-Language Navigation
Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation
Hongru Wang
Wei Liang
Jianbing Shen
Luc Van Gool
Wenguan Wang
73
57
0
30 Mar 2022
EnvEdit: Environment Editing for Vision-and-Language Navigation
EnvEdit: Environment Editing for Vision-and-Language Navigation
Jialu Li
Hao Tan
Joey Tianyi Zhou
99
81
0
29 Mar 2022
HOP: History-and-Order Aware Pre-training for Vision-and-Language
  Navigation
HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation
Yanyuan Qiao
Yuankai Qi
Yicong Hong
Zheng Yu
Peifeng Wang
Qi Wu
AI4TS
92
74
0
22 Mar 2022
Cross-modal Map Learning for Vision and Language Navigation
Cross-modal Map Learning for Vision and Language Navigation
G. Georgakis
Karl Schmeckpeper
Karan Wanchoo
Soham Dan
E. Miltsakaki
Dan Roth
Kostas Daniilidis
74
64
0
10 Mar 2022
Think Global, Act Local: Dual-scale Graph Transformer for
  Vision-and-Language Navigation
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
LM&Ro
92
142
0
23 Feb 2022
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for
  Vision-and-Language Navigation in Continuous Environments
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments
Muhammad Zubair Irshad
Niluthpol Chowdhury Mithun
Zachary Seymour
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
LM&Ro
68
49
0
26 Aug 2021
Airbert: In-domain Pretraining for Vision-and-Language Navigation
Airbert: In-domain Pretraining for Vision-and-Language Navigation
Pierre-Louis Guhur
Makarand Tapaswi
Shizhe Chen
Ivan Laptev
Cordelia Schmid
LM&Ro
49
139
0
20 Aug 2021
3D Neural Scene Representations for Visuomotor Control
3D Neural Scene Representations for Visuomotor Control
Yunzhu Li
Shuang Li
Vincent Sitzmann
Pulkit Agrawal
Antonio Torralba
114
141
0
08 Jul 2021
Vision-Language Navigation with Random Environmental Mixup
Vision-Language Navigation with Random Environmental Mixup
Chong Liu
Fengda Zhu
Xiaojun Chang
Xiaodan Liang
Zongyuan Ge
Yi-Dong Shen
LM&Ro
92
86
0
15 Jun 2021
Scene-Intuitive Agent for Remote Embodied Visual Grounding
Scene-Intuitive Agent for Remote Embodied Visual Grounding
Xiangru Lin
Guanbin Li
Yizhou Yu
LM&Ro
61
52
0
24 Mar 2021
3D Semantic Scene Completion: a Survey
3D Semantic Scene Completion: a Survey
Luis Roldão
Raoul de Charette
Anne Verroust-Blondet
3DV
93
96
0
12 Mar 2021
Structured Scene Memory for Vision-Language Navigation
Structured Scene Memory for Vision-Language Navigation
Hanqing Wang
Wenguan Wang
Wei Liang
Caiming Xiong
Jianbing Shen
LM&Ro
77
114
0
05 Mar 2021
Semantics for Robotic Mapping, Perception and Interaction: A Survey
Semantics for Robotic Mapping, Perception and Interaction: A Survey
Sourav Garg
Niko Sünderhauf
Feras Dayoub
D. Morrison
Akansel Cosgun
...
Tat-Jun Chin
Ian Reid
Stephen Gould
Peter Corke
Michael Milford
101
117
0
02 Jan 2021
Topological Planning with Transformers for Vision-and-Language
  Navigation
Topological Planning with Transformers for Vision-and-Language Navigation
Kevin Chen
Junshen K. Chen
Jo Chuang
Nathan Tsoi
Silvio Savarese
LM&Ro
82
99
0
09 Dec 2020
A Recurrent Vision-and-Language BERT for Navigation
A Recurrent Vision-and-Language BERT for Navigation
Yicong Hong
Qi Wu
Yuankai Qi
Cristian Rodriguez-Opazo
Stephen Gould
LM&Ro
104
301
0
26 Nov 2020
Learning 3D Dynamic Scene Representations for Robot Manipulation
Learning 3D Dynamic Scene Representations for Robot Manipulation
Zhenjia Xu
Zhanpeng He
Jiajun Wu
Shuran Song
52
55
0
03 Nov 2020
Language and Visual Entity Relationship Graph for Agent Navigation
Language and Visual Entity Relationship Graph for Agent Navigation
Yicong Hong
Cristian Rodriguez-Opazo
Yuankai Qi
Qi Wu
Stephen Gould
LM&Ro
222
134
0
19 Oct 2020
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
224
5,080
0
08 Oct 2020
Semantic MapNet: Building Allocentric Semantic Maps and Representations
  from Egocentric Views
Semantic MapNet: Building Allocentric Semantic Maps and Representations from Egocentric Views
Vincent Cartillier
Zhile Ren
Neha Jain
Stefan Lee
Irfan Essa
Dhruv Batra
3DPC
85
74
0
02 Oct 2020
Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by
  Implicitly Unprojecting to 3D
Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3D
Jonah Philion
Sanja Fidler
94
1,051
0
13 Aug 2020
Pillar-based Object Detection for Autonomous Driving
Pillar-based Object Detection for Autonomous Driving
Yue Wang
Alireza Fathi
Abhijit Kundu
David A. Ross
C. Pantofaru
Thomas Funkhouser
Justin Solomon
3DPC
73
217
0
20 Jul 2020
Active Visual Information Gathering for Vision-Language Navigation
Active Visual Information Gathering for Vision-Language Navigation
Hanqing Wang
Wenguan Wang
Tianmin Shu
Wei Liang
Jianbing Shen
128
73
0
15 Jul 2020
Evolving Graphical Planner: Contextual Global Planning for
  Vision-and-Language Navigation
Evolving Graphical Planner: Contextual Global Planning for Vision-and-Language Navigation
Zhiwei Deng
Karthik Narasimhan
Olga Russakovsky
66
87
0
11 Jul 2020
Neural Topological SLAM for Visual Navigation
Neural Topological SLAM for Visual Navigation
Devendra Singh Chaplot
Ruslan Salakhutdinov
Abhinav Gupta
Saurabh Gupta
107
295
0
25 May 2020
Improving Vision-and-Language Navigation with Image-Text Pairs from the
  Web
Improving Vision-and-Language Navigation with Image-Text Pairs from the Web
Arjun Majumdar
Ayush Shrivastava
Stefan Lee
Peter Anderson
Devi Parikh
Dhruv Batra
LM&Ro
145
233
0
30 Apr 2020
Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions
Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions
Johanna Wald
Helisa Dhamo
Nassir Navab
Federico Tombari
3DV3DPC
71
217
0
08 Apr 2020
Towards Learning a Generic Agent for Vision-and-Language Navigation via
  Pre-training
Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training
Weituo Hao
Chunyuan Li
Xiujun Li
Lawrence Carin
Jianfeng Gao
LM&Ro
75
279
0
25 Feb 2020
Vision-Language Navigation with Self-Supervised Auxiliary Reasoning
  Tasks
Vision-Language Navigation with Self-Supervised Auxiliary Reasoning Tasks
Fengda Zhu
Yi Zhu
Xiaojun Chang
Xiaodan Liang
LRM
67
240
0
18 Nov 2019
LXMERT: Learning Cross-Modality Encoder Representations from
  Transformers
LXMERT: Learning Cross-Modality Encoder Representations from Transformers
Hao Hao Tan
Joey Tianyi Zhou
VLMMLLM
247
2,483
0
20 Aug 2019
Chasing Ghosts: Instruction Following as Bayesian State Tracking
Chasing Ghosts: Instruction Following as Bayesian State Tracking
Peter Anderson
Ayush Shrivastava
Devi Parikh
Dhruv Batra
Stefan Lee
56
74
0
03 Jul 2019
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor
  Environments
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments
Yuankai Qi
Qi Wu
Peter Anderson
Xinze Wang
Wenjie Wang
Chunhua Shen
Anton Van Den Hengel
LM&Ro
93
324
0
23 Apr 2019
Objects as Points
Objects as Points
Xingyi Zhou
Dequan Wang
Philipp Krahenbuhl
3DPC
110
3,257
0
16 Apr 2019
Learning to Navigate Unseen Environments: Back Translation with
  Environmental Dropout
Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout
Hao Tan
Licheng Yu
Joey Tianyi Zhou
SSL
88
318
0
08 Apr 2019
12
Next