ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.07954
  4. Cited By
Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense
  Spatiotemporal Grounding

Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense Spatiotemporal Grounding

15 October 2020
Alexander Ku
Peter Anderson
Roma Patel
Eugene Ie
Jason Baldridge
ArXivPDFHTML

Papers citing "Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense Spatiotemporal Grounding"

23 / 223 papers shown
Title
Indoor Semantic Scene Understanding using Multi-modality Fusion
Indoor Semantic Scene Understanding using Multi-modality Fusion
Muraleekrishna Gopinathan
Giang Truong
Jumana Abu-Khalaf
19
0
0
17 Aug 2021
Embodied BERT: A Transformer Model for Embodied, Language-guided Visual
  Task Completion
Embodied BERT: A Transformer Model for Embodied, Language-guided Visual Task Completion
Alessandro Suglia
Qiaozi Gao
Jesse Thomason
Govind Thattai
Gaurav Sukhatme
LM&Ro
34
77
0
10 Aug 2021
Neural Abstructions: Abstractions that Support Construction for Grounded
  Language Learning
Neural Abstructions: Abstractions that Support Construction for Grounded Language Learning
Kaylee Burns
Christopher D. Manning
Li Fei-Fei
24
0
0
20 Jul 2021
How Much Can CLIP Benefit Vision-and-Language Tasks?
How Much Can CLIP Benefit Vision-and-Language Tasks?
Sheng Shen
Liunian Harold Li
Hao Tan
Joey Tianyi Zhou
Anna Rohrbach
Kai-Wei Chang
Z. Yao
Kurt Keutzer
CLIP
VLM
MLLM
202
405
0
13 Jul 2021
A Persistent Spatial Semantic Representation for High-level Natural
  Language Instruction Execution
A Persistent Spatial Semantic Representation for High-level Natural Language Instruction Execution
Valts Blukis
Chris Paxton
D. Fox
Animesh Garg
Yoav Artzi
LM&Ro
212
134
0
12 Jul 2021
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
Junha Roh
Karthik Desingh
Ali Farhadi
D. Fox
22
95
0
07 Jul 2021
Deep Learning for Embodied Vision Navigation: A Survey
Deep Learning for Embodied Vision Navigation: A Survey
Fengda Zhu
Yi Zhu
Vincent CS Lee
Xiaodan Liang
Xiaojun Chang
EgoV
LM&Ro
42
0
0
07 Jul 2021
Habitat 2.0: Training Home Assistants to Rearrange their Habitat
Habitat 2.0: Training Home Assistants to Rearrange their Habitat
Andrew Szot
Alexander Clegg
Eric Undersander
Erik Wijmans
Yili Zhao
...
Z. Kira
V. Koltun
Jitendra Malik
Manolis Savva
Dhruv Batra
LM&Ro
39
497
0
28 Jun 2021
Core Challenges in Embodied Vision-Language Planning
Core Challenges in Embodied Vision-Language Planning
Jonathan M Francis
Nariaki Kitamura
Felix Labelle
Xiaopeng Lu
Ingrid Navarro
Jean Oh
LM&Ro
47
45
0
26 Jun 2021
Grounding 'Grounding' in NLP
Grounding 'Grounding' in NLP
Khyathi Raghavi Chandu
Yonatan Bisk
A. Black
30
51
0
04 Jun 2021
Pathdreamer: A World Model for Indoor Navigation
Pathdreamer: A World Model for Indoor Navigation
Jing Yu Koh
Honglak Lee
Yinfei Yang
Jason Baldridge
Peter Anderson
26
79
0
18 May 2021
Episodic Transformer for Vision-and-Language Navigation
Episodic Transformer for Vision-and-Language Navigation
Alexander Pashevich
Cordelia Schmid
Chen Sun
LM&Ro
43
193
0
13 May 2021
Improving Cross-Modal Alignment in Vision Language Navigation via
  Syntactic Information
Improving Cross-Modal Alignment in Vision Language Navigation via Syntactic Information
Jialu Li
Hao Hao Tan
Joey Tianyi Zhou
38
33
0
19 Apr 2021
GridToPix: Training Embodied Agents with Minimal Supervision
GridToPix: Training Embodied Agents with Minimal Supervision
Unnat Jain
Iou-Jen Liu
Svetlana Lazebnik
Aniruddha Kembhavi
Luca Weihs
A. Schwing
28
23
0
14 Apr 2021
Diagnosing Vision-and-Language Navigation: What Really Matters
Diagnosing Vision-and-Language Navigation: What Really Matters
Wanrong Zhu
Yuankai Qi
P. Narayana
Kazoo Sone
Sugato Basu
Qing Guo
Qi Wu
M. Eckstein
Luu Anh Tuan
LM&Ro
27
50
0
30 Mar 2021
PanGEA: The Panoramic Graph Environment Annotation Toolkit
PanGEA: The Panoramic Graph Environment Annotation Toolkit
Alexander Ku
Peter Anderson
Jordi Pont-Tuset
Jason Baldridge
8
2
0
23 Mar 2021
On the Evaluation of Vision-and-Language Navigation Instructions
On the Evaluation of Vision-and-Language Navigation Instructions
Mingde Zhao
Peter Anderson
Vihan Jain
Su Wang
Alexander Ku
Jason Baldridge
Eugene Ie
233
51
0
26 Jan 2021
Generating Landmark Navigation Instructions from Maps as a Graph-to-Text
  Problem
Generating Landmark Navigation Instructions from Maps as a Graph-to-Text Problem
Raphael Schumann
Stefan Riezler
17
27
0
30 Dec 2020
Visually Grounding Language Instruction for History-Dependent
  Manipulation
Visually Grounding Language Instruction for History-Dependent Manipulation
Hyemin Ahn
Obin Kwon
Kyungdo Kim
Jaeyeon Jeong
Howoong Jun
Hongjung Lee
Dongheui Lee
Songhwai Oh
LM&Ro
21
6
0
16 Dec 2020
A Recurrent Vision-and-Language BERT for Navigation
A Recurrent Vision-and-Language BERT for Navigation
Yicong Hong
Qi Wu
Yuankai Qi
Cristian Rodriguez-Opazo
Stephen Gould
LM&Ro
38
298
0
26 Nov 2020
Sim-to-Real Transfer for Vision-and-Language Navigation
Sim-to-Real Transfer for Vision-and-Language Navigation
Peter Anderson
Ayush Shrivastava
Joanne Truong
Arjun Majumdar
Devi Parikh
Dhruv Batra
Stefan Lee
LM&Ro
36
106
0
07 Nov 2020
Speaker-Follower Models for Vision-and-Language Navigation
Speaker-Follower Models for Vision-and-Language Navigation
Daniel Fried
Ronghang Hu
Volkan Cirik
Anna Rohrbach
Jacob Andreas
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
Kate Saenko
Dan Klein
Trevor Darrell
LM&Ro
LRM
260
498
0
07 Jun 2018
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,926
0
17 Aug 2015
Previous
12345