2307.12335
Learning Navigational Visual Representations with Semantic Map Supervision
23 July 2023
Yicong Hong
Yang Zhou
Ruiyi Zhang
Franck Dernoncourt
Trung Bui
Stephen Gould
Hao Tan
SSL
Papers citing "Learning Navigational Visual Representations with Semantic Map Supervision"
50 / 60 papers shown
Title
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
Jialu Li
Mohit Bansal
DiffM
101
55
0
30 May 2023
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
129
48
0
24 Aug 2022
LATTE: LAnguage Trajectory TransformEr
A. Bucker
Luis F. C. Figueredo
Sami Haddadin
Ashish Kapoor
Shuang Ma
Sai H. Vemprala
Rogerio Bonatti
LM&Ro
112
59
0
04 Aug 2022
1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)
Dongyan An
Zun Wang
Yangguang Li
Yi Wang
Yicong Hong
Yan Huang
Liang Wang
Jing Shao
63
14
0
23 Jun 2022
Google Scanned Objects: A High-Quality Dataset of 3D Scanned Household Items
Laura Downs
Anthony G. Francis
Nate Koenig
Brandon Kinman
R. Hickman
Krista Reymann
T. B. McHugh
Vincent Vanhoucke
LM&Ro
112
500
0
25 Apr 2022
Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation?
Yuchen Cui
S. Niekum
Abhi Gupta
Vikash Kumar
Aravind Rajeswaran
LM&Ro
86
80
0
23 Apr 2022
Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments
Jacob Krantz
Stefan Lee
45
38
0
20 Apr 2022
Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale
Ram Ramrakhya
Eric Undersander
Dhruv Batra
Abhishek Das
LM&Ro
122
119
0
07 Apr 2022
EnvEdit: Environment Editing for Vision-and-Language Navigation
Jialu Li
Hao Tan
Mohit Bansal
101
83
0
29 Mar 2022
HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation
Yanyuan Qiao
Yuankai Qi
Yicong Hong
Zheng Yu
Peifeng Wang
Qi Wu
AI4TS
92
76
0
22 Mar 2022
Stubborn: A Strong Baseline for Indoor Object Navigation
Haokuan Luo
Albert Yue
Zhang-Wei Hong
Pulkit Agrawal
85
45
0
14 Mar 2022
Cross-modal Map Learning for Vision and Language Navigation
G. Georgakis
Karl Schmeckpeper
Karan Wanchoo
Soham Dan
E. Miltsakaki
Dan Roth
Kostas Daniilidis
87
66
0
10 Mar 2022
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
LM&Ro
92
148
0
23 Feb 2022
Simple but Effective: CLIP Embeddings for Embodied AI
Apoorv Khandelwal
Luca Weihs
Roozbeh Mottaghi
Aniruddha Kembhavi
VLM
LM&Ro
97
230
0
18 Nov 2021
No RL, No Simulation: Learning to Navigate without Navigating
Meera Hahn
Devendra Singh Chaplot
Shubham Tulsiani
Mustafa Mukadam
James M. Rehg
Abhinav Gupta
114
73
0
18 Oct 2021
Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans
Ainaz Eftekhar
Alexander Sax
Roman Bachmann
Jitendra Malik
Amir Zamir
MedIm
114
300
0
11 Oct 2021
Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments
Sonia Raychaudhuri
Saim Wani
Shivansh Patel
Unnat Jain
Angel X. Chang
LM&Ro
73
54
0
30 Sep 2021
CLIPort: What and Where Pathways for Robotic Manipulation
Mohit Shridhar
Lucas Manuelli
Dieter Fox
LM&Ro
126
661
0
24 Sep 2021
Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI
Santhosh Kumar Ramakrishnan
Aaron Gokaslan
Erik Wijmans
Oleksandr Maksymets
Alexander Clegg
...
Andrew Westbury
Angel X. Chang
Manolis Savva
Yili Zhao
Dhruv Batra
90
393
0
16 Sep 2021
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments
Muhammad Zubair Irshad
Niluthpol Chowdhury Mithun
Zachary Seymour
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
LM&Ro
73
51
0
26 Aug 2021
Airbert: In-domain Pretraining for Vision-and-Language Navigation
Pierre-Louis Guhur
Makarand Tapaswi
Shizhe Chen
Ivan Laptev
Cordelia Schmid
LM&Ro
52
144
0
20 Aug 2021
iGibson 2.0: Object-Centric Simulation for Robot Learning of Everyday Household Tasks
Chengshu Li
Fei Xia
Roberto Martín-Martín
Michael Lingelbach
S. Srivastava
...
Karen Liu
H. Gweon
Jiajun Wu
Li Fei-Fei
Silvio Savarese
LM&Ro
228
237
0
06 Aug 2021
How Much Can CLIP Benefit Vision-and-Language Tasks?
Sheng Shen
Liunian Harold Li
Hao Tan
Mohit Bansal
Anna Rohrbach
Kai-Wei Chang
Z. Yao
Kurt Keutzer
CLIP
VLM
MLLM
259
410
0
13 Jul 2021
Learning to Map for Active Semantic Goal Navigation
G. Georgakis
Bernadette Bucher
Karl Schmeckpeper
Siddharth Singh
Kostas Daniilidis
74
78
0
29 Jun 2021
Habitat 2.0: Training Home Assistants to Rearrange their Habitat
Andrew Szot
Alexander Clegg
Eric Undersander
Erik Wijmans
Yili Zhao
...
Z. Kira
V. Koltun
Jitendra Malik
Manolis Savva
Dhruv Batra
LM&Ro
109
527
0
28 Jun 2021
Vision-Language Navigation with Random Environmental Mixup
Chong Liu
Fengda Zhu
Xiaojun Chang
Xiaodan Liang
Zongyuan Ge
Yi-Dong Shen
LM&Ro
102
87
0
15 Jun 2021
RobustNav: Towards Benchmarking Robustness in Embodied Navigation
Prithvijit Chattopadhyay
Judy Hoffman
Roozbeh Mottaghi
Aniruddha Kembhavi
69
55
0
08 Jun 2021
Curious Representation Learning for Embodied Intelligence
Yilun Du
Chuang Gan
Phillip Isola
SSL
LM&Ro
160
40
0
03 May 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
730
6,135
0
29 Apr 2021
The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
Yuankai Qi
Zizheng Pan
Yicong Hong
Ming-Hsuan Yang
Anton Van Den Hengel
Qi Wu
LM&Ro
70
69
0
09 Apr 2021
Environment Predictive Coding for Embodied Agents
Santhosh Kumar Ramakrishnan
Tushar Nagarajan
Ziad Al-Halah
Kristen Grauman
59
14
0
03 Feb 2021
Memory-Augmented Reinforcement Learning for Image-Goal Navigation
Lina Mezghani
Sainbayar Sukhbaatar
Thibaut Lavril
Oleksandr Maksymets
Dhruv Batra
Piotr Bojanowski
Karteek Alahari
75
70
0
13 Jan 2021
A Recurrent Vision-and-Language BERT for Navigation
Yicong Hong
Qi Wu
Yuankai Qi
Cristian Rodriguez-Opazo
Stephen Gould
LM&Ro
104
303
0
26 Nov 2020
Language and Visual Entity Relationship Graph for Agent Navigation
Yicong Hong
Cristian Rodriguez-Opazo
Yuankai Qi
Qi Wu
Stephen Gould
LM&Ro
224
134
0
19 Oct 2020
Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense Spatiotemporal Grounding
Alexander Ku
Peter Anderson
Roma Patel
Eugene Ie
Jason Baldridge
96
315
0
15 Oct 2020
Semantic MapNet: Building Allocentric Semantic Maps and Representations from Egocentric Views
Vincent Cartillier
Zhile Ren
Neha Jain
Stefan Lee
Irfan Essa
Dhruv Batra
3DPC
108
74
0
02 Oct 2020
Contrastive Learning of Medical Visual Representations from Paired Images and Text
Yuhao Zhang
Hang Jiang
Yasuhide Miura
Christopher D. Manning
C. Langlotz
MedIm
147
767
0
02 Oct 2020
Self-Supervised Policy Adaptation during Deployment
Nicklas Hansen
Rishabh Jangir
Yu Sun
Guillem Alenyà
Pieter Abbeel
Alexei A. Efros
Lerrel Pinto
Xiaolong Wang
70
163
0
08 Jul 2020
Object Goal Navigation using Goal-Oriented Semantic Exploration
Devendra Singh Chaplot
Dhiraj Gandhi
Abhinav Gupta
Ruslan Salakhutdinov
94
524
0
01 Jul 2020
Neural Topological SLAM for Visual Navigation
Devendra Singh Chaplot
Ruslan Salakhutdinov
Abhinav Gupta
Saurabh Gupta
117
296
0
25 May 2020
Improving Vision-and-Language Navigation with Image-Text Pairs from the Web
Arjun Majumdar
Ayush Shrivastava
Stefan Lee
Peter Anderson
Devi Parikh
Dhruv Batra
LM&Ro
166
235
0
30 Apr 2020
RoboTHOR: An Open Simulation-to-Real Embodied AI Platform
Matt Deitke
Winson Han
Alvaro Herrasti
Aniruddha Kembhavi
Eric Kolve
...
Eli VanderBilt
Matthew Wallingford
Luca Weihs
Mark Yatskar
Ali Farhadi
LM&Ro
113
241
0
14 Apr 2020
A Simple Framework for Contrastive Learning of Visual Representations
Ting Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
390
18,897
0
13 Feb 2020
ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
Mohit Shridhar
Jesse Thomason
Daniel Gordon
Yonatan Bisk
Winson Han
Roozbeh Mottaghi
Luke Zettlemoyer
Dieter Fox
LM&Ro
122
781
0
03 Dec 2019
Vision-Language Navigation with Self-Supervised Auxiliary Reasoning Tasks
Fengda Zhu
Yi Zhu
Xiaojun Chang
Xiaodan Liang
LRM
90
243
0
18 Nov 2019
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames
Erik Wijmans
Abhishek Kadian
Ari S. Morcos
Stefan Lee
Irfan Essa
Devi Parikh
Manolis Savva
Dhruv Batra
95
484
0
01 Nov 2019
The Replica Dataset: A Digital Replica of Indoor Spaces
Julian Straub
Thomas Whelan
Lingni Ma
Yufan Chen
Erik Wijmans
...
H. Strasdat
R. D. Nardi
Michael Goesele
S. Lovegrove
Richard Newcombe
3DV
134
858
0
13 Jun 2019
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments
Yuankai Qi
Qi Wu
Peter Anderson
Xinze Wang
Wenjie Wang
Chunhua Shen
Anton Van Den Hengel
LM&Ro
110
330
0
23 Apr 2019
Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout
Hao Tan
Licheng Yu
Mohit Bansal
SSL
91
322
0
08 Apr 2019
Habitat: A Platform for Embodied AI Research
Manolis Savva
Abhishek Kadian
Oleksandr Maksymets
Yili Zhao
Erik Wijmans
...
Jia Liu
V. Koltun
Jitendra Malik
Devi Parikh
Dhruv Batra
LM&Ro
129
1,423
0
02 Apr 2019