2005.01616
Cited By
VisualEchoes: Spatial Image Representation Learning through Echolocation
4 May 2020
Ruohan Gao
Changan Chen
Ziad Al-Halah
Carl Schissler
Kristen Grauman
MDE
SSL
Papers citing
"VisualEchoes: Spatial Image Representation Learning through Echolocation"
50 / 67 papers shown
Title
NeRAF: 3D Scene Infused Neural Radiance and Acoustic Fields
Amandine Brunetto
Sascha Hornauer
Fabien Moutarde
85
2
0
28 May 2024
Images that Sound: Composing Images and Sounds on a Single Canvas
Ziyang Chen
Daniel Geng
Andrew Owens
DiffM
85
9
0
20 May 2024
Music Gesture for Visual Sound Separation
Chuang Gan
Deng Huang
Hang Zhao
J. Tenenbaum
Antonio Torralba
86
204
0
20 Apr 2020
Look, Listen, and Act: Towards Audio-Visual Embodied Navigation
Chuang Gan
Yiwei Zhang
Jiajun Wu
Boqing Gong
J. Tenenbaum
52
138
0
25 Dec 2019
DepthTransfer: Depth Extraction from Video Using Non-parametric Sampling
Kevin Karsch
Ce Liu
S. B. Kang
MDE
51
586
0
24 Dec 2019
SoundSpaces: Audio-Visual Navigation in 3D Environments
Changan Chen
Unnat Jain
Carl Schissler
S. V. A. Garí
Ziad Al-Halah
V. Ithapu
Philip Robinson
Kristen Grauman
48
26
0
24 Dec 2019
BatVision: Learning to See 3D Spatial Layout with Two Ears
J. H. Christensen
Sascha Hornauer
Stella X. Yu
37
57
0
15 Dec 2019
Listen to Look: Action Recognition by Previewing Audio
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
72
251
0
10 Dec 2019
Self-supervised Moving Vehicle Tracking with Stereo Sound
Chuang Gan
Hang Zhao
Peihao Chen
David D. Cox
Antonio Torralba
48
147
0
25 Oct 2019
EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition
Evangelos Kazakos
Arsha Nagrani
Andrew Zisserman
Dima Damen
EgoV
51
337
0
22 Aug 2019
DIODE: A Dense Indoor and Outdoor DEpth Dataset
Igor Vasiljevic
Nicholas I. Kolkin
Shanyi Zhang
Ruotian Luo
Haochen Wang
...
Andrea F. Daniele
Mohammadreza Mostajabi
Steven Basart
Matthew R. Walter
Gregory Shakhnarovich
MDE
3DV
69
231
0
01 Aug 2019
The Replica Dataset: A Digital Replica of Indoor Spaces
Julian Straub
Thomas Whelan
Lingni Ma
Yufan Chen
Erik Wijmans
...
H. Strasdat
R. D. Nardi
Michael Goesele
S. Lovegrove
Richard Newcombe
3DV
123
849
0
13 Jun 2019
Scaling and Benchmarking Self-Supervised Visual Representation Learning
Priya Goyal
D. Mahajan
Abhinav Gupta
Ishan Misra
SSL
54
397
0
03 May 2019
Co-Separating Sounds of Visual Objects
Ruohan Gao
Kristen Grauman
115
208
0
16 Apr 2019
Bounce and Learn: Modeling Scene Dynamics with Real-World Bounces
Senthil Purushwalkam
Abhinav Gupta
D. Kaufman
Bryan C. Russell
3DH
SSL
50
21
0
15 Apr 2019
Habitat: A Platform for Embodied AI Research
Manolis Savva
Abhishek Kadian
Oleksandr Maksymets
Yili Zhao
Erik Wijmans
...
Jia-Wei Liu
V. Koltun
Jitendra Malik
Devi Parikh
Dhruv Batra
LM&Ro
99
1,401
0
02 Apr 2019
2.5D Visual Sound
Ruohan Gao
Kristen Grauman
VGen
104
130
0
11 Dec 2018
Self-Supervised Generation of Spatial Audio for 360 Video
Pedro Morgado
Nuno Vasconcelos
Timothy R. Langlois
Oliver Wang
MDE
56
173
0
07 Sep 2018
Gibson Env: Real-World Perception for Embodied Agents
F. Xia
Amir Zamir
Zhi-Yang He
Alexander Sax
Jitendra Malik
Silvio Savarese
AI4CE
LM&Ro
77
822
0
31 Aug 2018
On Evaluation of Embodied Navigation Agents
Peter Anderson
Angel X. Chang
Devendra Singh Chaplot
Alexey Dosovitskiy
Saurabh Gupta
...
Jana Kosecka
Jitendra Malik
Roozbeh Mottaghi
Manolis Savva
Amir Zamir
112
795
0
18 Jul 2018
Deep Ordinal Regression Network for Monocular Depth Estimation
Huan Fu
Biwei Huang
Chaohui Wang
Kayhan Batmanghelich
Dacheng Tao
MDE
391
1,725
0
06 Jun 2018
Competitive Collaboration: Joint Unsupervised Learning of Depth, Camera Motion, Optical Flow and Motion Segmentation
Anurag Ranjan
Varun Jampani
Lukas Balles
Kihwan Kim
Deqing Sun
Jonas Wulff
Michael J. Black
SSL
56
591
0
24 May 2018
A 64mW DNN-based Visual Navigation Engine for Autonomous Nano-Drones
Daniele Palossi
Antonio Loquercio
Francesco Conti
Eric Flamand
Davide Scaramuzza
Luca Benini
208
158
0
04 May 2018
Taskonomy: Disentangling Task Transfer Learning
Amir Zamir
Alexander Sax
Bokui (William) Shen
Leonidas Guibas
Jitendra Malik
Silvio Savarese
97
1,215
0
23 Apr 2018
Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
Andrew Owens
Alexei A. Efros
SSL
86
748
0
10 Apr 2018
The Sound of Pixels
Hang Zhao
Chuang Gan
Andrew Rouditchenko
Carl Vondrick
Josh H. McDermott
Antonio Torralba
VLM
85
535
0
09 Apr 2018
Learning to Separate Object Sounds by Watching Unlabeled Video
Ruohan Gao
Rogerio Feris
Kristen Grauman
SSL
63
284
0
05 Apr 2018
Audio-Visual Event Localization in Unconstrained Videos
Yapeng Tian
Jing Shi
Bochen Li
Zhiyao Duan
Chenliang Xu
92
435
0
23 Mar 2018
Revisiting Single Image Depth Estimation: Toward Higher Resolution Maps with Accurate Object Boundaries
Junjie Hu
Mete Ozay
Yan Zhang
Takayuki Okatani
3DV
77
375
0
23 Mar 2018
Unsupervised Representation Learning by Predicting Image Rotations
Spyros Gidaris
Praveer Singh
N. Komodakis
OOD
SSL
DRL
229
3,283
0
21 Mar 2018
Learning to Localize Sound Source in Visual Scenes
Arda Senocak
Tae-Hyun Oh
Junsik Kim
Ming-Hsuan Yang
In So Kweon
SSL
64
344
0
10 Mar 2018
Learning Sight from Sound: Ambient Sound Provides Supervision for Visual Learning
Andrew Owens
Jiajun Wu
Josh H. McDermott
William T. Freeman
Antonio Torralba
SSL
65
176
0
20 Dec 2017
Objects that Sound
Relja Arandjelović
Andrew Zisserman
ObjD
VOS
92
529
0
18 Dec 2017
Visual to Sound: Generating Natural Sound for Videos in the Wild
Yipin Zhou
Zhaowen Wang
Chen Fang
Trung Bui
Tamara L. Berg
VGen
63
207
0
04 Dec 2017
Cross-Domain Self-supervised Multi-task Feature Learning using Synthetic Imagery
Zhongzheng Ren
Yong Jae Lee
SSL
OOD
71
212
0
24 Nov 2017
Unsupervised Learning of Geometry with Edge-aware Depth-Normal Consistency
Zhenheng Yang
Peng Wang
Wenyuan Xu
Liang Zhao
Ram Nevatia
3DV
MDE
62
155
0
10 Nov 2017
Matterport3D: Learning from RGB-D Data in Indoor Environments
Angel X. Chang
Angela Dai
Thomas Funkhouser
Maciej Halber
Matthias Nießner
Manolis Savva
Shuran Song
Andy Zeng
Yinda Zhang
3DV
3DPC
159
1,893
0
18 Sep 2017
Look, Listen and Learn
Relja Arandjelović
Andrew Zisserman
SSL
108
901
0
23 May 2017
Unsupervised Learning of Depth and Ego-Motion from Video
Tinghui Zhou
Matthew A. Brown
Noah Snavely
D. Lowe
MDE
114
2,571
0
25 Apr 2017
SfM-Net: Learning of Structure and Motion from Video
Sudheendra Vijayanarasimhan
Susanna Ricco
Cordelia Schmid
Rahul Sukthankar
Katerina Fragkiadaki
MDE
66
440
0
25 Apr 2017
Learning to Fly by Crashing
Dhiraj Gandhi
Lerrel Pinto
Abhinav Gupta
SSL
89
276
0
19 Apr 2017
Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation
Dan Xu
Elisa Ricci
Wanli Ouyang
Xiaogang Wang
N. Sebe
87
416
0
07 Apr 2017
Colorization as a Proxy Task for Visual Understanding
Gustav Larsson
Michael Maire
Gregory Shakhnarovich
SSL
142
497
0
11 Mar 2017
A Survey on Deep Learning in Medical Image Analysis
G. Litjens
Thijs Kooi
B. Bejnordi
A. Setio
F. Ciompi
Mohsen Ghafoorian
Jeroen van der Laak
Bram van Ginneken
C. I. Sánchez
OOD
602
10,741
0
19 Feb 2017
DeMoN: Depth and Motion Network for Learning Monocular Stereo
Benjamin Ummenhofer
Huizhong Zhou
J. Uhrig
N. Mayer
Eddy Ilg
Alexey Dosovitskiy
Thomas Brox
3DV
MDE
95
701
0
07 Dec 2016
Pyramid Scene Parsing Network
Hengshuang Zhao
Jianping Shi
Xiaojuan Qi
Xiaogang Wang
Jiaya Jia
VOS
SSeg
539
11,984
0
04 Dec 2016
Object-Centric Representation Learning from Unlabeled Videos
Ruohan Gao
Dinesh Jayaraman
Kristen Grauman
46
36
0
01 Dec 2016
Self-Supervised Video Representation Learning With Odd-One-Out Networks
Basura Fernando
Hakan Bilen
E. Gavves
Stephen Gould
SSL
42
450
0
21 Nov 2016
SoundNet: Learning Sound Representations from Unlabeled Video
Y. Aytar
Carl Vondrick
Antonio Torralba
SSL
105
1,040
0
27 Oct 2016
Unsupervised Monocular Depth Estimation with Left-Right Consistency
Clément Godard
Oisin Mac Aodha
Gabriel J. Brostow
MDE
119
2,881
0
13 Sep 2016