ResearchTrend.AI
Learning Spatial Common Sense with Geometry-Aware Recurrent Networks

31 December 2018
H. Tung
Ricson Cheng
Katerina Fragkiadaki

Papers citing "Learning Spatial Common Sense with Geometry-Aware Recurrent Networks"

50 / 53 papers shown
RayZer: A Self-supervised Large View Synthesis Model
Hanwen Jiang
Hao Tan
Peng Wang
Haian Jin
Yue Zhao
...
Kai Zhang
Fujun Luan
Kalyan Sunkavalli
Qixing Huang
Georgios Pavlakos
70
0
0
01 May 2025
SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views
Chao Xu
Ang Li
Linghao Chen
Yulin Liu
Ruoxi Shi
Hao Su
Minghua Liu
3DGS
59
21
0
19 Aug 2024
GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping
Yuhang Zheng
Xiangyu Chen
Yupeng Zheng
Songen Gu
Runyi Yang
...
Chao Yang
Dawei Wang
Zhen Chen
Xiaoxiao Long
Meiqing Wang
63
44
0
14 Mar 2024
Brain Decodes Deep Nets
Huzheng Yang
James C. Gee
Jianbo Shi
38
7
0
03 Dec 2023
TEMPO: Efficient Multi-View Pose Estimation, Tracking, and Forecasting
Rohan Choudhury
Kris Kitani
László A. Jeni
3DH
42
18
0
14 Sep 2023
3D View Prediction Models of the Dorsal Visual Stream
Gabriel H. Sarch
Hsiao-Yu Fish Tung
A. Wang
Jacob S. Prince
Michael J. Tarr
MDE
29
2
0
04 Sep 2023
Distilled Feature Fields Enable Few-Shot Language-Guided Manipulation
Bokui (William) Shen
Ge Yang
Alan Yu
J. Wong
L. Kaelbling
Phillip Isola
VLM
34
104
0
27 Jul 2023
Act3D: 3D Feature Field Transformers for Multi-Task Robotic Manipulation
Théophile Gervet
Zhou Xian
N. Gkanatsios
Katerina Fragkiadaki
53
65
0
30 Jun 2023
FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow
Cameron Smith
Yilun Du
A. Tewari
Vincent Sitzmann
3DH
37
28
0
31 May 2023
VoxDet: Voxel Learning for Novel Instance Detection
Bowen Li
Jiashun Wang
Yaoyu Hu
Chen Wang
Sebastian Scherer
38
6
0
26 May 2023
Neural Volumetric Memory for Visual Locomotion Control
Ruihan Yang
Ge Yang
Xiaolong Wang
30
53
0
03 Apr 2023
Few-View Object Reconstruction with Unknown Categories and Camera Poses
Hanwen Jiang
Zhenyu Jiang
Kristen Grauman
Yuke Zhu
35
42
0
08 Dec 2022
Visual Reinforcement Learning with Self-Supervised 3D Representations
Yanjie Ze
Nicklas Hansen
Yinbo Chen
Mohit Jain
Xiaolong Wang
SSL
34
49
0
13 Oct 2022
MonoNeRF: Learning Generalizable NeRFs from Monocular Videos without Camera Pose
Yang Fu
Ishan Misra
Xiaolong Wang
MDE
30
9
0
13 Oct 2022
Neural Groundplans: Persistent Neural Scene Representations from a Single Image
Prafull Sharma
A. Tewari
Yilun Du
Sergey Zakharov
Rares Andrei Ambrus
Adrien Gaidon
William T. Freeman
F. Durand
J. Tenenbaum
Vincent Sitzmann
SSL
OCL
26
16
0
22 Jul 2022
Trans4Map: Revisiting Holistic Bird's-Eye-View Mapping from Egocentric Images to Allocentric Semantics with Vision Transformers
Chang Chen
Jiaming Zhang
Kailun Yang
Kunyu Peng
Rainer Stiefelhagen
ViT
34
8
0
13 Jul 2022
Simple-BEV: What Really Matters for Multi-Sensor BEV Perception?
Adam W. Harley
Zhaoyuan Fang
Jie Li
Rares Andrei Ambrus
Katerina Fragkiadaki
49
117
0
16 Jun 2022
TartanDrive: A Large-Scale Dataset for Learning Off-Road Dynamics Models
S. Triest
Matthew Sivaprakasam
Sean J. Wang
Wenshan Wang
Aaron M. Johnson
Sebastian Scherer
32
55
0
03 May 2022
Episodic Memory Question Answering
Samyak Datta
Sameer Dharur
Vincent Cartillier
Ruta Desai
Mukul Khanna
Dhruv Batra
Devi Parikh
EgoV
19
31
0
03 May 2022
Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects
Yujie Lu
Jianren Wang
Vikash Kumar
31
4
0
31 Mar 2022
Neural Part Priors: Learning to Optimize Part-Based Object Completion in RGB-D Scans
Alexey Bokhovkin
Angela Dai
3DPC
21
4
0
17 Mar 2022
The Right Spin: Learning Object Motion from Rotation-Compensated Flow Fields
Pia Bideau
Erik Learned-Miller
Cordelia Schmid
Alahari Karteek
OCL
29
6
0
28 Feb 2022
HVH: Learning a Hybrid Neural Volumetric Representation for Dynamic Hair Performance Capture
Ziyan Wang
Giljoo Nam
Tuur Stuyck
Stephen Lombardi
Michael Zollhoefer
Jessica Hodgins
Christoph Lassner
3DH
37
28
0
13 Dec 2021
Video Autoencoder: self-supervised disentanglement of static 3D structure and motion
Zihang Lai
Sifei Liu
Alexei A. Efros
Xiaolong Wang
VGen
46
31
0
06 Oct 2021
Embodied AI-Driven Operation of Smart Cities: A Concise Review
Farzan Shenavarmasouleh
F. Mohammadi
M. Amini
H. Arabnia
33
8
0
22 Aug 2021
STR-GQN: Scene Representation and Rendering for Unknown Cameras Based on Spatial Transformation Routing
Wen-Cheng Chen
Min-Chun Hu
Chu-Song Chen
22
6
0
06 Aug 2021
Fast and Explicit Neural View Synthesis
Pengsheng Guo
Miguel Angel Bautista
Alex Colburn
Liang Yang
Daniel Ulbricht
J. Susskind
Qi Shan
3DV
24
34
0
12 Jul 2021
3D Neural Scene Representations for Visuomotor Control
Yunzhu Li
Shuang Li
Vincent Sitzmann
Pulkit Agrawal
Antonio Torralba
30
138
0
08 Jul 2021
Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering
Vincent Sitzmann
Semon Rezchikov
William T. Freeman
J. Tenenbaum
F. Durand
3DV
54
288
0
04 Jun 2021
A Geometry-Informed Deep Learning Framework for Ultra-Sparse 3D Tomographic Image Reconstruction
Liyue Shen
Wei Zhao
D. Capaldi
John M. Pauly
Lei Xing
32
28
0
25 May 2021
MVS2D: Efficient Multi-view Stereo via Attention-Driven 2D Convolutions
Zhenpei Yang
Zhile Ren
Qi Shan
Qi-Xing Huang
3DV
53
51
0
27 Apr 2021
CoCoNets: Continuous Contrastive 3D Scene Representations
Shamit Lal
Mihir Prabhudesai
Ishita Mediratta
Adam W. Harley
Katerina Fragkiadaki
SSL
3DH
3DPC
36
25
0
08 Apr 2021
Learning Neural Representation of Camera Pose with Matrix Representation of Pose Shift via View Synthesis
Y. Zhu
Ruiqi Gao
Siyuan Huang
Song-Chun Zhu
Ying Nian Wu
SSL
24
9
0
04 Apr 2021
HyperDynamics: Meta-Learning Object and Agent Dynamics with Hypernetworks
Zhou Xian
Shamit Lal
H. Tung
Emmanouil Antonios Platanios
Katerina Fragkiadaki
AI4CE
35
23
0
17 Mar 2021
Deep Continuous Fusion for Multi-Sensor 3D Object Detection
Ming Liang
Binh Yang
Shenlong Wang
R. Urtasun
3DPC
208
841
0
20 Dec 2020
3D-OES: Viewpoint-Invariant Object-Factorized Environment Simulators
H. Tung
Xian Zhou
Mihir Prabhudesai
Shamit Lal
Katerina Fragkiadaki
28
28
0
12 Nov 2020
Disentangling 3D Prototypical Networks For Few-Shot Concept Learning
Mihir Prabhudesai
Shamit Lal
Darshan Patil
H. Tung
Adam W. Harley
Katerina Fragkiadaki
OCL
3DV
3DPC
24
20
0
06 Nov 2020
3D Object Recognition By Corresponding and Quantizing Neural 3D Scene Representations
Mihir Prabhudesai
Shamit Lal
H. Tung
Adam W. Harley
Shubhankar Potdar
Katerina Fragkiadaki
3DPC
20
2
0
30 Oct 2020
Semantic MapNet: Building Allocentric Semantic Maps and Representations from Egocentric Views
Vincent Cartillier
Zhile Ren
Neha Jain
Stefan Lee
Irfan Essa
Dhruv Batra
3DPC
29
74
0
02 Oct 2020
Learning to Set Waypoints for Audio-Visual Navigation
Changan Chen
Sagnik Majumder
Ziad Al-Halah
Ruohan Gao
Santhosh Kumar Ramakrishnan
Kristen Grauman
SSL
20
5
0
21 Aug 2020
Tracking Emerges by Looking Around Static Scenes, with Neural 3D Mapping
Adam W. Harley
S. K. Lakshmikanth
Paul Schydlo
Katerina Fragkiadaki
3DPC
19
9
0
04 Aug 2020
Continuous Object Representation Networks: Novel View Synthesis without Target View Supervision
Nicolai Häni
Selim Engin
Jun-Jee Chao
Volkan Isler
3DV
6
0
0
30 Jul 2020
Novel Object Viewpoint Estimation through Reconstruction Alignment
Mohamed El Banani
Jason J. Corso
David Fouhey
26
14
0
05 Jun 2020
Differentiable Mapping Networks: Learning Structured Map Representations for Sparse Visual Localization
Peter Karkus
A. Angelova
Vincent Vanhoucke
Rico Jonschkowski
25
11
0
19 May 2020
Epipolar Transformers
Yihui He
Rui Yan
Katerina Fragkiadaki
Shoou-I Yu
31
168
0
10 May 2020
CoReNet: Coherent 3D scene reconstruction from a single RGB image
S. Popov
Pablo Bauszat
V. Ferrari
3DPC
3DV
28
71
0
27 Apr 2020
Semantic Implicit Neural Scene Representations With Semi-Supervised Training
Amit Kohli
Vincent Sitzmann
Gordon Wetzstein
3DPC
25
14
0
28 Mar 2020
A Neural Rendering Framework for Free-Viewpoint Relighting
Zhaoyu Chen
Anpei Chen
Guli Zhang
Chengyuan Wang
Yu Ji
Kiriakos N. Kutulakos
Jingyi Yu
27
50
0
26 Nov 2019
Embodied Language Grounding with 3D Visual Feature Representations
Mihir Prabhudesai
H. Tung
Syed Ashar Javed
Maximilian Sieb
Adam W. Harley
Katerina Fragkiadaki
28
21
0
02 Oct 2019
Learning from Unlabelled Videos Using Contrastive Predictive Neural 3D Mapping
Adam W. Harley
S. K. Lakshmikanth
Fangyu Li
Xian Zhou
H. Tung
Katerina Fragkiadaki
SSL
20
5
0
10 Jun 2019