Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1702.04405
Cited By
v1
v2 (latest)
ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
14 February 2017
Angela Dai
Angel X. Chang
Manolis Savva
Maciej Halber
Thomas Funkhouser
Matthias Nießner
3DPC
3DV
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes"
50 / 2,387 papers shown
Title
Scaling 4D Representations
João Carreira
Dilara Gokay
Michael King
Chuhan Zhang
Ignacio Rocco
...
Viorica Patraucean
Dima Damen
Pauline Luc
Mehdi S. M. Sajjadi
Andrew Zisserman
138
5
0
19 Dec 2024
3D Registration in 30 Years: A Survey
Jiaqi Yang
Chu’ai Zhang
Zhengbao Wang
Xinyue Cao
Xuan Ouyang
...
Borui Lu
Zhiyi Xia
Qian Zhang
Yulan Guo
Yanning Zhang
3DPC
171
2
0
18 Dec 2024
Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion
Massimiliano Viola
Kevin Qu
Nando Metzger
Bingxin Ke
Alexander Becker
Konrad Schindler
Anton Obukhov
VLM
MDE
231
6
0
18 Dec 2024
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
Jihan Yang
Shusheng Yang
Anjali W. Gupta
Rilyn Han
Li Fei-Fei
Saining Xie
LRM
212
107
0
18 Dec 2024
MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data
Hanwen Jiang
Zexiang Xu
Desai Xie
Zheyu Chen
Haian Jin
...
Xin Sun
Jiuxiang Gu
Qixing Huang
Georgios Pavlakos
Hao Tan
480
4
0
18 Dec 2024
Towards a Training Free Approach for 3D Scene Editing
Vivek Madhavaram
Shivangana Rawat
Chaitanya Devaguptapu
Charu Sharma
Manohar Kaul
DiffM
136
0
0
17 Dec 2024
Coherent 3D Scene Diffusion From a Single RGB Image
Manuel Dahnert
Angela Dai
Norman Muller
Matthias Nießner
127
0
0
13 Dec 2024
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
Baorui Ma
Huachen Gao
Haoge Deng
Zhengxiong Luo
Tiejun Huang
Lulu Tang
Xinlong Wang
DiffM
VGen
266
16
0
09 Dec 2024
Prism: Semi-Supervised Multi-View Stereo with Monocular Structure Priors
Alex Rich
Noah Stier
P. Sen
Tobias Höllerer
MDE
142
0
0
08 Dec 2024
TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances
Wenting Xu
Viorela Ila
Luping Zhou
Craig T. Jin
166
1
0
07 Dec 2024
TANGO: Training-free Embodied AI Agents for Open-world Tasks
Filippo Ziliotto
Tommaso Campari
Luciano Serafini
Lamberto Ballan
LLMAG
LM&Ro
MLLM
LRM
150
2
0
05 Dec 2024
Multi-view Image Diffusion via Coordinate Noise and Fourier Attention
Justin D. Theiss
Norman Müller
Daeil Kim
Aayush Prakash
102
0
0
04 Dec 2024
Planar Gaussian Splatting
F. G. Zanjani
H. Cai
Hanno Ackermann
Leila Mirvakhabova
Fatih Porikli
3DGS
117
2
0
02 Dec 2024
World-consistent Video Diffusion with Explicit 3D Modeling
Qihang Zhang
Shuangfei Zhai
Miguel Angel Bautista
Kevin Miao
Alexander Toshev
J. Susskind
Jiatao Gu
VGen
139
9
0
02 Dec 2024
3DSceneEditor: Controllable 3D Scene Editing with Gaussian Splatting
Ziyang Yan
Lei Li
Yihua Shao
Siyu Chen
Wuzong Kai
Lei Li
Hao Zhao
Fabio Remondino
3DGS
168
3
0
02 Dec 2024
RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian Splatting
Zhenzhong Cao
Chenyang Zhao
Qianyi Zhang
Jinzheng Guang
Yinuo Song Jingtai Liu
144
1
0
02 Dec 2024
LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences
Hongyan Zhi
Peihao Chen
Junyan Li
Shuailei Ma
Xinyu Sun
Tianhang Xiang
Yinjie Lei
Mingkui Tan
Chuang Gan
174
8
0
02 Dec 2024
FlashSLAM: Accelerated RGB-D SLAM for Real-Time 3D Scene Reconstruction with Gaussian Splatting
Phu-Cuong Pham
Damon Conover
Aniket Bera
3DGS
110
0
0
01 Dec 2024
FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation
Yunpeng Bai
Qixing Huang
DiffM
172
0
0
01 Dec 2024
Density-aware Global-Local Attention Network for Point Cloud Segmentation
Chade Li
Pengju Zhang
Yihong Wu
3DPC
102
0
0
30 Nov 2024
Uni-SLAM: Uncertainty-Aware Neural Implicit SLAM for Real-Time Dense Indoor Scene Reconstruction
Shaoxiang Wang
Yaxu Xie
Chun-Peng Chang
Christen Millerdurai
A. Pagani
Didier Stricker
171
2
0
29 Nov 2024
AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos
Yuze He
Wang Zhao
Shaohui Liu
Yubin Hu
Yushi Bai
Yu-Hui Wen
Yang Liu
122
1
0
29 Nov 2024
Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding
Weinan Zhang
Lu Zhang
Ping Hu
Liqian Ma
Yunzhi Zhuge
Huchuan Lu
3DGS
134
2
0
29 Nov 2024
InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception
Haijie Li
Y. Wu
Jiarui Meng
Qiankun Gao
Zhiyao Zhang
Ronggang Wang
Jian Zhang
ISeg
157
4
0
28 Nov 2024
PCDreamer: Point Cloud Completion Through Multi-view Diffusion Priors
Guangshun Wei
Yuan Feng
Long Ma
Chen Wang
Yuanfeng Zhou
Changjian Li
566
0
0
28 Nov 2024
Point Cloud Unsupervised Pre-training via 3D Gaussian Splatting
Hao Liu
Minglin Chen
Yanni Ma
Haihong Xiao
Ying He
3DGS
3DPC
127
1
0
27 Nov 2024
SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation
Duc-Hai Pham
Tung Do
P. Nguyen
Binh-Son Hua
K. Nguyen
Rang Nguyen
MDE
144
3
0
27 Nov 2024
Towards Cross-device and Training-free Robotic Grasping in 3D Open World
Weiguang Zhao
Chenru Jiang
Chengrui Zhang
Jie Sun
Yuyao Yan
Rui Zhang
K. Huang
128
1
0
27 Nov 2024
Pixel-aligned RGB-NIR Stereo Imaging and Dataset for Robot Vision
Jinnyeong Kim
Seung-Hwan Baek
3DV
129
0
0
27 Nov 2024
Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration
Junyuan Deng
Wei Yin
Xiaoyang Guo
Qian Zhang
Xiaotao Hu
Weiqiang Ren
Xiaoxiao Long
P. Tan
DiffM
MDE
155
1
0
26 Nov 2024
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection
Zhongyu Xia
Jishuo Li
Zhiwei Lin
Xinhao Wang
Yansen Wang
Ming-Hsuan Yang
VLM
171
3
0
26 Nov 2024
Revisiting Point Cloud Completion: Are We Ready For The Real-World?
Stuti Pathak
Prashant Kumar
Nicholus Mboga
Gunther Steenackers
R. Penne
Rudi Penne
521
0
0
26 Nov 2024
SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting
Gyeongjin Kang
Jisang Yoo
Jihyeon Park
Seungtae Nam
Hyeonsoo Im
Sangheon Shin
Sangpil Kim
Eunbyung Park
3DGS
332
6
0
26 Nov 2024
PreF3R: Pose-Free Feed-Forward 3D Gaussian Splatting from Variable-length Image Sequence
Zequn Chen
Jiezhi Yang
Heng Yang
3DGS
123
4
0
25 Nov 2024
Probing the Mid-level Vision Capabilities of Self-Supervised Learning
Xuweiyi Chen
Markus Marks
Zezhou Cheng
173
0
0
25 Nov 2024
Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking
P. Nguyen
Minh Luu
Anh Tran
Cuong Pham
K. Nguyen
3DPC
119
0
0
25 Nov 2024
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
Chan Hee Song
Valts Blukis
Jonathan Tremblay
Stephen Tyree
Yu-Chuan Su
Stan Birchfield
247
20
0
25 Nov 2024
PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
Ziyao Zeng
Jingcheng Ni
Daniel Wang
Patrick Rim
Younjoon Chung
Fengyu Yang
Byung-Woo Hong
A. Wong
DiffM
MDE
287
2
0
24 Nov 2024
Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Rui Huang
Henry Zheng
Yan Wang
Zhuofan Xia
Marco Pavone
Gao Huang
3DPC
VLM
132
1
0
23 Nov 2024
3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning
Yuncong Yang
Han Yang
Jiachen Zhou
Peihao Chen
Hongxin Zhang
Yilun Du
Chuang Gan
128
0
0
23 Nov 2024
Generating 3D-Consistent Videos from Unposed Internet Photos
Gene Chou
Kai Zhang
Sai Bi
Hao Tan
Zexiang Xu
Fujun Luan
Bharath Hariharan
Noah Snavely
3DGS
VGen
164
3
0
20 Nov 2024
DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild
Weicai Ye
Xinyu Chen
Ruohao Zhan
Di Huang
Xiaoshui Huang
Haoyi Zhu
Hujun Bao
Wanli Ouyang
Tong He
Guofeng Zhang
133
5
0
20 Nov 2024
BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation
Umamaheswaran Raman Kumar
A. Fayjie
Jurgen Hannaert
Patrick Vandewalle
3DV
3DPC
137
1
0
20 Nov 2024
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
Ziyi Wang
Yijiao Wang
Xumin Yu
Jie Zhou
Jiwen Lu
100
0
0
20 Nov 2024
VLN-Game: Vision-Language Equilibrium Search for Zero-Shot Semantic Navigation
Bangguo Yu
Yuzhen Liu
Lei Han
Hamidreza Kasaei
Tingguang Li
M. Cao
LM&Ro
182
3
0
18 Nov 2024
The Oxford Spires Dataset: Benchmarking Large-Scale LiDAR-Visual Localisation, Reconstruction and Radiance Field Methods
Yifu Tao
Miguel Ángel Muñoz-Bañón
Lintong Zhang
Jiahao Wang
L. Fu
Maurice F. Fallon
60
7
0
15 Nov 2024
TESGNN: Temporal Equivariant Scene Graph Neural Networks for Efficient and Robust Multi-View 3D Scene Understanding
Quang P.M. Pham
Khoi T.N. Nguyen
Lan C. Ngo
Dezhen Song
Truong Do
Truong-Son Hy
3DPC
125
2
0
15 Nov 2024
DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization
Yueming Xu
Haochen Jiang
Zhongyang Xiao
Jianfeng Feng
Li Zhang
3DGS
78
13
0
13 Nov 2024
DyConfidMatch: Dynamic Thresholding and Re-sampling for 3D Semi-supervised Learning
Zhaoyu Chen
Bing Li
66
0
0
13 Nov 2024
S
E
(
3
)
SE(3)
SE
(
3
)
Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation
Yinshuang Xu
Dian Chen
Katherine Liu
Sergey Zakharov
Rares Andrei Ambrus
Kostas Daniilidis
Vitor Campagnolo Guizilini
MDE
66
1
0
11 Nov 2024
Previous
1
2
3
...
6
7
8
...
46
47
48
Next