Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1702.04405
Cited By
v1
v2 (latest)
ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
14 February 2017
Angela Dai
Angel X. Chang
Manolis Savva
Maciej Halber
Thomas Funkhouser
Matthias Nießner
3DPC
3DV
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes"
50 / 2,387 papers shown
Title
Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors
Katja Schwarz
Norman Mueller
Peter Kontschieder
3DGS
155
2
0
17 Mar 2025
3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o
Dingning Liu
Cheng Wang
Peng Gao
Renrui Zhang
Xinzhu Ma
Yuan Meng
Zhihui Wang
LRM
92
0
0
17 Mar 2025
3D Human Interaction Generation: A Survey
Siyuan Fan
Wenke Huang
Xiantao Cai
Di Lin
VGen
116
0
0
17 Mar 2025
Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning
Xueying Jiang
Wenhao Li
Xiaoqin Zhang
Ling Shao
Shijian Lu
LRM
147
1
0
17 Mar 2025
Less Biased Noise Scale Estimation for Threshold-Robust RANSAC
Johan Edstedt
151
0
0
17 Mar 2025
MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs
Erik Daxberger
Nina Wenzel
David Griffiths
Haiming Gang
Justin Lazarow
...
Kai Kang
Marcin Eichner
Yue Yang
Afshin Dehghan
Peter Grasch
126
5
0
17 Mar 2025
SatDepth: A Novel Dataset for Satellite Image Matching
Rahul P. Deshmukh
A. Kak
MDE
92
0
0
17 Mar 2025
SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs
Guibiao Liao
Qing Li
Zhenyu Bao
Guoping Qiu
Kanglin Liu
3DGS
90
2
0
16 Mar 2025
Deblur Gaussian Splatting SLAM
Francesco Girlanda
D. Rozumnyi
Marc Pollefeys
Martin R. Oswald
3DGS
88
0
0
16 Mar 2025
BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature Analysis
Weiguang Zhao
Rui Zhang
Qiufeng Wang
Guangliang Cheng
K. Huang
99
1
0
16 Mar 2025
TACO: Taming Diffusion for in-the-wild Video Amodal Completion
Ruijie Lu
Yixin Chen
Yu Liu
Jiaxiang Tang
Junfeng Ni
Diwen Wan
Gang Zeng
Siyuan Huang
DiffM
VGen
143
3
0
15 Mar 2025
SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering
Byeongjun Park
Hyojun Go
Hyelin Nam
Byung-Hoon Kim
Hyungjin Chung
Changick Kim
VGen
LLMSV
113
1
0
15 Mar 2025
Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space
Weichen Zhang
Zile Zhou
Zhiheng Zheng
Chen Gao
Jinqiang Cui
Yongqian Li
Xinlei Chen
Xiao-Ping Zhang
LRM
135
5
0
14 Mar 2025
Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation
Hongyu Wen
Yiming Zuo
Venkat Subramanian
Patrick Chen
Jia Deng
3DV
168
0
0
14 Mar 2025
VGGT: Visual Geometry Grounded Transformer
Jianyuan Wang
Minghao Chen
Nikita Karaev
Andrea Vedaldi
Christian Rupprecht
David Novotny
ViT
129
38
0
14 Mar 2025
Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations
Xunzhi Zheng
Dan Xu
AI4CE
78
1
0
13 Mar 2025
OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions
Maxim Popov
Regina Kurkova
Mikhail Iumanov
Jaafar Mahmoud
Sergey Kolyubin
97
0
0
13 Mar 2025
VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames
Zhiqi Li
Chengrui Dong
Yiming Chen
Zhangchi Huang
Peidong Liu
3DGS
ViT
107
2
0
13 Mar 2025
Speedy MASt3R
Jingxing Li
Yongjae Lee
Abhay Kumar Yadav
Cheng-Fang Peng
Rama Chellappa
Deliang Fan
3DGS
132
0
0
13 Mar 2025
Monte Carlo Diffusion for Generalizable Learning-Based RANSAC
Jiadong Wang
Chen Zhao
Wei Ke
Tong Zhang
DiffM
94
0
0
12 Mar 2025
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
Hyeonho Jeong
Suhyeon Lee
Jong Chul Ye
VGen
492
2
0
12 Mar 2025
SAS: Segment Any 3D Scene with Integrated 2D Priors
Zechao Li
Jiahao Lu
Jiacheng Deng
Hanzhi Chang
Lifan Wu
Yanzhe Liang
Tianzhu Zhang
112
0
0
11 Mar 2025
PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability
Weijie Zhou
Manli Tao
Chaoyang Zhao
Haiyun Guo
Honghui Dong
Ming Tang
Jinqiao Wang
112
2
0
11 Mar 2025
Ev-Layout: A Large-scale Event-based Multi-modal Dataset for Indoor Layout Estimation and Tracking
Xucheng Guo
Yiran Shen
Xiaofang Xiao
Yuanfeng Zhou
Lin Wang
3DV
3DPC
MDE
151
0
0
11 Mar 2025
PE3R: Perception-Efficient 3D Reconstruction
Jie Hu
Shizun Wang
Xinchao Wang
117
1
0
10 Mar 2025
Fixing the RANSAC Stopping Criterion
Johannes L. Schonberger
Viktor Larsson
Marc Pollefeys
76
0
0
10 Mar 2025
LBM: Latent Bridge Matching for Fast Image-to-Image Translation
Clement Chadebec
O. Tasar
Sanjeev Sreetharan
Benjamin Aubin
148
0
0
10 Mar 2025
DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection
Johan Edstedt
Georg Bökman
Mårten Wadenbäck
Michael Felsberg
91
0
0
10 Mar 2025
Infinite Leagues Under the Sea: Photorealistic 3D Underwater Terrain Generation by Latent Fractal Diffusion Models
Tianyi Zhang
Weiming Zhi
Joshua Mangelson
Matthew Johnson-Roberson
100
0
0
09 Mar 2025
PointDiffuse: A Dual-Conditional Diffusion Model for Enhanced Point Cloud Semantic Segmentation
Yong-xing He
Hongshan Yu
Mingtao Feng
Tongjia Chen
Zechuan Li
Anwaar Ulhaq
Saeed Anwar
Ajmal Mian
DiffM
169
0
0
08 Mar 2025
StreamGS: Online Generalizable Gaussian Splatting Reconstruction for Unposed Image Streams
Yang LI
Jinglu Wang
Lei Chu
Xiao Li
Shiu-hong Kao
Ying Chen
Yan Lu
3DGS
124
1
0
08 Mar 2025
EDM: Efficient Deep Feature Matching
Xi Li
Tong Rao
Cihui Pan
96
0
0
07 Mar 2025
HexPlane Representation for 3D Semantic Scene Understanding
Zeren Chen
Yuenan Hou
Yulin Chen
Li Liu
Xiao Sun
Lu Sheng
3DPC
100
0
0
07 Mar 2025
GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding
Xihan Wang
Dianyi Yang
Yu Gao
Yufeng Yue
Yi Yang
M. Fu
3DGS
83
0
0
06 Mar 2025
DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering
Jingzhou Luo
Yang Liu
Weixing Chen
Zhen Li
Yansen Wang
G. Li
Liang Lin
129
3
0
05 Mar 2025
JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba
Xiaoyong Lu
Songlin Du
Mamba
136
2
0
05 Mar 2025
StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts
Zhaoxing Gan
Mengtian Li
Ruhua Chen
Zhongxia Ji
Sichen Guo
Huanling Hu
Guangnan Ye
Zuo Hu
DiffM
VGen
102
0
0
04 Mar 2025
Empowering Sparse-Input Neural Radiance Fields with Dual-Level Semantic Guidance from Dense Novel Views
Yingji Zhong
Kaichen Zhou
Zhihao Li
Lanqing Hong
Zhiyu Li
Dan Xu
156
1
0
04 Mar 2025
LLM-Safety Evaluations Lack Robustness
Tim Beyer
Sophie Xhonneux
Simon Geisler
Gauthier Gidel
Leo Schwinn
Stephan Günnemann
ALM
ELM
485
2
0
04 Mar 2025
Category-level Meta-learned NeRF Priors for Efficient Object Mapping
Saad Ejaz
Hriday Bavle
Laura Ribeiro
Holger Voos
J. López
150
0
0
03 Mar 2025
vS-Graphs: Integrating Visual SLAM and Situational Graphs through Multi-level Scene Understanding
Ali Tourani
Saad Ejaz
Hriday Bavle
David Morilla-Cabello
J. López
Holger Voos
123
2
0
03 Mar 2025
MUSt3R: Multi-view Network for Stereo 3D Reconstruction
Yohann Cabon
Lucas Stoffl
L. Antsfeld
G. Csurka
Boris Chidlovskii
Jérôme Revaud
Vincent Leroy
3DGS
3DV
111
3
0
03 Mar 2025
OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging
Yijie Tang
JIazhao Zhang
Yuqing Lan
Yulan Guo
Dezun Dong
Chenyang Zhu
K. Xu
430
1
0
03 Mar 2025
Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning
Hanxun Yu
Wentong Li
Song Wang
Jintai Chen
Jianke Zhu
3DV
LRM
159
6
0
01 Mar 2025
RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges
Thibaut Loiseau
Guillaume Bourmaud
3DV
VLM
137
1
0
27 Feb 2025
UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler
Luigi Piccinelli
Daniel Gehrig
Yifan Yang
Mattia Segu
Siyuan Li
Wim Abbeloos
Luc Van Gool
MDE
144
10
0
27 Feb 2025
You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving
Guangfeng Jiang
Jun Liu
Yongxuan Lv
Yongpeng Wu
Xianfei Li
Wenlong Liao
Tao He
Pai Peng
3DPC
110
0
0
27 Feb 2025
Cutting-edge 3D reconstruction solutions for underwater coral reef images: A review and comparison
J. Zhong
Ming Li
Armin Gruen
Konrad Schindler
Xuan Liao
Qinghua Guo
140
0
0
27 Feb 2025
From Thousands to Billions: 3D Visual Language Grounding via Render-Supervised Distillation from 2D VLMs
Ang Cao
Sergio Arnaud
Oleksandr Maksymets
Jianing Yang
Ayush Jain
...
Aravind Rajeswaran
Franziska Meier
Justin Johnson
Jeong Joon Park
Alexander Sax
144
0
0
27 Feb 2025
ATLAS Navigator: Active Task-driven LAnguage-embedded Gaussian Splatting
Dexter Ong
Yuezhan Tao
Varun Murali
Igor Spasojevic
Vijay Kumar
Pratik Chaudhari
3DGS
165
1
0
27 Feb 2025
Previous
1
2
3
4
5
6
...
46
47
48
Next