Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1702.04405
Cited By
ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
14 February 2017
Angela Dai
Angel X. Chang
Manolis Savva
Maciej Halber
Thomas Funkhouser
Matthias Nießner
3DPC
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes"
50 / 899 papers shown
Title
seg_3D_by_PC2D: Multi-View Projection for Domain Generalization and Adaptation in 3D Semantic Segmentation
Andrew Caunes
Thierry Chateau
Vincent Frémont
3DPC
3DV
24
0
0
21 May 2025
Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation
Zihan Wang
Seungjun Lee
Gim Hee Lee
VGen
24
0
0
16 May 2025
Depth Anything with Any Prior
Zehan Wang
Siyu Chen
Lihe Yang
Jialei Wang
Ziang Zhang
Hengshuang Zhao
Zhou Zhao
3DGS
VLM
MDE
44
0
0
15 May 2025
Sparse Point Cloud Patches Rendering via Splitting 2D Gaussians
Ma Changfeng
Bi Ran
Guo Jie
Wang Chongjun
Guo Yanwen
3DPC
38
0
0
14 May 2025
Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis
Bingxin Ke
Kevin Qu
Tianfu Wang
Nando Metzger
Shengyu Huang
Bo Li
Anton Obukhov
Konrad Schindler
DiffM
VLM
41
0
0
14 May 2025
SLAG: Scalable Language-Augmented Gaussian Splatting
Laszlo Szilagyi
Francis Engelmann
Jeannette Bohg
3DGS
59
0
0
12 May 2025
PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes
Ahmed Abdelreheem
Filippo Aleotti
Jamie Watson
Z. Qureshi
Abdelrahman Eldesokey
Peter Wonka
Gabriel J. Brostow
Sara Vicente
Guillermo Garcia-Hernando
DiffM
66
0
0
08 May 2025
SITE: towards Spatial Intelligence Thorough Evaluation
Wenjie Wang
Reuben Tan
Pengyue Zhu
Jianwei Yang
Zhengyuan Yang
Lijuan Wang
Andrey Kolobov
Jianfeng Gao
Boqing Gong
54
0
0
08 May 2025
SpatialPrompting: Keyframe-driven Zero-Shot Spatial Reasoning with Off-the-Shelf Multimodal Large Language Models
Shun Taguchi
Hideki Deguchi
Takumi Hamazaki
Hiroyuki Sakai
ReLM
LRM
57
0
0
08 May 2025
Occupancy World Model for Robots
Zhang Zhang
Qiang Zhang
Wei Cui
Shuai Shi
Yijie Guo
...
Hao-Ran Cheng
Xiaozhu Ju
Zhengping Che
Renjing Xu
Jian-Bo Tang
43
0
0
07 May 2025
GSsplat: Generalizable Semantic Gaussian Splatting for Novel-view Synthesis in 3D Scenes
Feng Xiao
Hongbin Xu
Wanlin Liang
Wenxiong Kang
3DGS
54
0
0
07 May 2025
AS3D: 2D-Assisted Cross-Modal Understanding with Semantic-Spatial Scene Graphs for 3D Visual Grounding
Feng Xiao
Hongbin Xu
Guocan Zhao
Wenxiong Kang
56
0
0
07 May 2025
Matching Distance and Geometric Distribution Aided Learning Multiview Point Cloud Registration
Shiqi Li
Jihua Zhu
Yifan Xie
Naiwen Hu
Di Wang
3DPC
79
3
0
06 May 2025
LiftFeat: 3D Geometry-Aware Local Feature Matching
Yepeng Liu
Wenpeng Lai
Zhou Zhao
Yuxuan Xiong
Jinchi Zhu
Jun Cheng
Yongchao Xu
46
0
0
06 May 2025
Focus What Matters: Matchability-Based Reweighting for Local Feature Matching
Dongyue Li
210
0
0
04 May 2025
A Birotation Solution for Relative Pose Problems
Hongbo Zhao
Ziwei Long
Mengtan Zhang
Haoyu Wang
Qijun Chen
Rui Fan
30
0
0
04 May 2025
Probabilistic Interactive 3D Segmentation with Hierarchical Neural Processes
Jie Liu
Pan Zhou
Zehao Xiao
Jiayi Shen
Wenzhe Yin
Jan-Jakob Sonke
E. Gavves
34
0
0
03 May 2025
3DWG: 3D Weakly Supervised Visual Grounding via Category and Instance-Level Alignment
Xianrui Li
Jing Liu
Nuowei Han
Liang Heng
Y. Guo
Hao Dong
Yang Liu
74
0
0
03 May 2025
JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers
Kwon Byung-Ki
Qi Dai
Lee Hyoseok
Chong Luo
Tae-Hyun Oh
73
0
0
01 May 2025
Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation
Feng Xue
Wenzhuang Xu
Guofeng Zhong
Anlong Minga
N. Sebe
65
0
0
01 May 2025
Direct Motion Models for Assessing Generated Videos
Kelsey R. Allen
Carl Doersch
Guangyao Zhou
Mohammed Suhail
Danny Driess
...
Thomas Kipf
Mehdi S. M. Sajjadi
Kevin P. Murphy
João Carreira
Sjoerd van Steenkiste
EGVM
DiffM
VGen
88
0
0
30 Apr 2025
Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video
Hoang Chuong Nguyen
Wei Mao
Jose M. Alvarez
Miaomiao Liu
57
0
0
28 Apr 2025
MISO: Multiresolution Submap Optimization for Efficient Globally Consistent Neural Implicit Reconstruction
Yulun Tian
Hanwen Cao
Sunghwan Kim
Nikolay Atanasov
202
0
0
27 Apr 2025
SORT3D: Spatial Object-centric Reasoning Toolbox for Zero-Shot 3D Grounding Using Large Language Models
Nader Zantout
Haochen Zhang
Pujith Kachana
J. Qiu
Ji Zhang
Wenshan Wang
LM&Ro
LRM
243
0
0
25 Apr 2025
VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation
Mingxia Zhan
Li Zhang
Xiaomeng Chu
Beibei Wang
MDE
66
0
0
21 Apr 2025
ROI-Guided Point Cloud Geometry Compression Towards Human and Machine Vision
Xie Liang
Gao Wei
Zhenghui Ming
Li Ge
3DPC
49
1
0
19 Apr 2025
Leveraging Automatic CAD Annotations for Supervised Learning in 3D Scene Understanding
Yuchen Rao
Stefan Ainetter
Sinisa Stekovic
Vincent Lepetit
Friedrich Fraundorfer
3DPC
3DV
272
0
0
18 Apr 2025
3D-PointZshotS: Geometry-Aware 3D Point Cloud Zero-Shot Semantic Segmentation Narrowing the Visual-Semantic Gap
Minmin Yang
Huantao Ren
Senem Velipasalar
3DPC
56
0
0
16 Apr 2025
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang
Xin Xu
Yu-Xiong Wang
DiffM
72
0
0
15 Apr 2025
To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition
Davide Sferrazza
Gabriele Berton
Gabriele Trivigno
Carlo Masone
39
0
0
08 Apr 2025
Learning Affine Correspondences by Integrating Geometric Constraints
Pengju Sun
Banglei Guan
Zhenbao Yu
Yang Shang
Qifeng Yu
Dániel Baráth
31
1
0
07 Apr 2025
FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking
Ulas Gunes
Matias Turkulainen
Xuqian Ren
Dieter Büchler
Arno Solin
Esa Rahtu
3DV
52
0
0
02 Apr 2025
ProtoGuard-guided PROPEL: Class-Aware Prototype Enhancement and Progressive Labeling for Incremental 3D Point Cloud Segmentation
Haoyang Li
Yuecong Xu
Junjie Chen
Kemi Ding
3DPC
CLL
47
0
0
02 Apr 2025
Improved Visual-Spatial Reasoning via R1-Zero-Like Training
Zhenyi Liao
Qingsong Xie
Yanhao Zhang
Zijian Kong
Haonan Lu
Zhenyu Yang
Zhijie Deng
ReLM
VLM
LRM
109
1
1
01 Apr 2025
STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?
Heng Chang
Yuyao Zhang
Tao Lin
Xiangrui Liu
Wenxiao Cai
Zhengyang Liang
Bo Zhao
LRM
65
4
0
31 Mar 2025
From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3D
Jiahui Zhang
Yurui Chen
Yanpeng Zhou
Yueming Xu
Ze Huang
...
Xinyue Cai
G. Huang
Xingyue Quan
Hang Xu
Li Zhang
LRM
100
1
0
29 Mar 2025
Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization
Iñigo Pikabea
Iñaki Lacunza
Oriol Pareras
Carlos Escolano
Aitor Gonzalez-Agirre
Javier Hernando
Marta Villegas
VLM
66
0
0
28 Mar 2025
Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
J. Huang
Baoxiong Jia
Yansen Wang
Ziyu Zhu
Xiongkun Linghu
Qing Li
Song-Chun Zhu
Siyuan Huang
87
3
0
28 Mar 2025
MonoInstance: Enhancing Monocular Priors via Multi-view Instance Alignment for Neural Rendering and Reconstruction
Wenyuan Zhang
Yixiao Yang
Han Huang
Liang Han
Kanle Shi
Yu-Shen Liu
Zhizhong Han
MDE
68
3
0
24 Mar 2025
MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion
Yikun Ma
Yiqing Li
Jiawei Wu
Xing Luo
Zhi Jin
DiffM
VGen
72
0
0
22 Mar 2025
Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding
Jinlong Li
Cristiano Saltori
Fabio Poiesi
N. Sebe
267
0
0
20 Mar 2025
SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes
Weixiao Gao
Liangliang Nan
H. Ledoux
3DV
3DPC
48
0
0
19 Mar 2025
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation
Haoyu Guo
He Zhu
Sida Peng
Haotong Lin
Yunzhi Yan
Tao Xie
Wenguan Wang
Xiaowei Zhou
Hujun Bao
3DV
MDE
87
1
0
18 Mar 2025
RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment
Chao Wang
Giulio Franzese
A. Finamore
Pietro Michiardi
69
0
0
18 Mar 2025
Less Biased Noise Scale Estimation for Threshold-Robust RANSAC
Johan Edstedt
66
0
0
17 Mar 2025
Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space
Weichen Zhang
Zile Zhou
Zhiheng Zheng
Chen Gao
Jinqiang Cui
Yongqian Li
Xinlei Chen
Xiao-Ping Zhang
LRM
68
1
0
14 Mar 2025
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
Hyeonho Jeong
Suhyeon Lee
Jong Chul Ye
VGen
265
0
0
12 Mar 2025
PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability
Weijie Zhou
Manli Tao
Chaoyang Zhao
Haiyun Guo
Honghui Dong
Ming Tang
Jinqiao Wang
51
1
0
11 Mar 2025
Infinite Leagues Under the Sea: Photorealistic 3D Underwater Terrain Generation by Latent Fractal Diffusion Models
Tianyi Zhang
Weiming Zhi
Joshua Mangelson
Matthew Johnson-Roberson
45
0
0
09 Mar 2025
PointDiffuse: A Dual-Conditional Diffusion Model for Enhanced Point Cloud Semantic Segmentation
Yong-xing He
Hongshan Yu
Mingtao Feng
Tongjia Chen
Zechuan Li
Anwaar Ulhaq
Saeed Anwar
Ajmal Mian
DiffM
81
0
0
08 Mar 2025
1
2
3
4
...
16
17
18
Next