ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1702.04405
  4. Cited By
ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
v1v2 (latest)

ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes

14 February 2017
Angela Dai
Angel X. Chang
Manolis Savva
Maciej Halber
Thomas Funkhouser
Matthias Nießner
    3DPC3DV
ArXiv (abs)PDFHTML

Papers citing "ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes"

50 / 2,387 papers shown
Title
Empowering Large Language Models with 3D Situation Awareness
Empowering Large Language Models with 3D Situation Awareness
Zhihao Yuan
Yibo Peng
Jinke Ren
Yinghong Liao
Yatong Han
Chun-Mei Feng
Hengshuang Zhao
G. Li
Shuguang Cui
Zhen Li
127
0
0
29 Mar 2025
From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3D
From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3D
Jiahui Zhang
Yurui Chen
Yanpeng Zhou
Yueming Xu
Ze Huang
...
Xinyue Cai
G. Huang
Xingyue Quan
Hang Xu
Li Zhang
LRM
186
2
0
29 Mar 2025
Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization
Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization
Iñigo Pikabea
Iñaki Lacunza
Oriol Pareras
Carlos Escolano
Aitor Gonzalez-Agirre
Javier Hernando
Marta Villegas
VLM
203
1
0
28 Mar 2025
Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
J. Huang
Baoxiong Jia
Yansen Wang
Ziyu Zhu
Xiongkun Linghu
Qing Li
Song-Chun Zhu
Siyuan Huang
175
5
0
28 Mar 2025
Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces
Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces
Wonhyeok Choi
K. Hwang
Minwoo Choi
Kiljoon Han
Wonjoon Choi
Mingyu Shin
S. Im
MDE
96
0
0
28 Mar 2025
MVSAnywhere: Zero-Shot Multi-View Stereo
MVSAnywhere: Zero-Shot Multi-View Stereo
Sergio Izquierdo
Mohamed Sayed
Michael Firman
Guillermo Garcia-Hernando
Daniyar Turmukhambetov
Javier Civera
Oisin Mac Aodha
Gabriel J. Brostow
Jamie Watson
3DV
125
4
0
28 Mar 2025
Can Video Diffusion Model Reconstruct 4D Geometry?
Can Video Diffusion Model Reconstruct 4D Geometry?
Jinjie Mai
Wenxuan Zhu
Haozhe Liu
Bing Li
Cheng Zheng
Jürgen Schmidhuber
Bernard Ghanem
VGenMDE
157
0
0
27 Mar 2025
HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM
HS-SLAM: Hybrid Representation with Structural Supervision for Improved Dense SLAM
Ziren Gong
Fabio Tosi
Youmin Zhang
S. Mattoccia
Matteo Poggi
98
0
0
27 Mar 2025
STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM
STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM
Yanjie Wang
Xu Cao
Weiyun Yi
Zhaoxin Fan
74
0
0
27 Mar 2025
UFM: Unified Feature Matching Pre-training with Multi-Modal Image Assistants
UFM: Unified Feature Matching Pre-training with Multi-Modal Image Assistants
Yide Di
Yun Liao
Hao Zhou
Kaijun Zhu
Qing Duan
Junhui Liu
Mingyu Lu
61
0
0
26 Mar 2025
MMGen: Unified Multi-modal Image Generation and Understanding in One Go
MMGen: Unified Multi-modal Image Generation and Understanding in One Go
Jiepeng Wang
Zhaoqing Wang
H. Pan
Yuan Liu
Dongdong Yu
Changhu Wang
Wenping Wang
DiffM
145
1
0
26 Mar 2025
GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection
GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection
Xingyu Peng
Si Liu
Chen Gao
Yan Bai
Beipeng Mu
Xiaofei Wang
Huaxia Xia
123
0
0
26 Mar 2025
Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better
Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better
Zihang Lai
Andrea Vedaldi
71
1
0
25 Mar 2025
Learning Scene-Level Signed Directional Distance Function with Ellipsoidal Priors and Neural Residuals
Learning Scene-Level Signed Directional Distance Function with Ellipsoidal Priors and Neural Residuals
Zhirui Dai
Hojoon Shin
Yulun Tian
Ki Myung Brian Lee
Nikolay Atanasov
73
0
0
25 Mar 2025
Vanishing Depth: A Depth Adapter with Positional Depth Encoding for Generalized Image Encoders
Vanishing Depth: A Depth Adapter with Positional Depth Encoding for Generalized Image Encoders
Paul Koch
Jörg Krüger
Ankit Chowdhury
O. Heimann
MDE
99
0
0
25 Mar 2025
PAVE: Patching and Adapting Video Large Language Models
PAVE: Patching and Adapting Video Large Language Models
Zhuoming Liu
Yiquan Li
Khoi Duc Nguyen
Yiwu Zhong
Yin Li
KELMLRM
134
1
0
25 Mar 2025
Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models
Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models
Sangwon Beak
Hyeonwoo Kim
Hanbyul Joo
106
0
0
25 Mar 2025
OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations
OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations
Christina Kassab
Sacha Morin
Martin Buchner
Matías Mattamala
Kumaraditya Gupta
Abhinav Valada
Liam Paull
Maurice F. Fallon
3DVELM
72
0
0
25 Mar 2025
MonoInstance: Enhancing Monocular Priors via Multi-view Instance Alignment for Neural Rendering and Reconstruction
MonoInstance: Enhancing Monocular Priors via Multi-view Instance Alignment for Neural Rendering and Reconstruction
Wenyuan Zhang
Yixiao Yang
Han Huang
Liang Han
Kanle Shi
Yu-Shen Liu
Zhizhong Han
MDE
143
3
0
24 Mar 2025
Good Keypoints for the Two-View Geometry Estimation Problem
Good Keypoints for the Two-View Geometry Estimation Problem
Konstantin Pakulev
Alexander Vakhitov
Gonzalo Ferrer
70
0
0
24 Mar 2025
Aether: Geometric-Aware Unified World Modeling
Aether: Geometric-Aware Unified World Modeling
Aether Team
Haoyi Zhu
Yanjie Wang
Jianjun Zhou
Wenzheng Chang
...
Zizun Li
Junyi Chen
Chunhua Shen
Jiangmiao Pang
Tong He
DiffMVGen
121
9
0
24 Mar 2025
NeRFPrior: Learning Neural Radiance Field as a Prior for Indoor Scene Reconstruction
NeRFPrior: Learning Neural Radiance Field as a Prior for Indoor Scene Reconstruction
Wenyuan Zhang
Emily Yue-ting Jia
Junsheng Zhou
Baorui Ma
Kanle Shi
Yu-Shen Liu
Zhizhong Han
175
3
0
24 Mar 2025
Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces
Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces
Chenyangguang Zhang
Alexandros Delitzas
Fangjinhua Wang
Ruida Zhang
Xiangyang Ji
Marc Pollefeys
Francis Engelmann
3DV3DPC
134
4
0
24 Mar 2025
PanopticSplatting: End-to-End Panoptic Gaussian Splatting
PanopticSplatting: End-to-End Panoptic Gaussian Splatting
Yuxuan Xie
Xuan Yu
Changjian Jiang
Sitong Mao
Shunbo Zhou
Rui Fan
R. Xiong
Yansen Wang
3DGS
78
1
0
23 Mar 2025
MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation
MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation
Jiaxin Huang
Runnan Chen
Ziwen Li
Zhengqing Gao
Xiao He
Yandong Guo
Mingming Gong
Tongliang Liu
LRM
106
1
0
23 Mar 2025
SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining
SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining
Yue Li
Qi Ma
Runyi Yang
Huapeng Li
Mengjiao Ma
...
E. Konukoglu
Theo Gevers
Luc Van Gool
Martin R. Oswald
Danda Pani Paudel
3DGSVLM
235
2
0
23 Mar 2025
PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding
PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding
Hongjia Zhai
Haoyang Li
Zhenzhe Li
Xiaokun Pan
Yijia He
Guofeng Zhang
93
0
0
23 Mar 2025
Unified Geometry and Color Compression Framework for Point Clouds via Generative Diffusion Priors
Unified Geometry and Color Compression Framework for Point Clouds via Generative Diffusion Priors
Tianxin Huang
Gim Hee Lee
85
0
0
23 Mar 2025
MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion
MotionDiff: Training-free Zero-shot Interactive Motion Editing via Flow-assisted Multi-view Diffusion
Yikun Ma
Yiqing Li
Jiawei Wu
Xing Luo
Zhi Jin
DiffMVGen
150
0
0
22 Mar 2025
Distilling Monocular Foundation Model for Fine-grained Depth Completion
Distilling Monocular Foundation Model for Fine-grained Depth Completion
Yingping Liang
Yutao Hu
Wenqi Shao
Ying Fu
MDE
101
1
0
21 Mar 2025
ExCap3D: Expressive 3D Scene Understanding via Object Captioning with Varying Detail
ExCap3D: Expressive 3D Scene Understanding via Object Captioning with Varying Detail
Chandan Yeshwanth
Dávid Rozenberszki
Angela Dai
145
0
0
21 Mar 2025
Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors
Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors
Wonbong Jang
Philippe Weinzaepfel
Vincent Leroy
Lourdes Agapito
Jérôme Revaud
106
5
0
21 Mar 2025
Enhancing Steering Estimation with Semantic-Aware GNNs
Enhancing Steering Estimation with Semantic-Aware GNNs
Fouad Makiyeh
Huy-Dung Nguyen
Patrick Chareyre
Ramin Hasani
Marc Blanchon
Daniela Rus
3DPC
95
0
0
21 Mar 2025
OffsetOPT: Explicit Surface Reconstruction without Normals
OffsetOPT: Explicit Surface Reconstruction without Normals
Huan Lei
3DPC
137
0
0
20 Mar 2025
IRef-VLA: A Benchmark for Interactive Referential Grounding with Imperfect Language in 3D Scenes
IRef-VLA: A Benchmark for Interactive Referential Grounding with Imperfect Language in 3D Scenes
Haochen Zhang
Nader Zantout
Pujith Kachana
Ji Zhang
Wenshan Wang
VGen
83
0
0
20 Mar 2025
MapGlue: Multimodal Remote Sensing Image Matching
MapGlue: Multimodal Remote Sensing Image Matching
Peihao Wu
Yongxiang Yao
Wenfei Zhang
Dong Wei
Y. Wan
Yansheng Li
Yongjun Zhang
78
0
0
20 Mar 2025
Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding
Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding
Jinlong Li
Cristiano Saltori
Fabio Poiesi
N. Sebe
494
2
0
20 Mar 2025
Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection
Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection
Jiangyi Wang
Na Zhao
125
0
0
20 Mar 2025
UniK3D: Universal Camera Monocular 3D Estimation
UniK3D: Universal Camera Monocular 3D Estimation
Luigi Piccinelli
Daniel Gehrig
Mattia Segu
Yifan Yang
Siyuan Li
Wim Abbeloos
Luc Van Gool
MDE
88
1
0
20 Mar 2025
Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model
Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model
Zhaochong An
Guolei Sun
Yun Liu
Runjia Li
Junlin Han
Ender Konukoglu
Serge Belongie
VLM
169
2
0
20 Mar 2025
GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector
GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector
Zechuan Li
Hongshan Yu
Yihao Ding
Jinhao Qiao
Basim Azam
Naveed Akhtar
3DPC
126
0
0
19 Mar 2025
Deep Polycuboid Fitting for Compact 3D Representation of Indoor Scenes
Deep Polycuboid Fitting for Compact 3D Representation of Indoor Scenes
Gahye Lee
Hyejeong Yoon
Jungeon Kim
Seungyong Lee
3DPC3DV
95
0
0
19 Mar 2025
Universal Scene Graph Generation
Universal Scene Graph Generation
Shengqiong Wu
Hao Fei
Tat-Seng Chua
139
0
0
19 Mar 2025
Object-Centric Pretraining via Target Encoder Bootstrapping
Object-Centric Pretraining via Target Encoder Bootstrapping
Nikola Đukić
Tim Lebailly
Tinne Tuytelaars
OCL
129
0
0
19 Mar 2025
SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes
SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes
Weixiao Gao
Liangliang Nan
H. Ledoux
3DV3DPC
73
0
0
19 Mar 2025
RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment
RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment
Chao Wang
Giulio Franzese
A. Finamore
Pietro Michiardi
244
0
0
18 Mar 2025
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation
Haoyu Guo
He Zhu
Sida Peng
Haotong Lin
Yunzhi Yan
Tao Xie
Wenguan Wang
Xiaowei Zhou
Hujun Bao
3DVMDE
122
1
0
18 Mar 2025
State Space Model Meets Transformer: A New Paradigm for 3D Object Detection
State Space Model Meets Transformer: A New Paradigm for 3D Object Detection
Chuxin Wang
Wenfei Yang
Xiang Liu
Tianzhu Zhang
137
1
0
18 Mar 2025
Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting
Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting
Runsong Zhu
Shi Qiu
Zhengzhe Liu
Ka-Hei Hui
Qianyi Wu
Pheng Ann Heng
Chi-Wing Fu
3DGS3DV
128
2
0
18 Mar 2025
MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs
MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs
Erik Daxberger
Nina Wenzel
David Griffiths
Haiming Gang
Justin Lazarow
...
Kai Kang
Marcin Eichner
Yue Yang
Afshin Dehghan
Peter Grasch
126
5
0
17 Mar 2025
Previous
12345...464748
Next