Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.02523
Cited By
v1
v2
v3
v4
v5 (latest)
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding
4 November 2020
Mike Roberts
Jason Ramapuram
Anurag Ranjan
Atulit Kumar
Miguel Angel Bautista
Nathan Paczan
Russ Webb
Joshua M. Susskind
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding"
50 / 113 papers shown
Title
DepthART: Monocular Depth Estimation as Autoregressive Refinement Task
Bulat Gabdullin
Nina Konovalova
Nikolay Patakin
Dmitry Senushkin
Anton Konushin
MDE
66
1
0
01 Jul 2025
DiFuse-Net: RGB and Dual-Pixel Depth Estimation using Window Bi-directional Parallax Attention and Cross-modal Transfer Learning
Kunal Swami
Debtanu Gupta
A. Muduli
Chirag Jaiswal
Pankaj Bajpai
MDE
17
0
0
17 Jun 2025
TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented Contrast
Beilei Cui
Yiming Huang
Long Bai
Hongliang Ren
15
0
0
16 Jun 2025
The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images without Any 3D Knowledge
Haoru Wang
Kai Ye
Yangyan Li
Wenzheng Chen
Baoquan Chen
62
0
0
11 Jun 2025
StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning from Partially Annotated Synthetic Datasets
Anh-Quan Cao
Ivan Lopes
Raoul de Charette
16
0
0
09 Jun 2025
Perfecting Depth: Uncertainty-Aware Enhancement of Metric Depth
Jinyoung Jun
Lei Chu
Jiahao Li
Yan Lu
Chang-Su Kim
MDE
123
0
0
05 Jun 2025
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
Yang-tian Sun
Xin Yu
Zehuan Huang
Yi-Hua Huang
Yuan-Chen Guo
Ziyi Yang
Yan-Pei Cao
Xiaojuan Qi
DiffM
VGen
MDE
36
1
0
30 May 2025
Bridging Geometric and Semantic Foundation Models for Generalized Monocular Depth Estimation
Sanggyun Ma
Wonjoon Choi
Jihun Park
Jaeyeul Kim
Seunghun Lee
Jiwan Seo
S. Im
66
0
0
29 May 2025
JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers
Kwon Byung-Ki
Qi Dai
Lee Hyoseok
Chong Luo
Tae-Hyun Oh
163
0
0
01 May 2025
SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models
Wufei Ma
Luoxin Ye
Nessa McWeeney
Celso M de Melo
Jieneng Chen
LRM
118
1
0
01 May 2025
The Fourth Monocular Depth Estimation Challenge
Anton Obukhov
Matteo Poggi
Fabio Tosi
Ripudaman Singh Arora
Jaime Spencer
...
Tuan-Anh Yang
Minh-Quang Nguyen
T. Tran
Albert Luginov
Muhammad Shahzad
MDE
444
1
0
24 Apr 2025
VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation
Mingxia Zhan
Li Zhang
Xiaomeng Chu
Beibei Wang
MDE
207
0
0
21 Apr 2025
PRISM: A Unified Framework for Photorealistic Reconstruction and Intrinsic Scene Modeling
Alara Dirik
Tuanfeng Y. Wang
Duygu Ceylan
Stefanos Zafeiriou
Anna Frühstück
DiffM
83
0
0
19 Apr 2025
Leveraging Automatic CAD Annotations for Supervised Learning in 3D Scene Understanding
Yuchen Rao
Stefan Ainetter
Sinisa Stekovic
Vincent Lepetit
Friedrich Fraundorfer
3DPC
3DV
512
0
0
18 Apr 2025
Metric-Solver: Sliding Anchored Metric Depth Estimation from a Single Image
Tao Wen
Jiadong Wang
Yuxiao Chen
Shugong Xu
Chi Zhang
Xuelong Li
MDE
116
0
0
16 Apr 2025
Recent Advance in 3D Object and Scene Generation: A Survey
Xiang Tang
Ruotong Li
Xiaopeng Fan
147
0
0
16 Apr 2025
GATE3D: Generalized Attention-based Task-synergized Estimation in 3D*
Eunsoo Im
Jung Kwon Lee
Changhyun Jee
166
0
0
15 Apr 2025
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang
Xin Xu
Yu-Xiong Wang
DiffM
193
0
0
15 Apr 2025
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution
Gene Chou
Wenqi Xian
Guandao Yang
Mohamed Abdelfattah
Bharath Hariharan
Noah Snavely
Ning Yu
P. Debevec
MDE
118
0
0
09 Apr 2025
MVSAnywhere: Zero-Shot Multi-View Stereo
Sergio Izquierdo
Mohamed Sayed
Michael Firman
Guillermo Garcia-Hernando
Daniyar Turmukhambetov
Javier Civera
Oisin Mac Aodha
Gabriel J. Brostow
Jamie Watson
3DV
117
4
0
28 Mar 2025
SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining
Yue Li
Qi Ma
Runyi Yang
Huapeng Li
Mengjiao Ma
...
E. Konukoglu
Theo Gevers
Luc Van Gool
Martin R. Oswald
Danda Pani Paudel
3DGS
VLM
232
2
0
23 Mar 2025
SAM2 for Image and Video Segmentation: A Comprehensive Survey
Zhang Jiaxing
Tang Hao
VLM
104
0
0
17 Mar 2025
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
Tsu-Jui Fu
Yusu Qian
Chen Chen
Wenze Hu
Zhe Gan
Yue Yang
219
2
0
16 Mar 2025
Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation
Hongyu Wen
Yiming Zuo
Venkat Subramanian
Patrick Chen
Jia Deng
3DV
165
0
0
14 Mar 2025
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors
Fanqi Pu
Yifan Wang
Jiru Deng
Wenming Yang
MDE
ViT
175
3
0
13 Mar 2025
DuCos: Duality Constrained Depth Super-Resolution via Foundation Model
Zhiqiang Yan
Zhengxue Wang
Haoye Dong
Jun Yu Li
Jian Yang
Gim Hee Lee
136
0
0
06 Mar 2025
Matrix3D: Large Photogrammetry Model All-in-One
Yuanxun Lu
Jingyang Zhang
Tian Fang
Jean-Daniel Nahmias
Yanghai Tsin
Long Quan
Xun Cao
Yao Yao
Shiwei Li
203
6
0
11 Feb 2025
Rethinking Encoder-Decoder Flow Through Shared Structures
Frederik Laboyrie
M. K. Yucel
Albert Saà-Garriga
AI4CE
70
0
0
24 Jan 2025
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Jianing Yang
Alexander Sax
Kevin J. Liang
Mikael Henaff
Hao Tang
Ang Cao
J. Chai
Franziska Meier
Matt Feiszli
3DGS
189
31
0
23 Jan 2025
PatchRefiner V2: Fast and Lightweight Real-Domain High-Resolution Metric Depth Estimation
Zhenyu Li
Wenqing Cui
S. Bhat
Peter Wonka
MDE
122
0
0
03 Jan 2025
DPBridge: Latent Diffusion Bridge for Dense Prediction
Haorui Ji
Taojun Lin
Hongdong Li
DiffM
295
1
0
29 Dec 2024
MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data
Hanwen Jiang
Zexiang Xu
Desai Xie
Zheyu Chen
Haian Jin
...
Xin Sun
Jiuxiang Gu
Qixing Huang
Georgios Pavlakos
Hao Tan
468
4
0
18 Dec 2024
FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation
Yunpeng Bai
Qixing Huang
DiffM
163
0
0
01 Dec 2024
OMNI-DC: Highly Robust Depth Completion with Multiresolution Depth Integration
Yiming Zuo
Willow Yang
Zeyu Ma
Jia Deng
MDE
148
2
0
28 Nov 2024
One Diffusion to Generate Them All
Duong H. Le
Tuan Pham
Sangho Lee
Christopher Clark
Aniruddha Kembhavi
Stephan Mandt
Ranjay Krishna
Jiasen Lu
VLM
160
9
0
25 Nov 2024
PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
Ziyao Zeng
Jingcheng Ni
Daniel Wang
Patrick Rim
Younjoon Chung
Fengyu Yang
Byung-Woo Hong
A. Wong
DiffM
MDE
277
2
0
24 Nov 2024
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
Ruicheng Wang
Sicheng Xu
Cassie Dai
Jianfeng Xiang
Yu Deng
Xin Tong
Jiaolong Yang
TPM
3DH
MDE
186
39
0
24 Oct 2024
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Ziyue Li
Dinesh Manocha
MoE
153
19
0
14 Oct 2024
SceneCraft: Layout-Guided 3D Scene Generation
Xiuyu Yang
Yunze Man
Jun-Kun Chen
Yu-Xiong Wang
3DV
176
9
0
11 Oct 2024
ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion
Zitian Zhang
Frédéric Fortier-Chouinard
Mathieu Garon
Anand Bhattad
Jean-François Lalonde
DiffM
130
4
0
10 Oct 2024
SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
Haoyi Zhu
Honghui Yang
Yating Wang
Jiange Yang
Limin Wang
Tong He
3DH
115
9
0
10 Oct 2024
Diffusion Models in 3D Vision: A Survey
Zhen Wang
Dongyuan Li
Xue Liu
Tianyu He
Jiang Bian
Renhe Jiang
MedIm
243
4
0
07 Oct 2024
Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection
Hongru Yan
Yu Zheng
Yueqi Duan
3DGS
141
2
0
02 Oct 2024
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He
Haodong Li
Wei Yin
Yixun Liang
Leheng Li
Kaiqiang Zhou
Hongbo Zhang
Bingbing Liu
Ying-Cong Chen
DiffM
VLM
207
55
0
26 Sep 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
184
14
0
23 Sep 2024
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
Gonzalo Martin Garcia
Karim Abou Zeid
Christian Schmidt
Daan de Geus
Alexander Hermans
Bastian Leibe
131
33
0
17 Sep 2024
SteeredMarigold: Steering Diffusion Towards Depth Completion of Largely Incomplete Depth Maps
Jakub Gregorek
Lazaros Nalpantidis
3DGS
121
4
0
16 Sep 2024
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
Yunze Man
Shuhong Zheng
Zhipeng Bao
M. Hebert
Liang-Yan Gui
Yu-Xiong Wang
138
23
0
05 Sep 2024
Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation
Sai Prasanna
Daniel Honerkamp
Kshitij Sirohi
Tim Welschehold
Wolfram Burgard
Abhinav Valada
123
1
0
05 Aug 2024
LRM-Zero: Training Large Reconstruction Models with Synthesized Data
Desai Xie
Sai Bi
Zhixin Shu
Kai Zhang
Zexiang Xu
Yi Zhou
Soren Pirk
Arie E. Kaufman
Xin Sun
Hao Tan
SyDa
105
17
0
13 Jun 2024
1
2
3
Next