Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.02145
Cited By
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
4 December 2023
B. Ke
Anton Obukhov
Shengyu Huang
Nando Metzger
Rodrigo Caye Daudt
Konrad Schindler
VLM
MDE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation"
45 / 45 papers shown
Title
VIN-NBV: A View Introspection Network for Next-Best-View Selection for Resource-Efficient 3D Reconstruction
Noah Frahm
Dongxu Zhao
Andrea Dunn Beltran
Ron Alterovitz
Jan-Michael Frahm
Junier Oliva
Roni Sengupta
137
0
0
09 May 2025
VGLD: Visually-Guided Linguistic Disambiguation for Monocular Depth Scale Recovery
Bojin Wu
Jing Chen
MDE
46
0
0
05 May 2025
Fast Flow-based Visuomotor Policies via Conditional Optimal Transport Couplings
Andreas Sochopoulos
Nikolay Malkin
Nikolaos Tsagkas
João Moura
Michael Gienger
S. Vijayakumar
41
1
0
02 May 2025
LiDAR-Guided Monocular 3D Object Detection for Long-Range Railway Monitoring
Raul David Dominguez Sanchez
Xavier Jair Diaz Ortiz
Xingcheng Zhou
M. Ronecker
Michael Karner
Daniel Watzenig
Alois C. Knoll
106
0
0
25 Apr 2025
DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks
Yinqi Li
Hong Chang
Ruibing Hou
Shiguang Shan
Xilin Chen
DiffM
55
0
0
24 Apr 2025
MonoTher-Depth: Enhancing Thermal Depth Estimation via Confidence-Aware Distillation
Xingxing Zuo
Nikhil Ranganathan
Connor T. Lee
Georgia Gkioxari
Soon-Jo Chung
VLM
58
1
0
21 Apr 2025
VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation
Mingxia Zhan
Li Zhang
Xiaomeng Chu
Beibei Wang
MDE
57
0
0
21 Apr 2025
DeepWheel: Generating a 3D Synthetic Wheel Dataset for Design and Performance Evaluation
Soyoung Yoo
Namwoo Kang
32
0
0
15 Apr 2025
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang
Xin Xu
Yu-Xiong Wang
DiffM
65
0
0
15 Apr 2025
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation
Jiantao Lin
Xin Yang
Meixi Chen
Yingjie Xu
D. Yan
Leyi Wu
Xinli Xu
Lie Xu
Shunsi Zhang
Ying-Cong Chen
58
1
0
03 Mar 2025
Revisiting Gradient-based Uncertainty for Monocular Depth Estimation
Julia Hornauer
Amir El-Ghoussani
Vasileios Belagiannis
UQCV
55
0
0
09 Feb 2025
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation
Chenguo Lin
Panwang Pan
Bangbang Yang
Zeming Li
Yadong Mu
3DGS
76
7
0
28 Jan 2025
Rethinking Encoder-Decoder Flow Through Shared Structures
Frederik Laboyrie
M. K. Yucel
Albert Saà-Garriga
AI4CE
45
0
0
24 Jan 2025
CheapNVS: Real-Time On-Device Narrow-Baseline Novel View Synthesis
K. Georgiadis
M. K. Yucel
Albert Saà-Garriga
ViT
52
1
0
24 Jan 2025
Survey on Monocular Metric Depth Estimation
Jiuling Zhang
VLM
69
0
0
21 Jan 2025
DPBridge: Latent Diffusion Bridge for Dense Prediction
Haorui Ji
Taojun Lin
Hongdong Li
DiffM
51
1
0
29 Dec 2024
IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations
Zhibing Li
Tong Wu
Jing Tan
Mengchen Zhang
Jiaqi Wang
D. Lin
99
1
0
16 Dec 2024
DepthSplat: Connecting Gaussian Splatting and Depth
Haofei Xu
Songyou Peng
Fangjinhua Wang
Hermann Blum
Dániel Baráth
Andreas Geiger
Marc Pollefeys
3DGS
MDE
50
29
0
17 Oct 2024
A Simple Approach to Unifying Diffusion-based Conditional Generation
Xirui Li
Charles Herrmann
Kelvin C.K. Chan
Yinxiao Li
Deqing Sun
Chao Ma
Ming Yang
DiffM
VLM
43
1
0
15 Oct 2024
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Ziyue Li
Tianyi Zhou
MoE
71
16
0
14 Oct 2024
High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity
Qian Yu
Peng-Tao Jiang
Hao Zhang
Jinwei Chen
Bo Li
Lihe Zhang
Huchuan Lu
47
2
0
14 Oct 2024
IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera
Jian Huang
Chengrui Dong
Peidong Liu
Peidong Liu
3DGS
29
2
0
10 Oct 2024
Diffusion Models in 3D Vision: A Survey
Zhen Wang
Dongyuan Li
Renhe Jiang
Tianyu He
Jiang Bian
Renhe Jiang
MedIm
66
4
0
07 Oct 2024
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion
Junyi Zhang
Charles Herrmann
Junhwa Hur
Varun Jampani
Trevor Darrell
Forrester Cole
Deqing Sun
Ming Yang
VGen
86
70
0
04 Oct 2024
RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Ziyao Zeng
Yangchao Wu
Hyoungseob Park
Daniel Wang
Fengyu Yang
Stefano Soatto
Dong Lao
Byung-Woo Hong
Alex Wong
MDE
20
7
0
03 Oct 2024
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He
Haodong Li
Wei Yin
Yixun Liang
Leheng Li
Kaiqiang Zhou
Hongbo Zhang
Bingbing Liu
Ying-Cong Chen
DiffM
VLM
46
40
0
26 Sep 2024
FisheyeDepth: A Real Scale Self-Supervised Depth Estimation Model for Fisheye Camera
Guoyang Zhao
Yuxuan Liu
Weiqing Qi
Fulong Ma
Ming Liu
Jun Ma
MDE
32
0
0
23 Sep 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
57
10
0
23 Sep 2024
SteeredMarigold: Steering Diffusion Towards Depth Completion of Largely Incomplete Depth Maps
Jakub Gregorek
Lazaros Nalpantidis
3DGS
38
3
0
16 Sep 2024
EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation
Nischal Khanal
Shivanand Venkanna Sheshappanavar
MDE
42
0
0
10 Sep 2024
LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors
Hanyang Yu
Xiaoxiao Long
Ping Tan
3DGS
31
4
0
05 Sep 2024
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Dongyang Liu
Shitian Zhao
Le Zhuo
Weifeng Lin
Yu Qiao
Xinyue Li
Qi Qin
Yu Qiao
Hongsheng Li
Peng Gao
MLLM
67
48
0
05 Aug 2024
Gaussian Splatting Lucas-Kanade
Liuyue Xie
Joel Julin
Koichiro Niinuma
László A. Jeni
3DGS
38
2
0
16 Jul 2024
DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
Chang-Han Yeh
Chin-Yang Lin
Zhixiang Wang
Chi-Wei Hsiao
Ting-Hsuan Chen
Hau-Shiang Shiu
Yu-Lun Liu
VGen
DiffM
57
5
0
01 Jul 2024
DICE: End-to-end Deformation Capture of Hand-Face Interactions from a Single Image
Qingxuan Wu
Zhiyang Dou
Sirui Xu
Soshi Shimada
Chen Wang
...
Taku Komura
Vladislav Golyanik
Christian Theobalt
Wenping Wang
Lingjie Liu
CVBM
3DH
68
4
0
26 Jun 2024
Fast Encoder-Based 3D from Casual Videos via Point Track Processing
Yoni Kasten
Wuyue Lu
Haggai Maron
3DPC
27
2
0
10 Apr 2024
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation
Mu Hu
Wei Yin
C. Zhang
Zhipeng Cai
Xiaoxiao Long
Kaixuan Wang
Kaixuan Wang
Gang Yu
Chunhua Shen
Shaojie Shen
3DGS
54
115
0
22 Mar 2024
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image
Wei Yin
Chi Zhang
Hao Chen
Zhipeng Cai
Gang Yu
Kaixuan Wang
Xiaozhi Chen
Chunhua Shen
MDE
131
174
0
20 Jul 2023
Neural LiDAR Fields for Novel View Synthesis
S. Huang
Zan Gojcic
Zian Wang
Francis Williams
Yoni Kasten
Sanja Fidler
Konrad Schindler
Or Litany
72
46
0
02 May 2023
Visual-Language Prompt Tuning with Knowledge-guided Context Optimization
Hantao Yao
Rui Zhang
Changsheng Xu
VLM
VPVLM
127
200
0
23 Mar 2023
DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation
Yiqun Duan
Xianda Guo
Zhengbiao Zhu
DiffM
MDE
48
68
0
09 Mar 2023
Unleashing Text-to-Image Diffusion Models for Visual Perception
Wenliang Zhao
Yongming Rao
Zuyan Liu
Benlin Liu
Jie Zhou
Jiwen Lu
ObjD
VLM
MDE
160
214
0
03 Mar 2023
Palette: Image-to-Image Diffusion Models
Chitwan Saharia
William Chan
Huiwen Chang
Chris A. Lee
Jonathan Ho
Tim Salimans
David J. Fleet
Mohammad Norouzi
DiffM
VLM
342
1,588
0
10 Nov 2021
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling
Renrui Zhang
Rongyao Fang
Wei Zhang
Peng Gao
Kunchang Li
Jifeng Dai
Yu Qiao
Hongsheng Li
VLM
189
385
0
06 Nov 2021
Deep Ordinal Regression Network for Monocular Depth Estimation
Huan Fu
Mingming Gong
Chaohui Wang
Kayhan Batmanghelich
Dacheng Tao
MDE
185
1,707
0
06 Jun 2018
1