Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.23400
Cited By
Bridging Geometric and Semantic Foundation Models for Generalized Monocular Depth Estimation
29 May 2025
Sanggyun Ma
Wonjoon Choi
Jihun Park
Jaeyeul Kim
Seunghun Lee
Jiwan Seo
S. Im
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Bridging Geometric and Semantic Foundation Models for Generalized Monocular Depth Estimation"
40 / 40 papers shown
Title
BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation
Kieran Saunders
Luis J. Manso
George Vogiatzis
MDE
47
2
0
29 Jul 2024
Depth Anything V2
Lihe Yang
Bingyi Kang
Zilong Huang
Zhen Zhao
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
DiffM
VLM
MDE
104
406
0
13 Jun 2024
UniDepth: Universal Monocular Metric Depth Estimation
Luigi Piccinelli
Yung-Hsu Yang
Daniel Gehrig
Mattia Segu
Siyuan Li
Luc Van Gool
Fisher Yu
VLM
MDE
130
140
0
27 Mar 2024
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation
Mu Hu
Wei Yin
C. Zhang
Zhipeng Cai
Xiaoxiao Long
Kaixuan Wang
Kaixuan Wang
Gang Yu
Chunhua Shen
Shaojie Shen
3DGS
251
132
0
22 Mar 2024
Hybridnet for depth estimation and semantic segmentation
Dalila Sánchez-Escobedo
Xiao Lin
J. Casas
M. Pardàs
SSeg
MDE
82
9
0
09 Feb 2024
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Lihe Yang
Bingyi Kang
Zilong Huang
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
VLM
198
788
0
19 Jan 2024
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Bingxin Ke
Anton Obukhov
Shengyu Huang
Nando Metzger
Rodrigo Caye Daudt
Konrad Schindler
VLM
MDE
90
159
0
04 Dec 2023
Joint Depth Prediction and Semantic Segmentation with Multi-View SAM
Mykhailo Shvets
Dongxu Zhao
Marc Niethammer
Roni Sengupta
Alexander C. Berg
MDE
56
9
0
31 Oct 2023
X-PDNet: Accurate Joint Plane Instance Segmentation and Monocular Depth Estimation with Cross-Task Distillation and Boundary Correction
Cao Dinh Duc
Jongwoo Lim
41
2
0
15 Sep 2023
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image
Wei Yin
Chi Zhang
Hao Chen
Zhipeng Cai
Gang Yu
Kaixuan Wang
Xiaozhi Chen
Chunhua Shen
MDE
170
187
0
20 Jul 2023
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
...
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLM
CLIP
SSL
299
3,383
0
14 Apr 2023
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
313
7,274
0
05 Apr 2023
ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth
S. Bhat
R. Birkl
Diana Wofk
Peter Wonka
Matthias Müller
VLM
MDE
152
501
0
23 Feb 2023
ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient Self-Supervised Monocular Depth Estimation
Daitao Xing
Jinglin Shen
C. Ho
Anthony Tzes
ViT
MDE
49
6
0
12 Dec 2022
Self-Supervised Monocular Depth Estimation: Solving the Edge-Fattening Problem
Xingyu Chen
Ruonan Zhang
Ji Jiang
Yan Wang
Ge Li
Thomas H. Li
MQ
MDE
70
29
0
02 Oct 2022
MonoViT: Self-Supervised Monocular Depth Estimation with a Vision Transformer
Chaoqiang Zhao
Youming Zhang
Matteo Poggi
Fabio Tosi
Xianda Guo
Zheng Zhu
Guan Huang
Yang Tang
S. Mattoccia
ViT
MDE
60
181
0
06 Aug 2022
Deep Digging into the Generalization of Self-Supervised Monocular Depth Estimation
Ji-Hoon Bae
Sungho Moon
Sunghoon Im
MDE
51
88
0
23 May 2022
NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation
Weihao Yuan
Xiaodong Gu
Zuozhuo Dai
Siyu Zhu
Ping Tan
59
179
0
03 Mar 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
393
15,486
0
20 Dec 2021
SUB-Depth: Self-distillation and Uncertainty Boosting Self-supervised Monocular Depth Estimation
Hang Zhou
Sarah Taylor
David Greenwood
Michal Mackiewicz
UQCV
MDE
45
11
0
18 Nov 2021
Vision Transformers for Dense Prediction
René Ranftl
Alexey Bochkovskiy
V. Koltun
ViT
MDE
130
1,729
0
24 Mar 2021
Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction
Guanglei Yang
Hao Tang
M. Ding
N. Sebe
Elisa Ricci
ViT
69
190
0
22 Mar 2021
Learning Depth via Leveraging Semantics: Self-supervised Monocular Depth Estimation with Both Implicit and Explicit Semantic Guidance
Rui Li
Xiantuo He
Danna Xue
Shaolin Su
Qing Mao
Yu Zhu
Jinqiu Sun
Yanning Zhang
SSL
MDE
85
30
0
11 Feb 2021
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding
Mike Roberts
Jason Ramapuram
Anurag Ranjan
Atulit Kumar
Miguel Angel Bautista
Nathan Paczan
Russ Webb
Joshua M. Susskind
104
386
0
04 Nov 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
601
40,961
0
22 Oct 2020
OASIS: A Large-Scale Dataset for Single Image 3D in the Wild
Weifeng Chen
Shengyi Qian
David Fan
Noriyuki Kojima
Max Hamilton
Jia Deng
3DV
59
62
0
26 Jul 2020
Self-Supervised Monocular Depth Estimation: Solving the Dynamic Object Problem by Semantic Guidance
Marvin Klingner
Jan-Aike Termöhlen
Jonas Mikolajczyk
Tim Fingscheidt
MDE
108
321
0
14 Jul 2020
TartanAir: A Dataset to Push the Limits of Visual SLAM
Wenshan Wang
Delong Zhu
Xiangwei Wang
Yaoyu Hu
Yuheng Qiu
Chen Wang
Yafei Hu
Ashish Kapoor
Sebastian Scherer
51
384
0
31 Mar 2020
Virtual KITTI 2
Yohann Cabon
Naila Murray
Martin Humenberger
3DPC
63
286
0
29 Jan 2020
BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Networks
Yao Yao
Zixin Luo
Shiwei Li
Jingyang Zhang
Yufan Ren
Lei Zhou
Tian Fang
Long Quan
3DV
104
471
0
22 Nov 2019
DIODE: A Dense Indoor and Outdoor DEpth Dataset
Igor Vasiljevic
Nicholas I. Kolkin
Shanyi Zhang
Ruotian Luo
Haochen Wang
...
Andrea F. Daniele
Mohammadreza Mostajabi
Steven Basart
Matthew R. Walter
Gregory Shakhnarovich
MDE
3DV
69
232
0
01 Aug 2019
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer
René Ranftl
Katrin Lasinger
David Hafner
Konrad Schindler
V. Koltun
MDE
199
1,789
0
02 Jul 2019
3D Packing for Self-Supervised Monocular Depth Estimation
Vitor Campagnolo Guizilini
Rares Andrei Ambrus
Sudeep Pillai
Allan Raventos
Adrien Gaidon
SSL
3DPC
MDE
79
647
0
06 May 2019
Geometry meets semantics for semi-supervised monocular depth estimation
Pierluigi Zama Ramirez
Matteo Poggi
Fabio Tosi
S. Mattoccia
Luigi Di Stefano
MDE
70
113
0
09 Oct 2018
Monocular Depth Estimation using Multi-Scale Continuous CRFs as Sequential Deep Networks
Dan Xu
Elisa Ricci
Wanli Ouyang
Xiaogang Wang
N. Sebe
MDE
67
100
0
01 Mar 2018
Unsupervised Learning of Depth and Ego-Motion from Video
Tinghui Zhou
Matthew A. Brown
Noah Snavely
D. Lowe
MDE
125
2,574
0
25 Apr 2017
End-to-End Learning of Geometry and Context for Deep Stereo Regression
Alex Kendall
H. Martirosyan
Saumitro Dasgupta
Peter Henry
Ryan Kennedy
Abraham Bachrach
Adam Bry
3DV
3DPC
MDE
83
1,332
0
13 Mar 2017
Deeper Depth Prediction with Fully Convolutional Residual Networks
Iro Laina
Christian Rupprecht
Vasileios Belagiannis
Federico Tombari
Nassir Navab
3DV
MDE
406
1,826
0
01 Jun 2016
Single-Image Depth Perception in the Wild
Weifeng Chen
Z. Fu
Dawei Yang
Jia Deng
MDE
100
519
0
13 Apr 2016
Depth Map Prediction from a Single Image using a Multi-Scale Deep Network
David Eigen
Christian Puhrsch
Rob Fergus
MDE
3DPC
3DV
214
4,054
0
09 Jun 2014
1