ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2404.15506
  4. Cited By
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation

Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation

22 March 2024
Mu Hu
Wei Yin
C. Zhang
Zhipeng Cai
Xiaoxiao Long
Kaixuan Wang
Kaixuan Wang
Gang Yu
Chunhua Shen
Shaojie Shen
    3DGS
ArXivPDFHTML

Papers citing "Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation"

50 / 191 papers shown
Title
SolidGS: Consolidating Gaussian Surfel Splatting for Sparse-View Surface
  Reconstruction
SolidGS: Consolidating Gaussian Surfel Splatting for Sparse-View Surface Reconstruction
Zhuowen Shen
Yuan Liu
Zhang Chen
Zhong Li
Jiepeng Wang
...
Jingdong Zhang
Yi Tian Xu
Scott Schaefer
Xin Li
Wenping Wang
3DGS
120
1
0
19 Dec 2024
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
Haoyi Jiang
Liu Liu
Tianheng Cheng
Xinjie Wang
Tianwei Lin
Zhizhong Su
Wen Liu
Xinyu Wang
3DGS
ViT
155
7
0
17 Dec 2024
RoMeO: Robust Metric Visual Odometry
RoMeO: Robust Metric Visual Odometry
JunDa Cheng
Z. Cai
Zhaoxing Zhang
Wei Yin
Matthias Müller
Michael Paulitsch
Xin Yang
129
0
0
16 Dec 2024
Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction
Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction
Dongxu Wei
Zhiqi Li
Peidong Liu
147
2
0
09 Dec 2024
PaintScene4D: Consistent 4D Scene Generation from Text Prompts
PaintScene4D: Consistent 4D Scene Generation from Text Prompts
Vinayak Gupta
Yunze Man
Yu-Xiong Wang
VGen
121
0
0
05 Dec 2024
Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
Jiahao Lu
Tianyu Huang
Peng Li
Zhiyang Dou
Cheng Lin
Zhiming Cui
Z. Dong
Sai-Kit Yeung
Wenping Wang
Yuan Liu
VGen
MDE
131
10
0
04 Dec 2024
AlphaTablets: A Generic Plane Representation for 3D Planar
  Reconstruction from Monocular Videos
AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos
Yuze He
Wang Zhao
Shaohui Liu
Yubin Hu
Yushi Bai
Yu-Hui Wen
Yang Liu
105
1
0
29 Nov 2024
SharpDepth: Sharpening Metric Depth Predictions Using Diffusion
  Distillation
SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation
Duc-Hai Pham
Tung Do
P. Nguyen
Binh-Son Hua
K. Nguyen
Rang Nguyen
MDE
111
2
0
27 Nov 2024
Boost 3D Reconstruction using Diffusion-based Monocular Camera
  Calibration
Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration
Junyuan Deng
Wei Yin
Xiaoyang Guo
Qian Zhang
Xiaotao Hu
Weiqiang Ren
Xiaoxiao Long
P. Tan
DiffM
MDE
113
1
0
26 Nov 2024
UniGaussian: Driving Scene Reconstruction from Multiple Camera Models via Unified Gaussian Representations
UniGaussian: Driving Scene Reconstruction from Multiple Camera Models via Unified Gaussian Representations
Y. Ren
Guile Wu
Runhao Li
Zheyuan Yang
Yibo Liu
Xingxin Chen
Tongtong Cao
Bingbing Liu
3DGS
115
4
0
22 Nov 2024
SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
Xuanchi Ren
Y. Lu
Hanxue Liang
Zhangjie Wu
Huan Ling
Mike Chen
Sanja Fidler
Francis Williams
Jiahui Huang
3DGS
69
10
0
26 Oct 2024
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
Ruicheng Wang
Sicheng Xu
Cassie Dai
Jianfeng Xiang
Yu Deng
Xin Tong
Jiaolong Yang
TPM
3DH
MDE
117
36
0
24 Oct 2024
VistaDream: Sampling multiview consistent images for single-view scene
  reconstruction
VistaDream: Sampling multiview consistent images for single-view scene reconstruction
Haiping Wang
Yuan Liu
Ziwei Liu
Wenping Wang
Z. Dong
Bisheng Yang
78
13
0
22 Oct 2024
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Ziyue Li
Dinesh Manocha
MoE
92
17
0
14 Oct 2024
FusionSense: Bridging Common Sense, Vision, and Touch for Robust
  Sparse-View Reconstruction
FusionSense: Bridging Common Sense, Vision, and Touch for Robust Sparse-View Reconstruction
Irving Fang
Kairui Shi
Xiaoxiao He
Siqi Tan
Yifan Wang
Hanwen Zhao
Hung-Jui Huang
Wenzhen Yuan
Chen Feng
Jing Zhang
3DGS
71
1
0
10 Oct 2024
Next Best Sense: Guiding Vision and Touch with FisherRF for 3D Gaussian Splatting
Next Best Sense: Guiding Vision and Touch with FisherRF for 3D Gaussian Splatting
Matthew Strong
Boshu Lei
Aiden Swann
Wen Jiang
Kostas Daniilidis
Monroe Kennedy III
3DGS
67
3
0
07 Oct 2024
HazyDet: Open-Source Benchmark for Drone-View Object Detection with Depth-Cues in Hazy Scenes
HazyDet: Open-Source Benchmark for Drone-View Object Detection with Depth-Cues in Hazy Scenes
Changfeng Feng
Zhenyuan Chen
Renke Kou
Chunping Wang
Jian Yang
Ming-Ming Cheng
Xiangbo Shu
Yimian Dai
73
5
0
30 Sep 2024
KineDepth: Utilizing Robot Kinematics for Online Metric Depth Estimation
KineDepth: Utilizing Robot Kinematics for Online Metric Depth Estimation
Soofiyan Atar
Yuheng Zhi
Florian Richter
Michael C. Yip
MDE
83
0
0
29 Sep 2024
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He
Haodong Li
Wei Yin
Yixun Liang
Leheng Li
Kaiqiang Zhou
Hongbo Zhang
Bingbing Liu
Ying-Cong Chen
DiffM
VLM
143
50
0
26 Sep 2024
Combining Absolute and Semi-Generalized Relative Poses for Visual
  Localization
Combining Absolute and Semi-Generalized Relative Poses for Visual Localization
Vojtech Panek
Torsten Sattler
Zuzana Kukelova
54
0
0
21 Sep 2024
Reactive Collision Avoidance for Safe Agile Navigation
Reactive Collision Avoidance for Safe Agile Navigation
Alessandro Saviolo
Niko Picello
Jeffrey Mao
Rishabh Verma
Giuseppe Loianno
78
0
0
18 Sep 2024
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
Gonzalo Martin Garcia
Karim Abou Zeid
Christian Schmidt
Daan de Geus
Alexander Hermans
Bastian Leibe
102
31
0
17 Sep 2024
Introducing a Class-Aware Metric for Monocular Depth Estimation: An
  Automotive Perspective
Introducing a Class-Aware Metric for Monocular Depth Estimation: An Automotive Perspective
Tim Bader
Leon Eisemann
Adrian Pogorzelski
Namrata Jangid
Attila B. Kis
58
0
0
06 Sep 2024
UDGS-SLAM : UniDepth Assisted Gaussian Splatting for Monocular SLAM
UDGS-SLAM : UniDepth Assisted Gaussian Splatting for Monocular SLAM
Mostafa Mansour
Ahmed Abdelsalam
Ari Happonen
J. Porras
Esa Rahtu
3DGS
MDE
165
0
0
31 Aug 2024
Map-Free Visual Relocalization Enhanced by Instance Knowledge and Depth
  Knowledge
Map-Free Visual Relocalization Enhanced by Instance Knowledge and Depth Knowledge
Mingyu Xiao
Runze Chen
Haiyong Luo
Fang Zhao
Juan Wang
Xuepeng Ma
65
0
0
23 Aug 2024
CoSEC: A Coaxial Stereo Event Camera Dataset for Autonomous Driving
CoSEC: A Coaxial Stereo Event Camera Dataset for Autonomous Driving
Shihan Peng
Hanyu Zhou
Hao Dong
Zhiwei Shi
Haoyue Liu
Yuxing Duan
Yi Chang
Luxin Yan
52
3
0
16 Aug 2024
Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces
Towards Robust Monocular Depth Estimation in Non-Lambertian Surfaces
Junrui Zhang
Jiaqi Li
Yachuan Huang
Yiran Wang
Jinghong Zheng
Liao Shen
Z. Cao
MDE
69
3
0
12 Aug 2024
BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular
  Depth Estimation
BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation
Xiang Zhang
Bingxin Ke
Hayko Riemenschneider
Nando Metzger
Anton Obukhov
Markus H. Gross
Konrad Schindler
Christopher Schroers
DiffM
MDE
66
8
0
25 Jul 2024
GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation
  Models
GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models
Yongtao Ge
Guangkai Xu
Zhiyue Zhao
Libo Sun
Zheng Huang
Yanlong Sun
Hao Chen
Chunhua Shen
MDE
53
3
0
18 Jun 2024
Depth Anything V2
Depth Anything V2
Lihe Yang
Bingyi Kang
Zilong Huang
Zhen Zhao
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
DiffM
VLM
MDE
104
406
0
13 Jun 2024
SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model
SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model
An-Chieh Cheng
Hongxu Yin
Yang Fu
Qiushan Guo
Ruihan Yang
Jan Kautz
Xiaolong Wang
Sifei Liu
LRM
81
65
0
03 Jun 2024
Learning Temporally Consistent Video Depth from Video Diffusion Priors
Learning Temporally Consistent Video Depth from Video Diffusion Priors
Jiahao Shao
Yuanbo Yang
Hongyu Zhou
Youmin Zhang
Yujun Shen
Vitor Campagnolo Guizilini
Yue Wang
Matteo Poggi
Yiyi Liao
VGen
DiffM
MDE
66
41
0
03 Jun 2024
MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion
  Scaffolds
MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds
Jiahui Lei
Yijia Weng
Adam W. Harley
Leonidas Guibas
Kostas Daniilidis
3DGS
72
43
0
27 May 2024
SceneFactory: A Workflow-centric and Unified Framework for Incremental Scene Modeling
SceneFactory: A Workflow-centric and Unified Framework for Incremental Scene Modeling
Yijun Yuan
M. Bleier
Andreas Nüchter
80
0
0
13 May 2024
MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based
  Monocular Guidance
MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance
Yuqun Wu
Jae Yong Lee
Chuhang Zou
Shenlong Wang
Derek Hoiem
63
0
0
12 Apr 2024
Gen3DSR: Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View
Gen3DSR: Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View
Andreea Dogaru
M. Ozer
Bernhard Egger
3DGS
84
6
0
04 Apr 2024
UniDepth: Universal Monocular Metric Depth Estimation
UniDepth: Universal Monocular Metric Depth Estimation
Luigi Piccinelli
Yung-Hsu Yang
Daniel Gehrig
Mattia Segu
Siyuan Li
Luc Van Gool
Fisher Yu
VLM
MDE
130
139
0
27 Mar 2024
DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing
DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing
Matias Turkulainen
Xuqian Ren
Iaroslav Melekhov
Otto Seiskari
Esa Rahtu
Arno Solin
3DGS
68
63
0
26 Mar 2024
TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos
TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos
Yufu Wang
ZiYun Wang
Lingjie Liu
Kostas Daniilidis
71
32
0
26 Mar 2024
Adaptive Surface Normal Constraint for Geometric Estimation from
  Monocular Images
Adaptive Surface Normal Constraint for Geometric Estimation from Monocular Images
Xiaoxiao Long
Yuhang Zheng
Yupeng Zheng
Beiwen Tian
Cheng Lin
Lingjie Liu
Hao Zhao
Guyue Zhou
Wenping Wang
60
12
0
08 Feb 2024
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Lihe Yang
Bingyi Kang
Zilong Huang
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
VLM
198
788
0
19 Jan 2024
Repurposing Diffusion-Based Image Generators for Monocular Depth
  Estimation
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Bingxin Ke
Anton Obukhov
Shengyu Huang
Nando Metzger
Rodrigo Caye Daudt
Konrad Schindler
VLM
MDE
90
159
0
04 Dec 2023
PolyMaX: General Dense Prediction with Mask Transformer
PolyMaX: General Dense Prediction with Mask Transformer
Xuan S. Yang
Liangzhe Yuan
Kimberly Wilber
Astuti Sharma
Xiuye Gu
...
Stephanie Debats
Huisheng Wang
Hartwig Adam
Mikhail Sirotenko
Liang-Chieh Chen
69
15
0
09 Nov 2023
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
Haoyi Zhu
Honghui Yang
Xiaoyang Wu
Di Huang
Sha Zhang
...
Hengshuang Zhao
Chunhua Shen
Yu Qiao
Tong He
Wanli Ouyang
SSL
125
46
0
12 Oct 2023
Uni3D: Exploring Unified 3D Representation at Scale
Uni3D: Exploring Unified 3D Representation at Scale
Junsheng Zhou
Jinsheng Wang
Baorui Ma
Yu-Shen Liu
Tiejun Huang
Xinlong Wang
62
96
0
10 Oct 2023
Vision Transformers Need Registers
Vision Transformers Need Registers
Zilong Chen
Maxime Oquab
Julien Mairal
Huaping Liu
ViT
146
340
0
28 Sep 2023
IEBins: Iterative Elastic Bins for Monocular Depth Estimation
IEBins: Iterative Elastic Bins for Monocular Depth Estimation
Shuwei Shao
Z. Pei
Xingming Wu
Zhong Liu
Weihai Chen
Zhengguo Li
MDE
48
51
0
25 Sep 2023
Robust Geometry-Preserving Depth Estimation Using Differentiable
  Rendering
Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering
Chi Zhang
Wei Yin
Gang Yu
Zhibin Wang
Tao Chen
Bin-Bin Fu
Qiufeng Wang
Chunhua Shen
MDE
133
6
0
18 Sep 2023
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View
  Transformation
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation
Zhiqi Li
Zhiding Yu
David Austin
Mingsheng Fang
Shiyi Lan
Jan Kautz
J. Álvarez
77
108
0
04 Jul 2023
Towards Zero-Shot Scale-Aware Monocular Depth Estimation
Towards Zero-Shot Scale-Aware Monocular Depth Estimation
Vitor Campagnolo Guizilini
Igor Vasiljevic
Di Chen
Rares Andrei Ambrus
Adrien Gaidon
MDE
75
79
0
29 Jun 2023
Previous
1234
Next