arXiv:2404.15506
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation
22 March 2024
Mu Hu
Wei Yin
C. Zhang
Zhipeng Cai
Xiaoxiao Long
Hao Chen
Kaixuan Wang
Gang Yu
Chunhua Shen
Shaojie Shen
3DGS
Papers citing "Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation"
50 / 191 papers shown
H2-Mapping: Real-time Dense Mapping Using Hierarchical Hybrid Representation
Chenxing Jiang
Han-Qi Zhang
Peize Liu
Zehuan Yu
Hui Cheng
Boyu Zhou
Shaojie Shen
3DH
86
37
0
05 Jun 2023
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
...
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLM
CLIP
SSL
289
3,383
0
14 Apr 2023
Learning Optical Flow and Scene Flow with Bidirectional Camera-LiDAR Fusion
Haisong Liu
Tao Lu
Yihui Xu
Jia-Wei Liu
Limin Wang
81
11
0
21 Mar 2023
Iterative Geometry Encoding Volume for Stereo Matching
Gangwei Xu
Xianqi Wang
Xiao-Hua Ding
Xin Yang
3DV
45
192
0
12 Mar 2023
ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth
S. Bhat
R. Birkl
Diana Wofk
Peter Wonka
Matthias Müller
VLM
MDE
143
501
0
23 Feb 2023
Argoverse 2: Next Generation Datasets for Self-Driving Perception and Forecasting
Benjamin Wilson
William Qi
Tanmay Agarwal
John Lambert
Jagjeet Singh
...
Andrew Hartnett
J. K. Pontes
Deva Ramanan
Peter Carr
James Hays
3DPC
AI4TS
83
636
0
02 Jan 2023
Unifying Flow, Stereo and Depth Estimation
Haofei Xu
Jing Zhang
Jianfei Cai
Hamid Rezatofighi
Feng Yu
Dacheng Tao
Andreas Geiger
MDE
93
211
0
10 Nov 2022
Hierarchical Normalization for Robust Monocular Depth Estimation
Chi Zhang
Wei Yin
Zhibin Wang
Gang Yu
Bin-Bin Fu
Chunhua Shen
MDE
54
38
0
18 Oct 2022
IronDepth: Iterative Refinement of Single-View Depth using Surface Normal and its Uncertainty
Gwangbin Bae
Ignas Budvytis
R. Cipolla
44
28
0
07 Oct 2022
Towards Accurate Reconstruction of 3D Scene Shape from A Single Monocular Image
Wei Yin
Jianming Zhang
Oliver Wang
Simon Niklaus
Simon Chen
Yifan Liu
Chunhua Shen
MDE
84
45
0
28 Aug 2022
NeuRIS: Neural Reconstruction of Indoor Scenes Using Normal Priors
Jiepeng Wang
Peng Wang
Xiaoxiao Long
Christian Theobalt
Taku Komura
Lingjie Liu
Wenping Wang
3DV
42
146
0
27 Jun 2022
BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection
Yinhao Li
Zheng Ge
Guanyi Yu
Jinrong Yang
Zengran Wang
Yukang Shi
Jian-Yuan Sun
Zeming Li
MDE
68
613
0
21 Jun 2022
MonoSDF: Exploring Monocular Geometric Cues for Neural Implicit Surface Reconstruction
Zehao Yu
Songyou Peng
Michael Niemeyer
Torsten Sattler
Andreas Geiger
112
461
0
01 Jun 2022
Multiview Stereo with Cascaded Epipolar RAFT
Zeyu Ma
Zachary Teed
Jia Deng
100
46
0
09 May 2022
Improving Monocular Visual Odometry Using Learned Depth
Libo Sun
Wei Yin
Enze Xie
Zhengrong Li
Changming Sun
Chunhua Shen
MDE
61
26
0
04 Apr 2022
3D Common Corruptions and Data Augmentation
Oğuzhan Fatih Kar
Teresa Yeo
Andrei Atanov
Amir Zamir
3DPC
78
111
0
02 Mar 2022
A Confidence-based Iterative Solver of Depths and Surface Normals for Deep Multi-view Stereo
Wang Zhao
Shaohui Liu
Yi Wei
Hengkai Guo
Yong Liu
3DV
101
15
0
19 Jan 2022
A ConvNet for the 2020s
Zhuang Liu
Hanzi Mao
Chaozheng Wu
Christoph Feichtenhofer
Trevor Darrell
Saining Xie
ViT
152
5,158
0
10 Jan 2022
MSeg: A Composite Dataset for Multi-domain Semantic Segmentation
John Lambert
Zhuang Liu
Ozan Sener
James Hays
V. Koltun
VLM
70
202
0
27 Dec 2021
PandaSet: Advanced Sensor Suite Dataset for Autonomous Driving
Pengchuan Xiao
Zhenlei Shao
Steven Hao
Zishuo Zhang
Xiaolin Chai
...
Jian Wu
Kai Sun
Kun Jiang
Yunlong Wang
Diange Yang
3DV
3DPC
57
189
0
23 Dec 2021
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
383
15,454
0
20 Dec 2021
Dense Depth Priors for Neural Radiance Fields from Sparse Input Views
Barbara Roessle
Jonathan T. Barron
B. Mildenhall
Pratul P. Srinivasan
Matthias Nießner
VGen
164
368
0
06 Dec 2021
DIML/CVL RGB-D Dataset: 2M RGB-D Images of Natural Indoor and Outdoor Scenes
Jaehoon Cho
Dongbo Min
Youngjung Kim
Kwanghoon Sohn
3DV
92
43
0
22 Oct 2021
Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans
Ainaz Eftekhar
Alexander Sax
Roman Bachmann
Jitendra Malik
Amir Zamir
MedIm
76
299
0
11 Oct 2021
Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation
Gwangbin Bae
Ignas Budvytis
R. Cipolla
108
118
0
20 Sep 2021
Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI
Santhosh Kumar Ramakrishnan
Aaron Gokaslan
Erik Wijmans
Oleksandr Maksymets
Alexander Clegg
...
Andrew Westbury
Angel X. Chang
Manolis Savva
Yili Zhao
Dhruv Batra
61
384
0
16 Sep 2021
RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching
Lahav Lipson
Zachary Teed
Jia Deng
MDE
80
404
0
15 Sep 2021
DROID-SLAM: Deep Visual SLAM for Monocular, Stereo, and RGB-D Cameras
Zachary Teed
Jia Deng
MDE
VLM
100
597
0
24 Aug 2021
Depth-supervised NeRF: Fewer Views and Faster Training for Free
Kangle Deng
Andrew Liu
Jun-Yan Zhu
Deva Ramanan
172
888
0
06 Jul 2021
Scaling Vision Transformers
Xiaohua Zhai
Alexander Kolesnikov
N. Houlsby
Lucas Beyer
ViT
128
1,084
0
08 Jun 2021
Adaptive Surface Normal Constraint for Depth Estimation
Xiaoxiao Long
Cheng Lin
Lingjie Liu
Wei Li
Christian Theobalt
Ruigang Yang
Wenping Wang
3DV
67
61
0
29 Mar 2021
Generalizing to the Open World: Deep Visual Odometry with Online Adaptation
Shunkai Li
Xin Wu
Yingdian Cao
H. Zha
53
43
0
29 Mar 2021
Vision Transformers for Dense Prediction
René Ranftl
Alexey Bochkovskiy
V. Koltun
ViT
MDE
125
1,729
0
24 Mar 2021
Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction
Guanglei Yang
Hao Tang
M. Ding
N. Sebe
Elisa Ricci
ViT
69
190
0
22 Mar 2021
DSEC: A Stereo Event Camera Dataset for Driving Scenarios
Mathias Gehrig
Willem Aarents
Daniel Gehrig
Davide Scaramuzza
3DV
80
336
0
10 Mar 2021
Virtual Normal: Enforcing Geometric Constraints for Accurate and Robust Depth Prediction
Wei Yin
Yifan Liu
Chunhua Shen
MDE
67
72
0
07 Mar 2021
PENet: Towards Precise and Efficient Image Guided Depth Completion
Mu Hu
Shuling Wang
Bin Li
Shiyu Ning
Li Fan
Xiaojin Gong
MDE
125
277
0
01 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
856
29,341
0
26 Feb 2021
Learning to Recover 3D Scene Shape from a Single Image
Wei Yin
Jianming Zhang
Oliver Wang
Simon Niklaus
Long Mai
Simon Chen
Chunhua Shen
MDE
91
236
0
17 Dec 2020
GeoNet++: Iterative Geometric Neural Network with Edge-Aware Refinement for Joint Depth and Surface Normal Estimation
Xiaojuan Qi
Zhengzhe Liu
Renjie Liao
Philip Torr
R. Urtasun
Jiaya Jia
3DV
109
62
0
13 Dec 2020
Robust Consistent Video Depth Estimation
Johannes Kopf
Xuejian Rong
Jia-Bin Huang
MDE
141
168
0
10 Dec 2020
AdaBins: Depth Estimation using Adaptive Bins
S. Bhat
Ibraheem Alhashim
Peter Wonka
3DV
MDE
ViT
110
855
0
28 Nov 2020
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding
Mike Roberts
Jason Ramapuram
Anurag Ranjan
Atulit Kumar
Miguel Angel Bautista
Nathan Paczan
Russ Webb
Joshua M. Susskind
98
386
0
04 Nov 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
585
40,961
0
22 Oct 2020
Shape, Illumination, and Reflectance from Shading
Jonathan T. Barron
Jitendra Malik
3DV
40
727
0
07 Oct 2020
SNE-RoadSeg: Incorporating Surface Normal Information into Semantic Segmentation for Accurate Freespace Detection
Rui Fan
Haoyu Wang
Peide Cai
Ming-Yuan Liu
75
152
0
26 Aug 2020
OASIS: A Large-Scale Dataset for Single Image 3D in the Wild
Weifeng Chen
Shengyi Qian
David Fan
Noriyuki Kojima
Max Hamilton
Jia Deng
3DV
59
62
0
26 Jul 2020
Non-Local Spatial Propagation Network for Depth Completion
Jinsun Park
Kyungdon Joo
Zhe Hu
Chi Liu
In So Kweon
3DV
MDE
110
325
0
20 Jul 2020
Single View Metrology in the Wild
Rui Zhu
Xingyi Yang
Yannick Hold-Geoffroy
Federico Perazzi
Jonathan Eisenmann
Kalyan Sunkavalli
Manmohan Chandraker
60
36
0
18 Jul 2020
Surface Normal Estimation of Tilted Images via Spatial Rectifier
Tien Do
Khiem Vuong
S. Roumeliotis
H. Park
46
44
0
17 Jul 2020