1907.01341
Cited By
v1
v2
v3 (latest)
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer
2 July 2019
René Ranftl
Katrin Lasinger
David Hafner
Konrad Schindler
V. Koltun
MDE
Papers citing
"Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer"
50 / 1,081 papers shown
Title
Boosting Multi-View Stereo with Depth Foundation Model in the Absence of Real-World Labels
Jie Zhu
Bo Peng
Zhe Zhang
Bingzheng Liu
Jianjun Lei
52
0
0
16 Apr 2025
Metric-Solver: Sliding Anchored Metric Depth Estimation from a Single Image
Tao Wen
Jiadong Wang
Yuxiao Chen
Shugong Xu
Fangqiu Yi
Xuelong Li
MDE
121
0
0
16 Apr 2025
TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion
Yanjie Wang
Jiajian Li
Chaoyi Hong
Ruibo Li
Liusheng Sun
Xiao-yang Song
Zhe Wang
Zhiguo Cao
Guosheng Lin
MDE
105
0
0
16 Apr 2025
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang
Xin Xu
Yu-Xiong Wang
DiffM
199
0
0
15 Apr 2025
DeepWheel: Generating a 3D Synthetic Wheel Dataset for Design and Performance Evaluation
Soyoung Yoo
Namwoo Kang
116
0
0
15 Apr 2025
GATE3D: Generalized Attention-based Task-synergized Estimation in 3D*
Eunsoo Im
Jung Kwon Lee
Changhyun Jee
170
0
0
15 Apr 2025
Art3D: Training-Free 3D Generation from Flat-Colored Illustration
Xiaoyan Cong
Jiayi Shen
Zekun Li
Rao Fu
Tao Lu
Srinath Sridhar
3DH
90
0
0
14 Apr 2025
Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
Zeren Jiang
Chuanxia Zheng
Iro Laina
Diane Larlus
Andrea Vedaldi
VGen
97
2
0
10 Apr 2025
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution
Gene Chou
Wenqi Xian
Guandao Yang
Mohamed Abdelfattah
Bharath Hariharan
Noah Snavely
Ning Yu
P. Debevec
MDE
122
0
0
09 Apr 2025
PromptHMR: Promptable Human Mesh Recovery
Yufu Wang
Yu Sun
Priyanka Patel
Kostas Daniilidis
Michael J. Black
Muhammed Kocabas
3DH
179
0
0
08 Apr 2025
A High-Force Gripper with Embedded Multimodal Sensing for Powerful and Perception Driven Grasping
Edoardo Del Bianco
Davide Torielli
Federico Rollo
Damiano Gasperini
Arturo Laurenzi
Lorenzo Baccelliere
L. Muratore
Marco Roveri
Nikos Tsagarakis
68
2
0
07 Apr 2025
Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision
Yuandong Pu
Le Zhuo
Kaiwen Zhu
Liangbin Xie
Wenlong Zhang
Xiangyu Chen
Peng Gao
Yu Qiao
Chao Dong
Yihao Liu
MLLM
103
2
0
07 Apr 2025
Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation
Xin Zhang
Robby T. Tan
Mamba
100
0
0
04 Apr 2025
Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and Harmonization
Jiadong Wang
Jingyuan Liu
Xin Sun
Krishna Kumar Singh
Zhixin Shu
...
Nanxuan Zhao
Tuanfeng Y. Wang
Simon Chen
Ulrich Neumann
Jae Shin Yoon
74
0
0
03 Apr 2025
All-day Depth Completion via Thermal-LiDAR Fusion
Janghyun Kim
Minseong Kweon
Jinsun Park
Ukcheol Shin
VLM
94
0
0
03 Apr 2025
DEPTHOR: Depth Enhancement from a Practical Light-Weight dToF Sensor and RGB Image
Jijun Xiang
Xuan Zhu
Xianqi Wang
Yuanbo Wang
Hao Zhang
Fei Guo
Xin-She Yang
114
0
0
02 Apr 2025
WorldScore: A Unified Evaluation Benchmark for World Generation
Haoyi Duan
Hong-Xing Yu
Sirui Chen
L. Fei-Fei
Jiajun Wu
VGen
144
8
0
01 Apr 2025
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Tian-Xing Xu
Xiangjun Gao
Wenbo Hu
Xiaoyu Li
Song-Hai Zhang
Ying Shan
VGen
MDE
154
2
0
01 Apr 2025
Beyond Static Scenes: Camera-controllable Background Generation for Human Motion
Mingshuai Yao
Mengting Chen
Qinye Zhou
Yize Zhang
Ming-Yu Liu
...
Chen Ju
Shuai Xiao
Qingwen Liu
Jinsong Lan
Wangmeng Zuo
DiffM
VGen
127
1
0
01 Apr 2025
Distance Estimation to Support Assistive Drones for the Visually Impaired using Robust Calibration
Suman Raj
Bhavani A Madhabhavi
Madhav Kumar
Prabhav Gupta
Yogesh Simmhan
84
1
0
31 Mar 2025
Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views
Chong Bao
Xiyu Zhang
Zehao Yu
Jiale Shi
Guofeng Zhang
Songyou Peng
Zhaopeng Cui
3DGS
3DV
99
0
0
31 Mar 2025
Easi3R: Estimating Disentangled Motion from DUSt3R Without Training
Xingyu Chen
Yue Chen
Yuliang Xiu
Andreas Geiger
Anpei Chen
3DPC
VGen
114
7
0
31 Mar 2025
Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model
Jannik Endres
Oliver Hahn
Charles Corbière
Simone Schaub-Meyer
Stefan Roth
Alexandre Alahi
MDE
77
0
0
30 Mar 2025
MVSAnywhere: Zero-Shot Multi-View Stereo
Sergio Izquierdo
Mohamed Sayed
Michael Firman
Guillermo Garcia-Hernando
Daniyar Turmukhambetov
Javier Civera
Oisin Mac Aodha
Gabriel J. Brostow
Jamie Watson
3DV
125
4
0
28 Mar 2025
One Look is Enough: A Novel Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation Models on High-Resolution Images
Byeongjun Kwon
Munchurl Kim
VLM
MDE
95
0
0
28 Mar 2025
Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion
S. Yu
Yuxin Chen
Zhongang Qi
Zeke Xie
Yifan Wang
Lijun Wang
Ying Shan
Huchuan Lu
77
0
0
28 Mar 2025
Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and Challenges
Ukcheol Shin
Jinsun Park
3DV
MDE
81
0
0
28 Mar 2025
GenFusion: Closing the Loop between Reconstruction and Generation via Videos
Sibo Wu
Congrong Xu
Binbin Huang
Andreas Geiger
Anpei Chen
VGen
495
1
0
27 Mar 2025
FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust Fusion
Pihai Sun
Junjun Jiang
Yuanqi Yao
Youyu Chen
Wenbo Zhao
Kui Jiang
Xianming Liu
MDE
81
0
0
25 Mar 2025
Zero-Shot Human-Object Interaction Synthesis with Multimodal Priors
Yuke Lou
Yiming Wang
Zhen Wu
Rui Zhao
Wenjia Wang
Mingyi Shi
Taku Komura
99
2
0
25 Mar 2025
Aether: Geometric-Aware Unified World Modeling
Aether Team
Haoyi Zhu
Yanjie Wang
Jianjun Zhou
Wenzheng Chang
...
Zizun Li
Junyi Chen
Chunhua Shen
Jiangmiao Pang
Tong He
DiffM
VGen
127
9
0
24 Mar 2025
Enhancing Martian Terrain Recognition with Deep Constrained Clustering
Tejas Panambur
M. Parente
75
0
0
22 Mar 2025
Distilling Monocular Foundation Model for Fine-grained Depth Completion
Yingping Liang
Yutao Hu
Wenqi Shao
Ying Fu
MDE
101
1
0
21 Mar 2025
Radar-Guided Polynomial Fitting for Metric Depth Estimation
Patrick Rim
Hyoungseob Park
Vadim Ezhov
Jeffrey Moon
Alex Wong
MDE
118
0
0
21 Mar 2025
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance
Quanhao Li
Zhen Xing
Rui Wang
Hui Zhang
Qi Dai
Zuxuan Wu
VGen
118
2
0
20 Mar 2025
Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras
Beilei Cui
Long Bai
Mobarakol Islam
An-Chi Wang
Zejun Ma
...
Feng Li
Zhen Chen
Zhongliang Jiang
Nassir Navab
Hongliang Ren
MedIm
88
0
0
20 Mar 2025
UniK3D: Universal Camera Monocular 3D Estimation
Luigi Piccinelli
Daniel Gehrig
Mattia Segu
Yifan Yang
Siyuan Li
Wim Abbeloos
Luc Van Gool
MDE
88
1
0
20 Mar 2025
NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes
Han-Hung Lee
Qinghong Han
Angel X. Chang
153
0
0
20 Mar 2025
QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge
Xuan Shen
Weize Ma
Jing Liu
Changdi Yang
Rui Ding
...
Wei Niu
Yanzhi Wang
Pu Zhao
Jun Lin
Jiuxiang Gu
MQ
99
0
0
20 Mar 2025
EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining
Boshen Xu
Yuting Mei
Xinbi Liu
Sipeng Zheng
Qin Jin
VLM
MDE
110
0
0
19 Mar 2025
ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing
Yulin Pan
Xiangteng He
Chaojie Mao
Zhen Han
Zeyinzi Jiang
Junxuan Zhang
Yu Liu
EGVM
VLM
116
2
0
18 Mar 2025
SplatVoxel: History-Aware Novel View Streaming without Temporal Training
Yiming Wang
Lucy Chai
Xuan Luo
Michael Niemeyer
Manuel Lagunas
Stephen Lombardi
Siyu Tang
Tiancheng Sun
3DGS
146
0
0
18 Mar 2025
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation
Haoyu Guo
He Zhu
Sida Peng
Haotong Lin
Yunzhi Yan
Tao Xie
Wenguan Wang
Xiaowei Zhou
Hujun Bao
3DV
MDE
125
1
0
18 Mar 2025
Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception
Dingkang Liang
Dingyuan Zhang
Xin Zhou
Sifan Tu
Tianrui Feng
Xiaofan Li
Yumeng Zhang
Mingyang Du
Xiao Tan
Xiang Bai
110
3
0
17 Mar 2025
GenStereo: Towards Open-World Generation of Stereo Images and Unsupervised Matching
Feng Qiao
Zhexiao Xiong
Eric Xing
Nathan Jacobs
DiffM
3DV
94
1
0
17 Mar 2025
Deblur Gaussian Splatting SLAM
Francesco Girlanda
D. Rozumnyi
Marc Pollefeys
Martin R. Oswald
3DGS
90
0
0
16 Mar 2025
Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space
Weichen Zhang
Zile Zhou
Zhiheng Zheng
Chen Gao
Jinqiang Cui
Yongqian Li
Xinlei Chen
Xiao-Ping Zhang
LRM
137
5
0
14 Mar 2025
Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation
Hongyu Wen
Yiming Zuo
Venkat Subramanian
Patrick Chen
Jia Deng
3DV
168
0
0
14 Mar 2025
CoSTA*: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing
Advait Gupta
NandaKiran Velaga
Dang Nguyen
Dinesh Manocha
DiffM
128
0
0
13 Mar 2025
Adding Additional Control to One-Step Diffusion with Joint Distribution Matching
Yihong Luo
Tianyang Hu
Yifan Song
Jiacheng Sun
Zechao Li
Jing Tang
DiffM
148
1
0
13 Mar 2025