1907.01341
Cited By
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer
2 July 2019
René Ranftl
Katrin Lasinger
David Hafner
Konrad Schindler
V. Koltun
MDE
Papers citing "Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer"
50 / 1,054 papers shown
Title
Enhancing Prompt Following with Visual Control Through Training-Free Mask-Guided Diffusion
Hongyu Chen
Yi-Meng Gao
Min Zhou
Peng Wang
Xubin Li
Tiezheng Ge
Bo Zheng
DiffM
31
5
0
23 Apr 2024
Generating Daylight-driven Architectural Design via Diffusion Models
Pengzhi Li
Baijuan Li
AI4CE
DiffM
26
11
0
20 Apr 2024
InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior
Zhiheng Liu
Ouyang Hao
Qiuyu Wang
Ka Leong Cheng
Jie Xiao
Kai Zhu
Nan Xue
Yu Liu
Yujun Shen
Yang Cao
DiffM
3DGS
51
20
0
17 Apr 2024
IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination
Xi Chen
Sida Peng
Dongchen Yang
Yuan Liu
Bowen Pan
Chengfei Lv
Xiaowei Zhou
DiffM
44
18
0
17 Apr 2024
Predicting Long-horizon Futures by Conditioning on Geometry and Time
Tarasha Khurana
Deva Ramanan
AI4TS
55
0
0
17 Apr 2024
How to deal with glare for improved perception of Autonomous Vehicles
M. Zeshan Alam
Zeeshan Kaleem
S. Kelouwani
33
0
0
17 Apr 2024
Diffscaler: Enhancing the Generative Prowess of Diffusion Transformers
Nithin Gopalakrishnan Nair
Jeya Maria Jose Valanarasu
Vishal M. Patel
50
1
0
15 Apr 2024
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
Han Lin
Jaemin Cho
Abhaysinh Zala
Mohit Bansal
DiffM
VGen
69
20
0
15 Apr 2024
Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video
Hongchi Xia
Zhi-Hao Lin
Wei-Chiu Ma
Shenlong Wang
VGen
38
13
0
15 Apr 2024
Probing the 3D Awareness of Visual Foundation Models
Mohamed El Banani
Amit Raj
Kevis-Kokitsi Maninis
Abhishek Kar
Yuanzhen Li
Michael Rubinstein
Deqing Sun
Leonidas J. Guibas
Justin Johnson
Varun Jampani
40
79
0
12 Apr 2024
On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation
Agneet Chatterjee
Tejas Gokhale
Chitta Baral
Yezhou Yang
VLM
35
2
0
12 Apr 2024
G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images
Zixiong Huang
Qi Chen
Libo Sun
Yifan Yang
Naizhou Wang
Mingkui Tan
Qi Wu
3DV
43
2
0
11 Apr 2024
Socially Pertinent Robots in Gerontological Healthcare
Xavier Alameda-Pineda
Angus Addlesee
Daniel Hernández García
Chris Reinke
Soraya Arias
...
Pinchas Tandeitnik
Francesco Tonini
Nicolas Turro
T. Wintz
Yanchao Yu
42
3
0
11 Apr 2024
Fast Encoder-Based 3D from Casual Videos via Point Track Processing
Yoni Kasten
Wuyue Lu
Haggai Maron
3DPC
37
2
0
10 Apr 2024
Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior
Fan Lu
Kwan-Yee Lin
Yan Xu
Hongsheng Li
Guang Chen
Changjun Jiang
31
7
0
10 Apr 2024
Semantic Flow: Learning Semantic Field of Dynamic Scenes from Monocular Videos
Fengrui Tian
Yueqi Duan
Angtian Wang
Jianfei Guo
Shaoyi Du
3DPC
39
3
0
08 Apr 2024
Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving
Jinlong Li
Baolu Li
Zhengzhong Tu
Xinyu Liu
Qing Guo
Felix Juefei Xu
Runsheng Xu
Hongkai Yu
DiffM
53
18
0
07 Apr 2024
DATENeRF: Depth-Aware Text-based Editing of NeRFs
Sara Rojas
Julien Philip
Kai Zhang
Sai Bi
Fujun Luan
Guohao Li
Kalyan Sunkavalli
DiffM
27
3
0
06 Apr 2024
Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning
Rui Li
Tobias Fischer
Mattia Segu
Marc Pollefeys
Luc Van Gool
Federico Tombari
24
8
0
04 Apr 2024
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
Petru-Daniel Tudosiu
Yongxin Yang
Shifeng Zhang
Fei Chen
Jingyu Sun
Gerasimos Lampouras
Ignacio Iacobacci
Sarah Parisot
45
10
0
03 Apr 2024
The Devil is in the Edges: Monocular Depth Estimation with Edge-aware Consistency Fusion
Pengzhi Li
Yikang Ding
Haohan Wang
Chengshuai Tang
Zhiheng Li
MDE
46
1
0
30 Mar 2024
MaGRITTe: Manipulative and Generative 3D Realization from Image, Topview and Text
T. Hara
Tatsuya Harada
DiffM
27
3
0
30 Mar 2024
Sketch-to-Architecture: Generative AI-aided Architectural Design
Pengzhi Li
Baijuan Li
Zhiheng Li
3DV
AI4CE
LM&Ro
40
18
0
29 Mar 2024
GlORIE-SLAM: Globally Optimized RGB-only Implicit Encoding Point Cloud SLAM
Ganlin Zhang
Erik Sandström
Youmin Zhang
Manthan Patel
Luc Van Gool
Martin R. Oswald
41
19
0
28 Mar 2024
Neural Fields for 3D Tracking of Anatomy and Surgical Instruments in Monocular Laparoscopic Video Clips
Beerend G. A. Gerats
J. Wolterink
Seb P. Mol
I. A. Broeders
53
0
0
28 Mar 2024
GeoAuxNet: Towards Universal 3D Representation Learning for Multi-sensor Point Clouds
Shengjun Zhang
Xin Fei
Yueqi Duan
3DPC
38
1
0
28 Mar 2024
UniDepth: Universal Monocular Metric Depth Estimation
Luigi Piccinelli
Yung-Hsu Yang
Daniel Gehrig
Mattia Segu
Siyuan Li
Luc Van Gool
Fisher Yu
VLM
MDE
81
128
0
27 Mar 2024
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation
Suraj Patni
Aradhye Agarwal
Chetan Arora
VLM
DiffM
MDE
33
26
0
27 Mar 2024
Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos
Akshay Paruchuri
S. Ehrenstein
Shuxian Wang
Inbar Fried
Stephen M. Pizer
Marc Niethammer
Roni Sengupta
MDE
48
6
0
26 Mar 2024
Learning with Unreliability: Fast Few-shot Voxel Radiance Fields with Relative Geometric Consistency
Yingjie Xu
Bangzhen Liu
Hao Tang
Bailin Deng
Shengfeng He
21
4
0
26 Mar 2024
PropTest: Automatic Property Testing for Improved Visual Programming
Jaywon Koo
Ziyan Yang
Paola Cascante-Bonilla
Baishakhi Ray
Vicente Ordonez
LRM
29
2
0
25 Mar 2024
Configurable Holography: Towards Display and Scene Adaptation
Yicheng Zhan
Liang Shi
Wojciech Matusik
Qi Sun
K. Akşit
30
0
0
24 Mar 2024
Recent Trends in 3D Reconstruction of General Non-Rigid Scenes
Raza Yunus
J. E. Lenssen
Michael Niemeyer
Yiyi Liao
Christian Rupprecht
Christian Theobalt
Gerard Pons-Moll
Jia-Bin Huang
Vladislav Golyanik
Eddy Ilg
48
25
0
22 Mar 2024
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation
Mu Hu
Wei Yin
C. Zhang
Zhipeng Cai
Xiaoxiao Long
Kaixuan Wang
Gang Yu
Chunhua Shen
Shaojie Shen
3DGS
54
117
0
22 Mar 2024
Latent Diffusion Models for Attribute-Preserving Image Anonymization
Luca Piano
Pietro Basci
Fabrizio Lamberti
Lia Morra
DiffM
24
4
0
21 Mar 2024
DepthFM: Fast Monocular Depth Estimation with Flow Matching
Ming Gui
Johannes S. Fischer
Ulrich Prestel
Pingchuan Ma
Dmytro Kotovenko
Olga Grebenkova
S. A. Baumann
Vincent Tao Hu
Bjorn Ommer
MDE
36
53
0
20 Mar 2024
Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos
Hadi Alzayer
Zhihao Xia
Xuaner Zhang
Eli Shechtman
Jia-Bin Huang
Michael Gharbi
DiffM
VGen
27
19
0
19 Mar 2024
HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
Fucai Ke
Zhixi Cai
Simindokht Jahangard
Weiqing Wang
P. D. Haghighi
Hamid Rezatofighi
LRM
51
10
0
19 Mar 2024
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
Xiao Fu
Wei Yin
Mu Hu
Kaixuan Wang
Yuexin Ma
Ping Tan
Shaojie Shen
Dahua Lin
Xiaoxiao Long
DiffM
48
106
0
18 Mar 2024
HOIDiffusion: Generating Realistic 3D Hand-Object Interaction Data
Mengqi Zhang
Yang Fu
Zheng Ding
Sifei Liu
Zhuowen Tu
Xiaolong Wang
44
17
0
18 Mar 2024
SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors
Chenyang Ma
Kai Lu
Ta-Ying Cheng
Niki Trigoni
Andrew Markham
LRM
40
7
0
18 Mar 2024
InTeX: Interactive Text-to-texture Synthesis via Unified Depth-aware Inpainting
Jiaxiang Tang
Ruijie Lu
Xiaokang Chen
Xiang Wen
Gang Zeng
Ziwei Liu
DiffM
MDE
32
15
0
18 Mar 2024
Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery
Yuqi Zhang
Guanying Chen
Jiaxing Chen
Shuguang Cui
41
1
0
18 Mar 2024
Benchmarking the Robustness of UAV Tracking Against Common Corruptions
Xiaoqiong Liu
Yunhe Feng
Shu Hu
Xiaohui Yuan
Heng Fan
AAML
45
0
0
18 Mar 2024
LightIt: Illumination Modeling and Control for Diffusion Models
Peter Kocsis
Julien Philip
Kalyan Sunkavalli
Matthias Nießner
Yannick Hold-Geoffroy
34
21
0
15 Mar 2024
FeatUp: A Model-Agnostic Framework for Features at Any Resolution
Stephanie Fu
Mark Hamilton
Laura E. Brandt
Axel Feldmann
Zhoutong Zhang
William T. Freeman
MDE
38
49
0
15 Mar 2024
DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video
Huiqiang Sun
Xingyi Li
Liao Shen
Xinyi Ye
Ke Xian
Zhiguo Cao
VGen
27
8
0
15 Mar 2024
3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation
Frank Zhang
Yibo Zhang
Quan Zheng
R. Ma
W. Hua
Hujun Bao
Weiwei Xu
Changqing Zou
54
9
0
14 Mar 2024
FogGuard: guarding YOLO against fog using perceptual loss
Soheil Gharatappeh
Sepideh Neshatfar
Salimeh Yasaei Sekeh
Vikas Dhiman
42
1
0
13 Mar 2024
3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surface
Linyi Jin
Nilesh Kulkarni
David Fouhey
3DV
34
2
0
13 Mar 2024