1907.01341
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer
2 July 2019
René Ranftl
Katrin Lasinger
David Hafner
Konrad Schindler
V. Koltun
MDE
Papers citing "Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer"
50 / 1,081 papers shown
Title
An Inpainting-Infused Pipeline for Attire and Background Replacement
F. Mahlow
A. F. Zanella
William Alberto Cruz-Castaneda
Marcellus Amadeus
78
0
0
05 Feb 2024
Decomposition-based and Interference Perception for Infrared and Visible Image Fusion in Complex Scenes
Xilai Li
Xiaosong Li
Haishu Tan
64
1
0
03 Feb 2024
RIDERS: Radar-Infrared Depth Estimation for Robust Sensing
Han Li
Yukai Ma
Yuehao Huang
Yaqing Gu
Weihua Xu
Yong-Jin Liu
Xingxing Zuo
77
6
0
03 Feb 2024
Closing the Gap in Human Behavior Analysis: A Pipeline for Synthesizing Trimodal Data
Christian Stippel
Thomas Heitzinger
Rafael Sterzinger
Martin Kampel
41
0
0
02 Feb 2024
Improved Scene Landmark Detection for Camera Localization
Tien Do
Sudipta N. Sinha
62
3
0
31 Jan 2024
Proximity QA: Unleashing the Power of Multi-Modal Large Language Models for Spatial Proximity Analysis
Jianing Li
Xi Nan
Ming Lu
Li Du
Shanghang Zhang
65
2
0
31 Jan 2024
Evaluation in Neural Style Transfer: A Review
E. Ioannou
Steve Maddock
72
2
0
30 Jan 2024
Repositioning the Subject within Image
Yikai Wang
Chenjie Cao
Ke Fan
Qiaole Dong
Yifan Li
Xiangyang Xue
Yanwei Fu
DiffM
99
2
0
30 Jan 2024
BoostDream: Efficient Refining for High-Quality Text-to-3D Generation from Multi-View Diffusion
Yonghao Yu
Shunan Zhu
Huai Qin
Haorui Li
Jinglu Hu
73
8
0
30 Jan 2024
Depth Anything in Medical Images: A Comparative Study
John J. Han
Ayberk Acar
Callahan Henry
Jie Ying Wu
MedIm
47
10
0
29 Jan 2024
Diffutoon: High-Resolution Editable Toon Shading via Diffusion Models
Zhongjie Duan
Chengyu Wang
Cen Chen
Weining Qian
Jun Huang
DiffM
51
7
0
29 Jan 2024
Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes
Diandian Guo
Deng-Ping Fan
Tongyu Lu
Daniel Gehrig
Luc Van Gool
VOS
62
4
0
27 Jan 2024
FoVA-Depth: Field-of-View Agnostic Depth Estimation for Cross-Dataset Generalization
Daniel Lichy
Hang Su
Abhishek Badki
Jan Kautz
Orazio Gallo
MDE
77
2
0
24 Jan 2024
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Lihe Yang
Bingyi Kang
Zilong Huang
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
VLM
286
826
0
19 Jan 2024
ActAnywhere: Subject-Aware Video Background Generation
Boxiao Pan
Zhan Xu
Chun-Hao Paul Huang
Krishna Kumar Singh
Yang Zhou
Leonidas Guibas
Jimei Yang
VGen
DiffM
61
3
0
19 Jan 2024
DaFoEs: Mixing Datasets towards the generalization of vision-state deep-learning Force Estimation in Minimally Invasive Robotic Surgery
Mikel De Iturrate Reyzabal
Mingcong Chen
Wei Huang
Sebastien Ourselin
Hongbin Liu
OOD
107
3
0
17 Jan 2024
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Jonghyun Lee
Hansam Cho
Youngjoon Yoo
Seoung Bum Kim
Yonghyun Jeong
DiffM
53
7
0
17 Jan 2024
SCoFT: Self-Contrastive Fine-Tuning for Equitable Image Generation
Zhixuan Liu
Peter Schaldenbrand
Beverley-Claire Okogwu
Wenxuan Peng
Youngsik Yun
Andrew Hundt
Jihie Kim
Jean Oh
75
18
0
16 Jan 2024
A Study on Self-Supervised Pretraining for Vision Problems in Gastrointestinal Endoscopy
Edward Sanderson
B. Matuszewski
74
2
0
11 Jan 2024
HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models
Hanzhang Wang
Haoran Wang
Jinze Yang
Zhongrui Yu
Zeke Xie
Lei Tian
Xinyan Xiao
Junjun Jiang
Xianming Liu
Mingming Sun
DiffM
58
1
0
11 Jan 2024
Object-Centric Diffusion for Efficient Video Editing
Kumara Kahatapitiya
Adil Karjauv
Davide Abati
Fatih Porikli
Yuki M. Asano
A. Habibian
VGen
97
13
0
11 Jan 2024
Diffusion Priors for Dynamic View Synthesis from Monocular Videos
Chaoyang Wang
Peiye Zhuang
Aliaksandr Siarohin
Junli Cao
Guocheng Qian
Hsin-Ying Lee
Sergey Tulyakov
VGen
DiffM
86
13
0
10 Jan 2024
InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes
Mohamad Shahbazi
Liesbeth Claessens
Michael Niemeyer
Edo Collins
A. Tonioni
Luc Van Gool
Federico Tombari
95
12
0
10 Jan 2024
Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects
Tianhang Cheng
Wei-Chiu Ma
Kaiyu Guan
Antonio Torralba
Shenlong Wang
3DV
77
2
0
10 Jan 2024
RadarCam-Depth: Radar-Camera Fusion for Depth Estimation with Learned Metric Scale
Han Li
Yukai Ma
Yaqing Gu
Kewei Hu
Yong-Jin Liu
Xingxing Zuo
MDE
123
13
0
09 Jan 2024
Behavioural Cloning in VizDoom
Ryan Spick
Timothy Bradley
Ayush Raina
P. Amadori
Guy Moss
LM&Ro
54
1
0
08 Jan 2024
NeRFmentation: NeRF-based Augmentation for Monocular Depth Estimation
Casimir Feldmann
Niall Siegenheim
Nikolas Hars
Lovro Rabuzin
Mert Ertugrul
Luca Wolfart
Marc Pollefeys
Z. Bauer
Martin R. Oswald
75
4
0
08 Jan 2024
Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers
Aleksandar Stanić
Sergi Caelles
Michael Tschannen
LRM
VLM
96
10
0
03 Jan 2024
Instruct-Imagen: Image Generation with Multi-modal Instruction
Hexiang Hu
Kelvin C. K. Chan
Yu-Chuan Su
Wenhu Chen
Yandong Li
...
Xue Ben
Boqing Gong
William W. Cohen
Ming-Wei Chang
Xuhui Jia
MLLM
142
50
0
03 Jan 2024
Image Sculpting: Precise Object Editing with 3D Geometry Control
Jiraphon Yenphraphai
Xichen Pan
Sainan Liu
Daniele Panozzo
Saining Xie
87
22
0
02 Jan 2024
NightRain: Nighttime Video Deraining via Adaptive-Rain-Removal and Adaptive-Correction
Beibei Lin
Yeying Jin
Wending Yan
Wei Ye
Yuan Yuan
Shunli Zhang
Robby T. Tan
91
11
0
01 Jan 2024
SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity
Peihao Wang
Zhiwen Fan
Dejia Xu
Dilin Wang
Sreyas Mohan
...
Rakesh Ranjan
Yilei Li
Qiang Liu
Zhangyang Wang
Vikas Chandra
128
21
0
31 Dec 2023
FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis
Feng Liang
Bichen Wu
Jialiang Wang
Licheng Yu
Kunpeng Li
...
Ishan Misra
Jia-Bin Huang
Peizhao Zhang
Peter Vajda
Diana Marculescu
VGen
DiffM
67
35
0
29 Dec 2023
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Jiasen Lu
Christopher Clark
Sangho Lee
Zichen Zhang
Savya Khosla
Ryan Marten
Derek Hoiem
Aniruddha Kembhavi
VLM
MLLM
102
175
0
28 Dec 2023
ConstScene: Dataset and Model for Advancing Robust Semantic Segmentation in Construction Environments
Maghsood Salimi
Mohammad Loni
Sara Afshar
Antonio Cicchetti
Marjan Sirjani
45
2
0
27 Dec 2023
HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D
Sangmin Woo
Byeongjun Park
Hyojun Go
Jin-Young Kim
Changick Kim
96
24
0
26 Dec 2023
Pola4All: survey of polarimetric applications and an open-source toolkit to analyze polarization
Joaquin Rodriguez
L.F.C. Lew-Yan-Voon
Renato Martins
Olivier Morel
63
1
0
22 Dec 2023
DUSt3R: Geometric 3D Vision Made Easy
Shuzhe Wang
Vincent Leroy
Yohann Cabon
Boris Chidlovskii
Jérôme Revaud
3DGS
119
406
0
21 Dec 2023
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk
Lijun Yu
Xiuye Gu
José Lezama
Jonathan Huang
...
Irfan Essa
Huisheng Wang
David A. Ross
Bryan Seybold
Lu Jiang
VGen
155
273
0
21 Dec 2023
ZeroShape: Regression-based Zero-shot Shape Reconstruction
Zixuan Huang
Stefan Stojanov
Anh Thai
Varun Jampani
James M. Rehg
3DV
92
27
0
21 Dec 2023
NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields
Jens Naumann
Binbin Xu
Stefan Leutenegger
Xingxing Zuo
71
17
0
20 Dec 2023
Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model
Saurabh Saxena
Junhwa Hur
Charles Herrmann
Deqing Sun
David J. Fleet
DiffM
99
30
0
20 Dec 2023
SpecNeRF: Gaussian Directional Encoding for Specular Reflections
Li Ma
Vasu Agrawal
Haithem Turki
Changil Kim
Chen Gao
Pedro Sander
Michael Zollhöfer
Christian Richardt
3DGS
92
21
0
20 Dec 2023
BEVSeg2TP: Surround View Camera Bird's-Eye-View Based Joint Vehicle Segmentation and Ego Vehicle Trajectory Prediction
Sushil Sharma
Arindam Das
Ganesh Sistu
M. Halton
Ciarán Eising
57
6
0
20 Dec 2023
The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective
Wenqi Jia
Miao Liu
Hao Jiang
Ishwarya Ananthabhotla
James M. Rehg
V. Ithapu
Ruohan Gao
EgoV
90
8
0
20 Dec 2023
pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction
David Charatan
S. Li
Andrea Tagliasacchi
Vincent Sitzmann
3DGS
139
276
0
19 Dec 2023
Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion
Fan Zhang
Shaodi You
Yu Li
Ying Fu
MDE
137
20
0
19 Dec 2023
All for One, and One for All: UrbanSyn Dataset, the third Musketeer of Synthetic Driving Scenes
J. L. Gómez
Manuel Silva
Antonio Seoane
Agnes Borrás
Mario Noriega
Germán Ros
Jose A. Iglesias-Guitian
Antonio M. López
3DPC
242
13
0
19 Dec 2023
Scene-Conditional 3D Object Stylization and Composition
Jinghao Zhou
Tomas Jakab
Philip Torr
Christian Rupprecht
DiffM
141
3
0
19 Dec 2023
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Zeyinzi Jiang
Chaojie Mao
Yulin Pan
Zhen Han
Jingfeng Zhang
71
30
0
18 Dec 2023