ResearchTrend.AI
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer

2 July 2019
René Ranftl
Katrin Lasinger
David Hafner
Konrad Schindler
V. Koltun
    MDE

Papers citing "Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer"

50 / 1,081 papers shown
Transformers in Unsupervised Structure-from-Motion
Hemang Chawla
Arnav Varma
Elahe Arani
Bahram Zonooz
ViT
55
1
0
16 Dec 2023
DiffusionLight: Light Probes for Free by Painting a Chrome Ball
Pakkapon Phongthawee
Worameth Chinchuthakun
Nontaphat Sinsunthithet
Amit Raj
Varun Jampani
Pramook Khungurn
Supasorn Suwajanakorn
DiffM
107
27
0
14 Dec 2023
NViST: In the Wild New View Synthesis from a Single Image with Transformers
Wonbong Jang
Lourdes Agapito
ViT
89
10
0
13 Dec 2023
SceneWiz3D: Towards Text-guided 3D Scene Composition
Qihang Zhang
Chaoyang Wang
Aliaksandr Siarohin
Peiye Zhuang
Yinghao Xu
Ceyuan Yang
Dahua Lin
Bolei Zhou
Sergey Tulyakov
Hsin-Ying Lee
91
34
0
13 Dec 2023
FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition
Sicheng Mo
Fangzhou Mu
Kuan Heng Lin
Yanli Liu
Bochen Guan
Yin Li
Bolei Zhou
DiffM
107
67
0
12 Dec 2023
CCM: Adding Conditional Controls to Text-to-Image Consistency Models
Jie Xiao
Kai Zhu
Han Zhang
Zhiheng Liu
Yujun Shen
Yu Liu
Xueyang Fu
Zheng-Jun Zha
DiffM
74
11
0
12 Dec 2023
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Oğuzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
99
75
0
11 Dec 2023
Mitigating Perspective Distortion-induced Shape Ambiguity in Image Crops
Aditya Prakash
Arjun Gupta
Saurabh Gupta
64
3
0
11 Dec 2023
ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models
Denis Zavadski
Johann-Friedrich Feiden
Carsten Rother
DiffM
81
8
0
11 Dec 2023
GenDepth: Generalizing Monocular Depth Estimation for Arbitrary Camera Parameters via Ground Plane Embedding
Karlo Koledić
Luka V. Petrović
Ivan Petrović
Ivan Marković
MDE
111
1
0
10 Dec 2023
SuperPrimitive: Scene Reconstruction at a Primitive Level
Kirill Mazur
Gwangbin Bae
Andrew J. Davison
3DH
67
3
0
10 Dec 2023
ControlRoom3D: Room Generation using Semantic Proxy Rooms
Jonas Schult
Sam S. Tsai
Lukas Höllein
Bichen Wu
Jialiang Wang
...
Zijian He
Peizhao Zhang
Bastian Leibe
Peter Vajda
Ji Hou
88
34
0
08 Dec 2023
Fine Dense Alignment of Image Bursts through Camera Pose and Depth Estimation
Bruno Lecouat
Yann Dubois de Mont-Marin
Théo Bodrito
Julien Mairal
Jean Ponce
80
0
0
08 Dec 2023
ConVRT: Consistent Video Restoration Through Turbulence with Test-time Optimization of Neural Video Representations
Haoming Cai
Jingxi Chen
Brandon Yushan Feng
Weiyun Jiang
Mingyang Xie
Kevin Zhang
Ashok Veeraraghavan
Christopher A. Metzler
119
0
0
07 Dec 2023
WonderJourney: Going from Anywhere to Everywhere
Hong-Xing Yu
Haoyi Duan
Junhwa Hur
Kyle Sargent
Michael Rubinstein
...
Forrester Cole
Deqing Sun
Noah Snavely
Jiajun Wu
Charles Herrmann
VGen
113
57
0
06 Dec 2023
Intrinsic Harmonization for Illumination-Aware Compositing
Chris Careaga
S. M. H. Miangoleh
Yagiz Aksoy
87
8
0
06 Dec 2023
Kandinsky 3.0 Technical Report
V.Ya. Arkhipkin
Andrei Filatov
Viacheslav Vasilev
Anastasia Maltseva
Said Azizov
Igor Pavlov
Julia Agafonova
Andrey Kuznetsov
Denis Dimitrov
DiffM
110
13
0
06 Dec 2023
PatchFusion: An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation
Zhenyu Li
Shariq Farooq Bhat
Peter Wonka
MDE
87
25
0
04 Dec 2023
Readout Guidance: Learning Control from Diffusion Features
Grace Luo
Trevor Darrell
Oliver Wang
Dan B. Goldman
Aleksander Holynski
103
27
0
04 Dec 2023
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Bingxin Ke
Anton Obukhov
Shengyu Huang
Nando Metzger
Rodrigo Caye Daudt
Konrad Schindler
VLM, MDE
162
173
0
04 Dec 2023
IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Jiarui Xu
Yossi Gandelsman
Amir Bar
Jianwei Yang
Jianfeng Gao
Trevor Darrell
Xiaolong Wang
VLM
58
3
0
04 Dec 2023
Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models
Shengqu Cai
Duygu Ceylan
Matheus Gadelha
C. Huang
Tuanfeng Y. Wang
Gordon Wetzstein
VGen
104
18
0
03 Dec 2023
ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models
Jeong-gi Kwak
Erqun Dong
Yuhe Jin
Hanseok Ko
Shweta Mahajan
Kwang Moo Yi
DiffM, VGen
105
41
0
03 Dec 2023
AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing
Fan Yang
Tianyi Chen
Xiaosheng He
Zhongang Cai
Lei Yang
Si Wu
Guosheng Lin
86
9
0
03 Dec 2023
Meta ControlNet: Enhancing Task Adaptation via Meta Learning
Junjie Yang
Jinze Zhao
Peihao Wang
Zhangyang Wang
Yingbin Liang
130
3
0
03 Dec 2023
Enhancing Diffusion Models with 3D Perspective Geometry Constraints
Rishi Upadhyay
Howard Zhang
Yunhao Ba
Ethan Yang
Blake Gella
Sicheng Jiang
Alex Wong
A. Kadambi
90
11
0
01 Dec 2023
DeepDR: Deep Structure-Aware RGB-D Inpainting for Diminished Reality
Christina Schwarz-Gsaxner
Shohei Mori
Dieter Schmalstieg
Jan Egger
Gerhard Paar
Werner Bailer
Denis Kalkofen
66
5
0
01 Dec 2023
Exploiting Diffusion Prior for Generalizable Dense Prediction
Hsin-Ying Lee
Hung-Yu Tseng
Hsin-Ying Lee
Ming-Hsuan Yang
DiffM, MDE
102
23
0
30 Nov 2023
ZeST-NeRF: Using temporal aggregation for Zero-Shot Temporal NeRFs
Violeta Menéndez González
Andrew Gilbert
Graeme Phillipson
Stephen Jolly
Simon Hadfield
71
0
0
30 Nov 2023
HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting
Wenquan Lu
Yufei Xu
Jing Zhang
Chaoyue Wang
Dacheng Tao
DiffM
80
28
0
29 Nov 2023
HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting
Xian Liu
Xiaohang Zhan
Jiaxiang Tang
Ying Shan
Gang Zeng
Dahua Lin
Xihui Liu
Ziwei Liu
3DGS
130
77
0
28 Nov 2023
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
Yuwei Guo
Ceyuan Yang
Anyi Rao
Maneesh Agrawala
Dahua Lin
Bo Dai
DiffM, VGen
97
126
0
28 Nov 2023
RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D
Lingteng Qiu
Guanying Chen
Xiaodong Gu
Qi Zuo
Mutian Xu
Yushuang Wu
Weihao Yuan
Zilong Dong
Liefeng Bo
Xiaoguang Han
89
123
0
28 Nov 2023
Neural Style Transfer for Computer Games
E. Ioannou
Steve Maddock
98
2
0
24 Nov 2023
MonoNav: MAV Navigation via Monocular Depth Estimation and Reconstruction
Nathaniel Simon
Anirudha Majumdar
69
3
0
23 Nov 2023
WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space
Katja Schwarz
Seung Wook Kim
Jun Gao
Sanja Fidler
Andreas Geiger
Karsten Kreis
108
6
0
22 Nov 2023
Intrinsic Image Decomposition via Ordinal Shading
Chris Careaga
Yagiz Aksoy
131
29
0
21 Nov 2023
Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge
Bowen Jiang
Zhijun Zhuang
Shreyas S. Shivakumar
Camillo J Taylor
104
8
0
21 Nov 2023
SOccDPT: Semi-Supervised 3D Semantic Occupancy from Dense Prediction Transformers trained under memory constraints
Aditya Nalgunda Ganesh
ViT
57
1
0
19 Nov 2023
MoVideo: Motion-Aware Video Generation with Diffusion Models
Christos Sakaridis
Yuchen Fan
Kai Zhang
Radu Timofte
Luc Van Gool
Rakesh Ranjan
DiffM, VGen
85
10
0
19 Nov 2023
Match and Locate: low-frequency monocular odometry based on deep feature matching
S. Konev
Yuriy Biktairov
31
0
0
16 Nov 2023
MetaDreamer: Efficient Text-to-3D Creation With Disentangling Geometry and Texture
Lincong Feng
Muyu Wang
Maoyu Wang
Kuo Xu
Xiaoli Liu
51
4
0
16 Nov 2023
CD-COCO: A Versatile Complex Distorted COCO Database for Scene-Context-Aware Computer Vision
Ayman Beghdadi
Azeddine Beghdadi
Malik Mallem
Lotfi Beji
F. A. Cheikh
51
3
0
12 Nov 2023
GOAT: GO to Any Thing
Matthew Chang
Théophile Gervet
Mukul Khanna
Sriram Yenamandra
Dhruv Shah
...
Saurabh Gupta
Dhruv Batra
Roozbeh Mottaghi
Jitendra Malik
Devendra Singh Chaplot
124
74
0
10 Nov 2023
Analyzing Modular Approaches for Visual Question Decomposition
Apoorv Khandelwal
Ellie Pavlick
Chen Sun
84
4
0
10 Nov 2023
PolyMaX: General Dense Prediction with Mask Transformer
Xuan S. Yang
Liangzhe Yuan
Kimberly Wilber
Astuti Sharma
Xiuye Gu
...
Stephanie Debats
Huisheng Wang
Hartwig Adam
Mikhail Sirotenko
Liang-Chieh Chen
105
15
0
09 Nov 2023
ConRad: Image Constrained Radiance Fields for 3D Generation from a Single Image
Senthil Purushwalkam
Nikhil Naik
80
5
0
09 Nov 2023
DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets
Yash Jain
Harkirat Singh Behl
Z. Kira
Vibhav Vineet
69
15
0
08 Nov 2023
GLaMM: Pixel Grounding Large Multimodal Model
H. Rasheed
Muhammad Maaz
Sahal Shaji Mullappilly
Abdelrahman M. Shaker
Salman Khan
Hisham Cholakkal
Rao M. Anwer
Eric Xing
Ming-Hsuan Yang
Fahad S. Khan
MLLM, VLM
157
239
0
06 Nov 2023
Get the Ball Rolling: Alerting Autonomous Robots When to Help to Close the Healthcare Loop
Jiaxin Shen
Yanyao Liu
Ziming Wang
Ziyuan Jiao
Yufeng Chen
Wenjuan Han
39
0
0
05 Nov 2023