2505.09358
Cited By
Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis
14 May 2025
Bingxin Ke
Kevin Qu
Tianfu Wang
Nando Metzger
Shengyu Huang
Bo Li
Anton Obukhov
Konrad Schindler
DiffM
VLM
Papers citing
"Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis"
50 / 63 papers shown
Title
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He
Haodong Li
Wei Yin
Yixun Liang
Leheng Li
Kaiqiang Zhou
Hongbo Zhang
Bingbing Liu
Ying-Cong Chen
DiffM
VLM
174
54
0
26 Sep 2024
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
Gonzalo Martin Garcia
Karim Abou Zeid
Christian Schmidt
Daan de Geus
Alexander Hermans
Bastian Leibe
124
33
0
17 Sep 2024
SteeredMarigold: Steering Diffusion Towards Depth Completion of Largely Incomplete Depth Maps
Jakub Gregorek
Lazaros Nalpantidis
3DGS
95
4
0
16 Sep 2024
Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions
Fabio Tosi
Pierluigi Zama Ramirez
Matteo Poggi
DiffM
MQ
MDE
68
13
0
23 Jul 2024
PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation
Zhenyu Li
Shariq Farooq Bhat
Peter Wonka
3DV
MDE
67
7
0
10 Jun 2024
Learning Temporally Consistent Video Depth from Video Diffusion Priors
Jiahao Shao
Yuanbo Yang
Hongyu Zhou
Youmin Zhang
Yujun Shen
Vitor Campagnolo Guizilini
Yue Wang
Matteo Poggi
Yiyi Liao
VGen
DiffM
MDE
96
43
0
03 Jun 2024
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation
Mu Hu
Wei Yin
C. Zhang
Zhipeng Cai
Xiaoxiao Long
Kaixuan Wang
Gang Yu
Chunhua Shen
Shaojie Shen
3DGS
265
138
0
22 Mar 2024
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
Xiao Fu
Wei Yin
Mu Hu
Kaixuan Wang
Yuexin Ma
Ping Tan
Shaojie Shen
Dahua Lin
Xiaoxiao Long
DiffM
106
123
0
18 Mar 2024
Rethinking Inductive Biases for Surface Normal Estimation
Gwangbin Bae
Andrew J. Davison
102
52
0
01 Mar 2024
DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control
Yuru Jia
Lukas Hoyer
Shengyu Huang
Tianfu Wang
Luc Van Gool
Konrad Schindler
Anton Obukhov
DiffM
104
23
0
05 Dec 2023
PatchFusion: An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation
Zhenyu Li
Shariq Farooq Bhat
Peter Wonka
MDE
71
24
0
04 Dec 2023
Breathing New Life into 3D Assets with Generative Repainting
Tianfu Wang
Menelaos Kanakis
Konrad Schindler
Luc Van Gool
Anton Obukhov
AI4CE
52
13
0
15 Sep 2023
Unleashing Text-to-Image Diffusion Models for Visual Perception
Wenliang Zhao
Yongming Rao
Zuyan Liu
Benlin Liu
Jie Zhou
Jiwen Lu
ObjD
VLM
MDE
242
233
0
03 Mar 2023
Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
182
4,175
1
10 Feb 2023
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token
Jia Ning
Chen Li
Zheng Zhang
Zigang Geng
Qi Dai
Kun He
Han Hu
114
46
0
05 Jan 2023
InstructPix2Pix: Learning to Follow Image Editing Instructions
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
211
1,830
0
17 Nov 2022
LAION-5B: An open large-scale dataset for training next generation image-text models
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLM
MLLM
CLIP
200
3,500
0
16 Oct 2022
SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models
Omiros Pantazis
Gabriel J. Brostow
Kate E. Jones
Oisin Mac Aodha
VLM
77
41
0
07 Oct 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
279
2,891
0
25 Aug 2022
IRISformer: Dense Vision Transformers for Single-Image Inverse Rendering in Indoor Scenes
Rui Zhu
Zhengqin Li
J. Matai
Fatih Porikli
Manmohan Chandraker
ViT
99
49
0
16 Jun 2022
Physically-Based Editing of Indoor Scene Lighting from a Single Image
Zhengqin Li
Jia Shi
Sai Bi
Rui Zhu
Kalyan Sunkavalli
Miloš Hašan
Zexiang Xu
R. Ramamoorthi
Manmohan Chandraker
3DV
83
58
0
19 May 2022
P3Depth: Monocular Depth Estimation with a Piecewise Planarity Prior
Vaishakh Patil
Daniel Gehrig
Alexander Liniger
Luc Van Gool
MDE
59
130
0
05 Apr 2022
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation
Zhenyu Li
Xuyang Wang
Xianming Liu
Junjun Jiang
MDE
88
196
0
03 Apr 2022
DepthFormer: Exploiting Long-Range Correlation and Local Information for Accurate Monocular Depth Estimation
Zhenyu Li
Zehui Chen
Xianming Liu
Junjun Jiang
ViT
MDE
71
188
1
27 Mar 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Björn Ommer
3DV
496
15,768
0
20 Dec 2021
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
364
3,627
0
20 Dec 2021
Extract Free Dense Labels from CLIP
Chong Zhou
Chen Change Loy
Bo Dai
VLM
CLIP
155
481
0
02 Dec 2021
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling
Renrui Zhang
Rongyao Fang
Wei Zhang
Peng Gao
Kunchang Li
Jifeng Dai
Yu Qiao
Hongsheng Li
VLM
265
402
0
06 Nov 2021
Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans
Ainaz Eftekhar
Alexander Sax
Roman Bachmann
Jitendra Malik
Amir Zamir
MedIm
100
300
0
11 Oct 2021
CLIP-Adapter: Better Vision-Language Models with Feature Adapters
Peng Gao
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Yongfeng Zhang
Hongsheng Li
Yu Qiao
VLM
CLIP
315
1,045
0
09 Oct 2021
Learning Indoor Inverse Rendering with 3D Spatially-Varying Lighting
Zian Wang
Jonah Philion
Sanja Fidler
Jan Kautz
3DV
161
83
0
13 Sep 2021
Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging
S. M. H. Miangoleh
Sebastian Dille
Long Mai
Sylvain Paris
Yagiz Aksoy
MoMe
3DV
MDE
70
187
0
28 May 2021
Vision Transformers for Dense Prediction
René Ranftl
Alexey Bochkovskiy
V. Koltun
ViT
MDE
138
1,746
0
24 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
975
29,871
0
26 Feb 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
418
5,000
0
24 Feb 2021
Learning to Recover 3D Scene Shape from a Single Image
Wei Yin
Jianming Zhang
Oliver Wang
Simon Niklaus
Long Mai
Simon Chen
Chunhua Shen
MDE
109
240
0
17 Dec 2020
AdaBins: Depth Estimation using Adaptive Bins
S. Bhat
Ibraheem Alhashim
Peter Wonka
3DV
MDE
ViT
122
858
0
28 Nov 2020
Score-Based Generative Modeling through Stochastic Differential Equations
Yang Song
Jascha Narain Sohl-Dickstein
Diederik P. Kingma
Abhishek Kumar
Stefano Ermon
Ben Poole
DiffM
SyDa
353
6,586
0
26 Nov 2020
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
292
7,492
0
06 Oct 2020
Bidirectional Attention Network for Monocular Depth Estimation
Shubhra Aich
Jean M. Uwabeza Vianney
Md. Amirul Islam
Mannat Kaur
Bingbing Liu
MDE
72
75
0
01 Sep 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
718
18,364
0
19 Jun 2020
DiverseDepth: Affine-invariant Depth Prediction Using Diverse Data
Wei Yin
Xinlong Wang
Chunhua Shen
Yifan Liu
Zhi Tian
Songcen Xu
Changming Sun
Dou Renyin
3DH
MDE
104
70
0
03 Feb 2020
Virtual KITTI 2
Yohann Cabon
Naila Murray
Martin Humenberger
3DPC
71
288
0
29 Jan 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
547
42,639
0
03 Dec 2019
DIODE: A Dense Indoor and Outdoor DEpth Dataset
Igor Vasiljevic
Nicholas I. Kolkin
Shanyi Zhang
Ruotian Luo
Haochen Wang
...
Andrea F. Daniele
Mohammadreza Mostajabi
Steven Basart
Matthew R. Walter
Gregory Shakhnarovich
MDE
3DV
84
233
0
01 Aug 2019
Generative Modeling by Estimating Gradients of the Data Distribution
Yang Song
Stefano Ermon
SyDa
DiffM
258
3,956
0
12 Jul 2019
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer
René Ranftl
Katrin Lasinger
David Hafner
Konrad Schindler
V. Koltun
MDE
210
1,800
0
02 Jul 2019
Inverse Rendering for Complex Indoor Scenes: Shape, Spatially-Varying Lighting and SVBRDF from a Single Image
Zhengqin Li
Mohammad Shafiei
R. Ramamoorthi
Kalyan Sunkavalli
Manmohan Chandraker
3DV
53
264
0
07 May 2019
FrameNet: Learning Local Canonical Frames of 3D Surfaces from a Single RGB Image
Jingwei Huang
Yichao Zhou
Thomas Funkhouser
Leonidas Guibas
3DV
77
48
0
29 Mar 2019
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
617
10,590
0
12 Dec 2018