ResearchTrend.AI
Adding Conditional Control to Text-to-Image Diffusion Models

10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
    AI4CE

Papers citing "Adding Conditional Control to Text-to-Image Diffusion Models"

50 / 367 papers shown
Title
Vision Transformer Adapter for Dense Predictions
Zhe Chen
Yuchen Duan
Wenhai Wang
Junjun He
Tong Lu
Jifeng Dai
Yu Qiao
129
564
0
17 May 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM, DiffM
407
6,866
0
13 Apr 2022
Exploring Plain Vision Transformer Backbones for Object Detection
Yanghao Li
Hanzi Mao
Ross B. Girshick
Kaiming He
ViT
90
808
0
30 Mar 2022
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
Oran Gafni
Adam Polyak
Oron Ashual
Shelly Sheynin
Devi Parikh
Yaniv Taigman
DiffM
79
521
0
24 Mar 2022
Self-Distilled StyleGAN: Towards Generation from Internet Photos
Ron Mokady
Michal Yarom
Omer Tov
Oran Lang
Daniel Cohen-Or
Tali Dekel
Michal Irani
Inbar Mosseri
62
45
0
24 Feb 2022
Multi-level Latent Space Structuring for Generative Control
Oren Katzir
Vicky Perepelook
Dani Lischinski
Daniel Cohen-Or
102
4
0
11 Feb 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
Steven Hoi
MLLM, BDL, VLM, CLIP
542
4,360
0
28 Jan 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
460
15,665
0
20 Dec 2021
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
358
3,605
0
20 Dec 2021
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks
Yi-Lin Sung
Jaemin Cho
Mohit Bansal
VLM, VPVLM
112
354
0
13 Dec 2021
Multimodal Conditional Image Synthesis with Product-of-Experts GANs
Xun Huang
Arun Mallya
Ting-Chun Wang
Ming-Yu Liu
DiffM
81
90
0
09 Dec 2021
HyperInverter: Improving StyleGAN Inversion via Hypernetwork
Tan M. Dinh
Anh Tran
Rang Nguyen
Binh-Son Hua
66
119
0
01 Dec 2021
HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing
Yuval Alaluf
Omer Tov
Ron Mokady
Rinon Gal
Amit H. Bermano
110
274
0
30 Nov 2021
Blended Diffusion for Text-driven Editing of Natural Images
Omri Avrahami
Dani Lischinski
Ohad Fried
DiffM
121
947
0
29 Nov 2021
Benchmarking Detection Transfer Learning with Vision Transformers
Yanghao Li
Saining Xie
Xinlei Chen
Piotr Dollar
Kaiming He
Ross B. Girshick
72
168
0
22 Nov 2021
Palette: Image-to-Image Diffusion Models
Chitwan Saharia
William Chan
Huiwen Chang
Chris A. Lee
Jonathan Ho
Tim Salimans
David J. Fleet
Mohammad Norouzi
DiffM, VLM
484
1,640
0
10 Nov 2021
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling
Renrui Zhang
Rongyao Fang
Wei Zhang
Peng Gao
Kunchang Li
Jifeng Dai
Yu Qiao
Hongsheng Li
VLM
265
400
0
06 Nov 2021
CLIP-Adapter: Better Vision-Language Models with Feature Adapters
Peng Gao
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Yongfeng Zhang
Hongsheng Li
Yu Qiao
VLM, CLIP
299
1,042
0
09 Oct 2021
DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation
Gwanghyun Kim
Taesung Kwon
Jong Chul Ye
DiffM
198
650
0
06 Oct 2021
SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations
Chenlin Meng
Yutong He
Yang Song
Jiaming Song
Jiajun Wu
Jun-Yan Zhu
Stefano Ermon
DiffM
144
1,492
0
02 Aug 2021
Variational Diffusion Models
Diederik P. Kingma
Tim Salimans
Ben Poole
Jonathan Ho
DiffM
181
1,130
0
01 Jul 2021
LoRA: Low-Rank Adaptation of Large Language Models
Edward J. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL, AI4TS, AI4CE, ALM, AIMat
477
10,367
0
17 Jun 2021
Towards Light-weight and Real-time Line Segment Detection
Geonmo Gu
ByungSoo Ko
SeoungHyun Go
Sung-Hyun Lee
Jingeun Lee
Minchul Shin
3DGS
39
64
0
01 Jun 2021
On Fast Sampling of Diffusion Probabilistic Models
Zhifeng Kong
Wei Ping
DiffM
78
198
0
31 May 2021
Cascaded Diffusion Models for High Fidelity Image Generation
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans
161
1,222
0
30 May 2021
M6-UFC: Unifying Multi-Modal Controls for Conditional Image Synthesis via Non-Autoregressive Generative Transformers
Zhu Zhang
Jianxin Ma
Chang Zhou
Rui Men
Zhikang Li
Ming Ding
Jie Tang
Jingren Zhou
Hongxia Yang
74
46
0
29 May 2021
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
232
7,857
0
11 May 2021
Noise Estimation for Generative Diffusion Models
Robin San-Roman
Eliya Nachmani
Lior Wolf
DiffM
91
107
0
06 Apr 2021
StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery
Or Patashnik
Zongze Wu
Eli Shechtman
Daniel Cohen-Or
Dani Lischinski
CLIP, VLM
120
1,207
0
31 Mar 2021
Personalized Federated Learning using Hypernetworks
Aviv Shamsian
Aviv Navon
Ethan Fetaya
Gal Chechik
FedML
118
334
0
08 Mar 2021
OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association
S. Kreiss
Lorenzo Bertoni
Alexandre Alahi
98
129
0
03 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP, VLM
955
29,436
0
26 Feb 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
418
4,953
0
24 Feb 2021
Improved Denoising Diffusion Probabilistic Models
Alex Nichol
Prafulla Dhariwal
DiffM
349
3,702
0
18 Feb 2021
Only a Matter of Style: Age Transformation Using a Style-Based Regression Model
Yuval Alaluf
Or Patashnik
Daniel Cohen-Or
54
140
0
04 Feb 2021
Multimodal Variational Autoencoders for Semi-Supervised Learning: In Defense of Product-of-Experts
S. Kutuzova
Oswin Krause
D. McCloskey
Mads Nielsen
Christian Igel
60
18
0
18 Jan 2021
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning
Armen Aghajanyan
Luke Zettlemoyer
Sonal Gupta
101
568
1
22 Dec 2020
Taming Transformers for High-Resolution Image Synthesis
Patrick Esser
Robin Rombach
Bjorn Ommer
ViT
129
2,962
0
17 Dec 2020
Pre-Trained Image Processing Transformer
Hanting Chen
Yunhe Wang
Tianyu Guo
Chang Xu
Yiping Deng
Zhenhua Liu
Siwei Ma
Chunjing Xu
Chao Xu
Wen Gao
VLM, ViT
143
1,676
0
01 Dec 2020
Score-Based Generative Modeling through Stochastic Differential Equations
Yang Song
Jascha Narain Sohl-Dickstein
Diederik P. Kingma
Abhishek Kumar
Stefano Ermon
Ben Poole
DiffM, SyDa
344
6,480
0
26 Nov 2020
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM, DiffM
283
7,384
0
06 Oct 2020
Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation
Elad Richardson
Yuval Alaluf
Or Patashnik
Yotam Nitzan
Yaniv Azar
Stav Shapiro
Daniel Cohen-Or
138
1,108
0
03 Aug 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
648
18,096
0
19 Jun 2020
Cross-domain Correspondence Learning for Exemplar-based Image Translation
Pan Zhang
Bo Zhang
Dong Chen
Lu Yuan
Fang Wen
77
239
0
12 Apr 2020
Side-Tuning: A Baseline for Network Adaptation via Additive Side Networks
Jeffrey O. Zhang
Alexander Sax
Amir Zamir
Leonidas Guibas
Jitendra Malik
62
28
0
31 Dec 2019
DIODE: A Dense Indoor and Outdoor DEpth Dataset
Igor Vasiljevic
Nicholas I. Kolkin
Shanyi Zhang
Ruotian Luo
Haochen Wang
...
Andrea F. Daniele
Mohammadreza Mostajabi
Steven Basart
Matthew R. Walter
Gregory Shakhnarovich
MDE, 3DV
77
233
0
01 Aug 2019
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer
René Ranftl
Katrin Lasinger
David Hafner
Konrad Schindler
V. Koltun
MDE
204
1,793
0
02 Jul 2019
Semantic Image Synthesis with Spatially-Adaptive Normalization
Taesung Park
Ming-Yu Liu
Ting-Chun Wang
Jun-Yan Zhu
156
2,688
0
18 Mar 2019
Parameter-Efficient Transfer Learning for NLP
N. Houlsby
A. Giurgiu
Stanislaw Jastrzebski
Bruna Morrone
Quentin de Laroussilhe
Andrea Gesmundo
Mona Attariyan
Sylvain Gelly
210
4,460
0
02 Feb 2019
OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
Zhe Cao
Gines Hidalgo
Tomas Simon
S. Wei
Yaser Sheikh
3DH, CVBM
124
4,592
0
18 Dec 2018