Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.05543
Cited By
v1
v2
v3 (latest)
Adding Conditional Control to Text-to-Image Diffusion Models
10 February 2023
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adding Conditional Control to Text-to-Image Diffusion Models"
50 / 3,090 papers shown
Title
SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis
Hanrong Ye
Jason Kuen
Qing Liu
Zhe Lin
Brian L. Price
Dan Xu
VLM
130
12
0
06 Nov 2023
Cross-Image Attention for Zero-Shot Appearance Transfer
Yuval Alaluf
Daniel Garibi
Or Patashnik
Hadar Averbuch-Elor
Daniel Cohen-Or
DiffM
96
80
0
06 Nov 2023
Exploring the Capability of Text-to-Image Diffusion Models with Structural Edge Guidance for Multi-Spectral Satellite Image Inpainting
Mikolaj Czerkawski
Christos Tachtatzis
DiffM
72
9
0
06 Nov 2023
InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image
Jianhui Li
Shilong Liu
Zidong Liu
Yikai Wang
Kaiwen Zheng
Jinghui Xu
Jianmin Li
Jun Zhu
86
10
0
06 Nov 2023
Domain Transfer in Latent Space (DTLS) Wins on Image Super-Resolution -- a Non-Denoising Model
C. Hui
W. Siu
N. Law
DiffM
64
1
0
04 Nov 2023
Stable Diffusion Reference Only: Image Prompt and Blueprint Jointly Guided Multi-Condition Diffusion Model for Secondary Painting
Hao Ai
Lu Sheng
DiffM
40
3
0
04 Nov 2023
inkn'hue: Enhancing Manga Colorization from Multiple Priors with Alignment Multi-Encoder VAE
Tawin Jiramahapokee
69
2
0
03 Nov 2023
The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing
Shen Nie
Hanzhong Guo
Cheng Lu
Yuhao Zhou
Chenyu Zheng
Chongxuan Li
DiffM
134
43
0
02 Nov 2023
Gaussian Mixture Solvers for Diffusion Models
Hanzhong Guo
Cheng Lu
Fan Bao
Tianyu Pang
Shuicheng Yan
Chao Du
Chongxuan Li
72
11
0
02 Nov 2023
Controllable Music Production with Diffusion Models and Guidance Gradients
Mark Levy
Bruno Di Giorgi
Floris Weers
Angelos Katharopoulos
Tom Nickson
DiffM
119
23
0
01 Nov 2023
Intriguing Properties of Data Attribution on Diffusion Models
Xiaosen Zheng
Tianyu Pang
Chao Du
Jing Jiang
Min Lin
TDI
131
26
1
01 Nov 2023
Consistent Video-to-Video Transfer Using Synthetic Dataset
Jiaxin Cheng
Tianjun Xiao
Tong He
VGen
DiffM
112
17
0
01 Nov 2023
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
Xinyuan Chen
Yaohui Wang
Lingjun Zhang
Shaobin Zhuang
Xin Ma
Jiashuo Yu
Yali Wang
Dahua Lin
Yu Qiao
Ziwei Liu
VGen
DiffM
79
146
0
31 Oct 2023
CustomNet: Zero-shot Object Customization with Variable-Viewpoints in Text-to-Image Diffusion Models
Ziyang Yuan
Mingdeng Cao
Xintao Wang
Zhongang Qi
Chun Yuan
Ying Shan
DiffM
102
24
0
30 Oct 2023
SeamlessNeRF: Stitching Part NeRFs with Gradient Propagation
Bingchen Gong
Yuehao Wang
Xiaoguang Han
Qi Dou
84
3
0
30 Oct 2023
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Haoxin Chen
Menghan Xia
Yin-Yin He
Yong Zhang
Xiaodong Cun
...
Yaofang Liu
Qifeng Chen
Xintao Wang
Chao-Liang Weng
Ying Shan
DiffM
99
314
0
30 Oct 2023
Text-to-3D with Classifier Score Distillation
Xin Yu
Yuanchen Guo
Yangguang Li
Ding Liang
Song-Hai Zhang
Xiaojuan Qi
DiffM
114
87
0
30 Oct 2023
Sound of Story: Multi-modal Storytelling with Audio
Jaeyeon Bae
Seokhoon Jeong
Seokun Kang
Namgi Han
Jae-Yon Lee
Hyounghun Kim
Taehwan Kim
57
4
0
30 Oct 2023
Gen2Sim: Scaling up Robot Learning in Simulation with Generative Models
Pushkal Katara
Zhou Xian
Katerina Fragkiadaki
LM&Ro
126
44
0
27 Oct 2023
Entity Embeddings : Perspectives Towards an Omni-Modality Era for Large Language Models
Eren Unlu
Unver Ciftci
64
0
0
27 Oct 2023
AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors
You-Ming Chang
Chen Yeh
Wei-Chen Chiu
Ning Yu
VPVLM
VLM
154
30
0
26 Oct 2023
Semantic Generative Augmentations for Few-Shot Counting
Perla Doubinsky
Nicolas Audebert
M. Crucianu
Hervé Le Borgne
VLM
DiffM
88
4
0
26 Oct 2023
Generating by Understanding: Neural Visual Generation with Logical Symbol Groundings
Yifei Peng
Yu Jin
Yu Jin
Zhexu Luo
Wang-Zhou Dai
Zhong Ren
Kun Zhou
Kun Zhou
GAN
NAI
64
0
0
26 Oct 2023
Dolfin: Diffusion Layout Transformers without Autoencoder
Yilin Wang
Zeyuan Chen
Liangjun Zhong
Zheng Ding
Zhizhou Sha
Zhuowen Tu
114
17
0
25 Oct 2023
CVPR 2023 Text Guided Video Editing Competition
Jay Zhangjie Wu
Xiuyu Li
Difei Gao
Zhen Dong
Jinbin Bai
...
Xu Cheng
Jie Tang
Mike Zheng Shou
Kurt Keutzer
Forrest N. Iandola
100
35
0
24 Oct 2023
Integrating View Conditions for Image Synthesis
Jinbin Bai
Zhen Dong
Aosong Feng
Xiao Zhang
Tian-Chun Ye
Kaicheng Zhou
120
15
0
24 Oct 2023
Language-driven Scene Synthesis using Multi-conditional Diffusion Model
An Vuong
M. Vu
T. Nguyen
Baoru Huang
Dzung Nguyen
T. Vo
Anh Nguyen
DiffM
97
9
0
24 Oct 2023
FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models
Lihe Yang
Xiaogang Xu
Bingyi Kang
Yinghuan Shi
Hengshuang Zhao
93
46
0
23 Oct 2023
Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model
Ruoxi Shi
Hansheng Chen
Zhuoyang Zhang
Minghua Liu
Chao Xu
Xinyue Wei
Linghao Chen
Chong Zeng
Hao Su
VLM
85
373
0
23 Oct 2023
S3Aug: Segmentation, Sampling, and Shift for Action Recognition
Taiki Sugiura
Toru Tamaki
AI4TS
67
2
0
23 Oct 2023
A comprehensive survey on deep active learning in medical image analysis
Haoran Wang
Q. Jin
Shiman Li
Siyu Liu
Manning Wang
Zhijian Song
VLM
128
31
0
22 Oct 2023
TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models
Tianshi Cao
Karsten Kreis
Sanja Fidler
Nicholas Sharp
Kangxue Yin
94
79
0
20 Oct 2023
Learning Interatomic Potentials at Multiple Scales
Xiang Fu
Albert Musaelian
Anders Johansson
Tommi Jaakkola
Boris Kozinsky
81
2
0
20 Oct 2023
WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models
Jun-Yan He
Zhi-Qi Cheng
Chenyang Li
Jingdong Sun
Wangmeng Xiang
...
Yusen Hu
Bin Luo
Yifeng Geng
Xuansong Xie
Jingren Zhou
64
13
0
20 Oct 2023
CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation
Sihan Xu
Ziqiao Ma
Yidong Huang
Honglak Lee
Joyce Chai
DiffM
112
24
0
19 Oct 2023
DreamSpace: Dreaming Your Room Space with Text-Driven Panoramic Texture Propagation
Bangbang Yang
Wenqi Dong
Lin Ma
Wenbo Hu
Xiao Liu
Zhaopeng Cui
Yuewen Ma
DiffM
51
19
0
19 Oct 2023
Eureka-Moments in Transformers: Multi-Step Tasks Reveal Softmax Induced Optimization Problems
David T. Hoffmann
Simon Schrodi
Jelena Bratulić
Nadine Behrmann
Volker Fischer
Thomas Brox
116
8
0
19 Oct 2023
EMIT-Diff: Enhancing Medical Image Segmentation via Text-Guided Diffusion Model
Zheyu Zhang
Lanhong Yao
Bin Wang
Debesh Jha
Elif Keles
Alpay Medetalibeyoglu
Ulas Bagci
MedIm
90
15
0
19 Oct 2023
Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping
Zijie Pan
Jiachen Lu
Xiatian Zhu
Li Zhang
DiffM
87
11
0
19 Oct 2023
Closed-Form Diffusion Models
Christopher Scarvelis
Haitz Sáez de Ocáriz Borde
Justin Solomon
DiffM
189
12
0
19 Oct 2023
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Jinbo Xing
Menghan Xia
Yong Zhang
Haoxin Chen
Wangbo Yu
Hanyuan Liu
Xintao Wang
Tien-Tsin Wong
Ying Shan
VGen
125
256
0
18 Oct 2023
To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now
Yimeng Zhang
Jinghan Jia
Xin Chen
Aochuan Chen
Yihua Zhang
Jiancheng Liu
Ke Ding
Sijia Liu
DiffM
177
101
0
18 Oct 2023
Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts
Xinhua Cheng
Tianyu Yang
Jianan Wang
Yu Li
Lei Zhang
Jian Zhang
Li-ming Yuan
DiffM
90
43
0
18 Oct 2023
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation
Ruiqi Wu
Liangyu Chen
Tong Yang
Chunle Guo
Chongyi Li
Xiangyu Zhang
DiffM
VGen
166
53
0
16 Oct 2023
BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys
Yu Gu
Jianwei Yang
Naoto Usuyama
Chun-yue Li
Sheng Zhang
M. Lungren
Jianfeng Gao
Hoifung Poon
MedIm
111
24
0
16 Oct 2023
A Survey on Video Diffusion Models
Zhen Xing
Qijun Feng
Haoran Chen
Qi Dai
Hang-Rui Hu
Hang Xu
Zuxuan Wu
Yu-Gang Jiang
EGVM
VGen
179
139
0
16 Oct 2023
TOSS:High-quality Text-guided Novel View Synthesis from a Single Image
Yukai Shi
Jianan Wang
He Cao
Boshi Tang
Xianbiao Qi
Tianyu Yang
Yukun Huang
Shilong Liu
Lei Zhang
H. Shum
DiffM
66
20
0
16 Oct 2023
DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing
Jia-Wei Liu
Yan-Pei Cao
Jay Zhangjie Wu
Weijia Mao
Yuchao Gu
Rui Zhao
Jussi Keppo
Ying Shan
Mike Zheng Shou
VGen
DiffM
106
17
0
16 Oct 2023
ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion
Jiayu Yang
Ziang Cheng
Yunfei Duan
Pan Ji
Hongdong Li
DiffM
88
58
0
16 Oct 2023
Scene Graph Conditioning in Latent Diffusion
Frank Fundel
DiffM
59
0
0
16 Oct 2023
Previous
1
2
3
...
51
52
53
...
60
61
62
Next