2505.19114
Cited By
CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design
25 May 2025
H. Zhang
Dexiang Hong
Maoke Yang
Yutao Chen
Zhao Zhang
Jie Shao
Xinglong Wu
Zuxuan Wu
Yu Jiang
DiffM
AI4CE
Papers citing "CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design"
48 / 98 papers shown
Title
RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
Mengqi Huang
Zhendong Mao
Mingcong Liu
Qian He
Yongdong Zhang
DiffM
63
25
0
01 Mar 2024
Transparent Image Layer Diffusion using Latent Transparency
Lvmin Zhang
Maneesh Agrawala
53
46
0
27 Feb 2024
MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis
Dewei Zhou
You Li
Fan Ma
Zongxin Yang
Yi Yang
DiffM
43
59
0
08 Feb 2024
InstanceDiffusion: Instance-level Control for Image Generation
Xudong Wang
Trevor Darrell
Sai Saketh Rambhatla
Rohit Girdhar
Ishan Misra
VLM
DiffM
42
91
0
05 Feb 2024
Instruct-Imagen: Image Generation with Multi-modal Instruction
Hexiang Hu
Kelvin C. K. Chan
Yu-Chuan Su
Wenhu Chen
Yandong Li
...
Xue Ben
Boqing Gong
William W. Cohen
Ming-Wei Chang
Xuhui Jia
MLLM
92
45
0
03 Jan 2024
Generative Multimodal Models are In-Context Learners
Quan-Sen Sun
Yufeng Cui
Xiaosong Zhang
Fan Zhang
Qiying Yu
...
Yueze Wang
Yongming Rao
Jingjing Liu
Tiejun Huang
Xinlong Wang
MLLM
LRM
112
265
0
20 Dec 2023
InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models
Jiun Tian Hoe
Xudong Jiang
Chee Seng Chan
Yap-Peng Tan
Weipeng Hu
60
13
0
10 Dec 2023
X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
L. Ran
Xiaodong Cun
Jia-Wei Liu
Rui Zhao
Song Zijie
Xintao Wang
Jussi Keppo
Mike Zheng Shou
55
12
0
04 Dec 2023
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Yutong Feng
Biao Gong
Di Chen
Yujun Shen
Yu Liu
Jingren Zhou
DiffM
56
47
0
28 Nov 2023
COLE: A Hierarchical Generation Framework for Multi-Layered and Editable Graphic Design
Peidong Jia
Chenxuan Li
Yuhui Yuan
Zeyu Liu
Yichao Shen
...
Dong Chen
Ji Li
Xiaodong Xie
Shanghang Zhang
Baining Guo
45
7
0
28 Nov 2023
Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis
Xiaohui Chen
Yongfei Liu
Yingxiang Yang
Jianbo Yuan
Quanzeng You
Liping Liu
Hongxia Yang
DiffM
65
12
0
28 Nov 2023
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
DiffM
60
66
0
28 Nov 2023
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation
Biao Gong
Siteng Huang
Yutong Feng
Shiwei Zhang
Yuyuan Li
Yu Liu
DiffM
67
13
0
27 Nov 2023
LayoutPrompter: Awaken the Design Ability of Large Language Models
Jiawei Lin
Jiaqi Guo
Shizhao Sun
Z. Yang
Jian-Guang Lou
Dongmei Zhang
VLM
44
23
0
11 Nov 2023
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Junsong Chen
Jincheng Yu
Chongjian Ge
Lewei Yao
Enze Xie
...
Zhongdao Wang
James T. Kwok
Ping Luo
Huchuan Lu
Zhenguo Li
DiffM
70
414
0
30 Sep 2023
SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation
Chengyou Jia
Minnan Luo
Zhuohang Dang
Guangwen Dai
Xiaojun Chang
Mengmeng Wang
Jingdong Wang
DiffM
71
14
0
20 Aug 2023
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
Hu Ye
Jun Zhang
Siyi Liu
Xiao Han
Wei Yang
DiffM
67
765
0
13 Aug 2023
LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Binbin Yang
Yinzheng Luo
Ziliang Chen
Guangrun Wang
Xiaodan Liang
Liang Lin
DiffM
57
14
0
13 Aug 2023
TextPainter: Multimodal Text Image Generation with Visual-harmony and Text-comprehension for Poster Design
Yifan Gao
Jinpeng Lin
Min Zhou
Chuanbin Liu
Hongtao Xie
T. Ge
Yuning Jiang
39
14
0
09 Aug 2023
AutoPoster: A Highly Automatic and Content-aware Design System for Advertising Poster Generation
Jinpeng Lin
Min Zhou
Ye Ma
Yifan Gao
Chenxi Fei
Yang Chen
Zhang Yu
T. Ge
13
26
0
02 Aug 2023
AnyDoor: Zero-shot Object-level Image Customization
Xi Chen
Lianghua Huang
Yu Liu
Yujun Shen
Deli Zhao
Hengshuang Zhao
DiffM
94
267
0
18 Jul 2023
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
166
2,242
0
04 Jul 2023
Grounded Text-to-Image Synthesis with Attention Refocusing
Quynh Phung
Songwei Ge
Jia-Bin Huang
DiffM
46
108
0
08 Jun 2023
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Shihao Zhao
Dongdong Chen
Yen-Chun Chen
Jianmin Bao
Shaozhe Hao
Lu Yuan
Kwan-Yee K. Wong
90
252
0
25 May 2023
DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models
Sungnyun Kim
Junsoo Lee
Kibeom Hong
Daesik Kim
Namhyuk Ahn
DiffM
50
15
0
24 May 2023
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
Can Qin
Shu Zhen Zhang
Ning Yu
Yihao Feng
Xinyi Yang
...
Caiming Xiong
Silvio Savarese
Stefano Ermon
Yun Fu
Ran Xu
71
128
0
18 May 2023
Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation
Yuval Kirstain
Adam Polyak
Uriel Singer
Shahbuland Matiana
Joe Penna
Omer Levy
EGVM
185
375
0
02 May 2023
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
...
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLM
CLIP
SSL
249
3,205
0
14 Apr 2023
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
Jiazheng Xu
Xiao Liu
Yuchen Wu
Yuxuan Tong
Qinkai Li
Ming Ding
Jie Tang
Yuxiao Dong
84
360
0
12 Apr 2023
Towards Flexible Multi-modal Document Models
Naoto Inoue
Kotaro Kikuchi
E. Simo-Serra
Mayu Otani
Kota Yamaguchi
52
21
0
31 Mar 2023
LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation
Guangcong Zheng
Xianpan Zhou
Xuewei Li
Zhongang Qi
Ying Shan
Xi Li
DiffM
57
182
0
30 Mar 2023
Freestyle Layout-to-Image Synthesis
Han Xue
Z. Huang
Qianru Sun
Li Song
Wenjun Zhang
DiffM
34
65
0
25 Mar 2023
LayoutDM: Discrete Diffusion Model for Controllable Layout Generation
Naoto Inoue
Kotaro Kikuchi
E. Simo-Serra
Mayu Otani
Kota Yamaguchi
DiffM
70
105
0
14 Mar 2023
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Hao Zhang
...
Chun-yue Li
Jianwei Yang
Hang Su
Jun Zhu
Lei Zhang
ObjD
163
1,893
0
09 Mar 2023
T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models
Chong Mou
Xintao Wang
Liangbin Xie
Yanze Wu
Shuai Liu
Zhongang Qi
Ying Shan
Xiaohu Qie
DiffM
54
999
0
16 Feb 2023
GLIGEN: Open-Set Grounded Text-to-Image Generation
Yuheng Li
Haotian Liu
Qingyang Wu
Fangzhou Mu
Jianwei Yang
Jianfeng Gao
Chunyuan Li
Yong Jae Lee
VLM
102
584
1
17 Jan 2023
Scalable Diffusion Models with Transformers
William S. Peebles
Saining Xie
GNN
66
2,182
0
19 Dec 2022
All are Worth Words: A ViT Backbone for Diffusion Models
Fan Bao
Shen Nie
Kaiwen Xue
Yue Cao
Chongxuan Li
Hang Su
Jun Zhu
VLM
62
343
0
25 Sep 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
196
2,789
0
25 Aug 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
289
5,904
0
23 May 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
268
15,081
0
20 Dec 2021
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
148
7,639
0
11 May 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
681
28,659
0
26 Feb 2021
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
147
7,166
0
06 Oct 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
299
17,550
0
19 Jun 2020
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
218
10,591
0
17 Feb 2020
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
270
19,824
0
23 Oct 2019
PubLayNet: largest dataset ever for document layout analysis
Xu Zhong
Jianbin Tang
Antonio Jimeno Yepes
29
454
0
16 Aug 2019