Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2212.05032
Cited By
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
9 December 2022
Weixi Feng
Xuehai He
Tsu-jui Fu
Varun Jampani
Arjun Reddy Akula
P. Narayana
Sugato Basu
Qing Guo
William Yang Wang
CoGe
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis"
50 / 263 papers shown
Title
Amodal Completion via Progressive Mixed Context Diffusion
Katherine Xu
Lingzhi Zhang
Jianbo Shi
DiffM
50
16
0
24 Dec 2023
The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization
Jiafeng Mao
Xueting Wang
Kiyoharu Aizawa
DiffM
60
3
0
13 Dec 2023
CONFORM: Contrast is All You Need For High-Fidelity Text-to-Image Diffusion Models
Tuna Han Salih Meral
Enis Simsar
Federico Tombari
Pinar Yanardag
DiffM
VLM
35
26
0
11 Dec 2023
Correcting Diffusion Generation through Resampling
Yujian Liu
Yang Zhang
Tommi Jaakkola
Shiyu Chang
33
7
0
10 Dec 2023
UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
Yiming Zhao
Zhouhui Lian
76
27
0
08 Dec 2023
TokenCompose: Text-to-Image Diffusion with Token-level Supervision
Zirui Wang
Zhizhou Sha
Zheng Ding
Yilin Wang
Zhuowen Tu
DiffM
27
20
0
06 Dec 2023
Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control
Sitong Su
Litao Guo
Lianli Gao
Hengtao Shen
Jingkuan Song
DiffM
35
3
0
06 Dec 2023
TPA3D: Triplane Attention for Fast Text-to-3D Generation
Hong-En Chen
Bin-Shih Wu
Sheng-Yu Huang
Yu-Chiang Frank Wang
17
2
0
05 Dec 2023
A Contrastive Compositional Benchmark for Text-to-Image Synthesis: A Study with Unified Text-to-Image Fidelity Metrics
Xiangru Zhu
Penglei Sun
Chengyu Wang
Jingping Liu
Zhixu Li
Yanghua Xiao
Jun Huang
CoGe
102
5
0
04 Dec 2023
Fair Text-to-Image Diffusion via Fair Mapping
Jia Li
Lijie Hu
Jingfeng Zhang
Tianhang Zheng
Hua Zhang
Di Wang
51
14
0
29 Nov 2023
DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback
Jiao Sun
Deqing Fu
Yushi Hu
Su Wang
Royi Rassin
...
Dana Alon
Charles Herrmann
Sjoerd van Steenkiste
Ranjay Krishna
Cyrus Rashtchian
EGVM
32
40
0
29 Nov 2023
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Yutong Feng
Biao Gong
Di Chen
Yujun Shen
Yu Liu
Jingren Zhou
DiffM
31
43
0
28 Nov 2023
Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis
Xiaohui Chen
Yongfei Liu
Yingxiang Yang
Jianbo Yuan
Quanzeng You
Liping Liu
Hongxia Yang
DiffM
53
11
0
28 Nov 2023
SEED-Bench-2: Benchmarking Multimodal Large Language Models
Bohao Li
Yuying Ge
Yixiao Ge
Guangzhi Wang
Rui Wang
Ruimao Zhang
Ying Shan
MLLM
VLM
28
67
0
28 Nov 2023
Synthetic Shifts to Initial Seed Vector Exposes the Brittle Nature of Latent-Based Diffusion Models
Poyuan Mao
Shashank Kotyan
Tham Yik Foong
Danilo Vasconcellos Vargas
26
5
0
24 Nov 2023
LoCo: Locally Constrained Training-Free Layout-to-Image Synthesis
Peiang Zhao
Han Li
Ruiyang Jin
S. Kevin Zhou
DiffM
51
12
0
21 Nov 2023
An Image is Worth Multiple Words: Multi-attribute Inversion for Constrained Text-to-Image Synthesis
Aishwarya Agarwal
Srikrishna Karanam
Tripti Shukla
Balaji Vasan Srinivasan
DiffM
103
20
0
20 Nov 2023
DECDM: Document Enhancement using Cycle-Consistent Diffusion Models
Jiaxin Zhang
Joy Rimchala
Lalla Mouatadid
Kamalika Das
Kumar Sricharan
DiffM
24
0
0
16 Nov 2023
UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
Yanwu Xu
Yang Zhao
Zhisheng Xiao
Tingbo Hou
134
107
0
14 Nov 2023
VideoDreamer: Customized Multi-Subject Text-to-Video Generation with Disen-Mix Finetuning on Language-Video Foundation Models
Hong Chen
Xin Wang
Guanning Zeng
Yipeng Zhang
Yuwei Zhou
Feilin Han
Wenwu Zhu
Wenwu Zhu
VGen
DiffM
36
1
0
02 Nov 2023
Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts
Xinhua Cheng
Tianyu Yang
Jianan Wang
Yu Li
Lei Zhang
Jian Zhang
Li-ming Yuan
DiffM
28
43
0
18 Oct 2023
GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment
Dhruba Ghosh
Hanna Hajishirzi
Ludwig Schmidt
9
137
0
17 Oct 2023
Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic Task
Maya Okawa
Ekdeep Singh Lubana
Robert P. Dick
Hidenori Tanaka
CoGe
DiffM
37
44
0
13 Oct 2023
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Zhengyuan Yang
Jianfeng Wang
Linjie Li
Kevin Qinghong Lin
Chung-Ching Lin
Zicheng Liu
Lijuan Wang
LRM
MLLM
DiffM
13
22
0
12 Oct 2023
Multi-Concept T2I-Zero: Tweaking Only The Text Embeddings and Nothing Else
Hazarapet Tunanyan
Dejia Xu
Shant Navasardyan
Zhangyang Wang
Humphrey Shi
DiffM
83
7
0
11 Oct 2023
CoT3DRef: Chain-of-Thoughts Data-Efficient 3D Visual Grounding
Eslam Mohamed Bakr
Mohamed Ayman
Mahmoud Ahmed
Habib Slim
Mohamed Elhoseiny
LRM
28
12
0
10 Oct 2023
Aligning Text-to-Image Diffusion Models with Reward Backpropagation
Mihir Prabhudesai
Anirudh Goyal
Deepak Pathak
Katerina Fragkiadaki
37
111
0
05 Oct 2023
Predicated Diffusion: Predicate Logic-Based Attention Guidance for Text-to-Image Diffusion Models
Kota Sueyoshi
Takashi Matsubara
DiffM
18
8
0
03 Oct 2023
TP2O: Creative Text Pair-to-Object Generation using Balance Swap-Sampling
Jun Li
Zedong Zhang
Jian Yang
DiffM
32
6
0
03 Oct 2023
ImagenHub: Standardizing the evaluation of conditional image generation models
Max W.F. Ku
Tianle Li
Kai Zhang
Yujie Lu
Xingyu Fu
Wenwen Zhuang
Wenhu Chen
EGVM
36
47
0
02 Oct 2023
Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Xu Ju
Ailing Zeng
Hao Wang
Shaoteng Liu
Qiang Xu
DiffM
34
67
0
02 Oct 2023
DreamLLM: Synergistic Multimodal Comprehension and Creation
Runpei Dong
Chunrui Han
Yuang Peng
Zekun Qi
Zheng Ge
...
Hao-Ran Wei
Xiangwen Kong
Xiangyu Zhang
Kaisheng Ma
Li Yi
MLLM
39
173
0
20 Sep 2023
Progressive Text-to-Image Diffusion with Soft Latent Direction
Yuteng Ye
Jiale Cai
Hang Zhou
Guanwen Li
Youjia Zhang
Zikai Song
Chenxing Gao
Junqing Yu
Wei Yang
43
5
0
18 Sep 2023
NExT-GPT: Any-to-Any Multimodal LLM
Shengqiong Wu
Hao Fei
Leigang Qu
Wei Ji
Tat-Seng Chua
MLLM
46
457
0
11 Sep 2023
Create Your World: Lifelong Text-to-Image Diffusion
Gan Sun
Wenqi Liang
Jiahua Dong
Jun Li
Zhengming Ding
Yang Cong
DiffM
VLM
30
28
0
08 Sep 2023
MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask
Yupeng Zhou
Daquan Zhou
Zuo-Liang Zhu
Yaxing Wang
Qibin Hou
Jiashi Feng
27
10
0
08 Sep 2023
Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter
Jinglong Wang
Xiawei Li
Jing Zhang
Qingyuan Xu
Qin Zhou
Qian Yu
Lu Sheng
Dong Xu
VLM
DiffM
26
45
0
06 Sep 2023
A Survey of Diffusion Based Image Generation Models: Issues and Their Solutions
Tianyi Zhang
Zheng Wang
Jin Huang
M. M. Tasnim
Wei Shi
VLM
16
21
0
25 Aug 2023
Dense Text-to-Image Generation with Attention Modulation
Yunji Kim
Jiyoung Lee
Jin-Hwa Kim
Jung-Woo Ha
Jun-Yan Zhu
DiffM
41
134
0
24 Aug 2023
DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment
Xujie Zhang
Binbin Yang
Michael C. Kampffmeyer
Wenqing Zhang
Shiyue Zhang
Guansong Lu
Liang Lin
Hang Xu
Xiaodan Liang
DiffM
31
9
0
22 Aug 2023
LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation
Leigang Qu
Shengqiong Wu
Hao Fei
Liqiang Nie
Tat-Seng Chua
LM&Ro
DiffM
MLLM
35
88
0
09 Aug 2023
TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition
Shilin Lu
Yanzhu Liu
A. Kong
43
51
0
24 Jul 2023
Divide & Bind Your Attention for Improved Generative Semantic Nursing
Yumeng Li
M. Keuper
Dan Zhang
Anna Khoreva
DiffM
25
47
0
20 Jul 2023
BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
Jinheng Xie
Yuexiang Li
Yawen Huang
Haozhe Liu
Wentian Zhang
Yefeng Zheng
Mike Zheng Shou
DiffM
51
193
0
20 Jul 2023
Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation
Federico Betti
Jacopo Staiano
Lorenzo Baraldi
Lorenzo Baraldi
Rita Cucchiara
N. Sebe
EGVM
29
6
0
18 Jul 2023
Zero-Shot Image Harmonization with Generative Model Prior
Jianqi Chen
Yilan Zhang
Zhengxia Zou
Keyan Chen
Z. Shi
DiffM
28
5
0
17 Jul 2023
TIAM -- A Metric for Evaluating Alignment in Text-to-Image Generation
P. Grimal
Hervé Le Borgne
Olivier Ferret
Julien Tourille
EGVM
42
10
0
11 Jul 2023
Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
Jaskirat Singh
Liang Zheng
31
18
0
10 Jul 2023
Human Inspired Progressive Alignment and Comparative Learning for Grounded Word Acquisition
Yuwei Bao
B. Lattimer
J. Chai
CLL
43
1
0
05 Jul 2023
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
Chong Mou
Xintao Wang
Jie Song
Ying Shan
Jian Zhang
DiffM
47
141
0
05 Jul 2023
Previous
1
2
3
4
5
6
Next