arXiv: 2212.05032

Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis

9 December 2022
Weixi Feng
Xuehai He
Tsu-jui Fu
Varun Jampani
Arjun Reddy Akula
P. Narayana
Sugato Basu
Qing Guo
William Yang Wang
CoGe

Papers citing "Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis"

50 / 263 papers shown
Title
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
94
2,128
0
04 Jul 2023
Localized Text-to-Image Generation for Free via Cross Attention Control
Yutong He
Ruslan Salakhutdinov
J. Zico Kolter
DiffM
64
21
0
26 Jun 2023
A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis
Aishwarya Agarwal
Srikrishna Karanam
K. J. Joseph
Apoorv Saxena
Koustava Goswami
Balaji Vasan Srinivasan
VLM
DiffM
11
46
0
26 Jun 2023
Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models
Luozhou Wang
Guibao Shen
Wenhang Ge
Guangyong Chen
Yijun Li
Yingke Chen
DiffM
38
4
0
26 Jun 2023
Zero-shot spatial layout conditioning for text-to-image diffusion models
Guillaume Couairon
Marlene Careil
Matthieu Cord
Stéphane Lathuilière
Jakob Verbeek
VLM
16
63
0
23 Jun 2023
Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models
Geon Yeong Park
Jeongsol Kim
Beomsu Kim
Sang Wan Lee
Jong Chul Ye
DiffM
19
21
0
16 Jun 2023
Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment
Royi Rassin
Eran Hirsch
Daniel Glickman
Shauli Ravfogel
Yoav Goldberg
Gal Chechik
DiffM
42
100
0
15 Jun 2023
Norm-guided latent space exploration for text-to-image generation
Dvir Samuel
Rami Ben-Ari
N. Darshan
Haggai Maron
Gal Chechik
DiffM
29
24
0
14 Jun 2023
Grounded Text-to-Image Synthesis with Attention Refocusing
Quynh Phung
Songwei Ge
Jia-Bin Huang
DiffM
30
104
0
08 Jun 2023
Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models
Nan Liu
Yilun Du
Shuang Li
J. Tenenbaum
Antonio Torralba
DiffM
CoGe
14
24
0
08 Jun 2023
On the Design Fundamentals of Diffusion Models: A Survey
Ziyi Chang
G. Koulieris
Hubert P. H. Shum
DiffM
29
53
0
07 Jun 2023
Stable Diffusion is Unstable
Chengbin Du
Yanxi Li
Zhongwei Qiu
Chang Xu
DiffM
33
17
0
05 Jun 2023
Detector Guidance for Multi-Object Text-to-Image Generation
Luping Liu
Zijian Zhang
Yi Ren
Rongjie Huang
Xiang Yin
Zhou Zhao
DiffM
31
9
0
04 Jun 2023
Discovering Failure Modes of Text-guided Diffusion Models via Adversarial Search
Qihao Liu
Adam Kortylewski
Yutong Bai
Song Bai
Alan Yuille
DiffM
32
12
0
01 Jun 2023
RealignDiff: Boosting Text-to-Image Diffusion Model with Coarse-to-fine Semantic Re-alignment
Guian Fang
Zutao Jiang
Jianhua Han
Guangsong Lu
Hang Xu
Shengcai Liao
Xiaodan Liang
EGVM
29
1
0
31 May 2023
Cones 2: Customizable Image Synthesis with Multiple Subjects
Zhiheng Liu
Yifei Zhang
Yujun Shen
Kecheng Zheng
Kai Zhu
Ruili Feng
Yu Liu
Deli Zhao
Jingren Zhou
Yang Cao
DiffM
63
80
0
30 May 2023
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
Yuchao Gu
Xintao Wang
Jay Zhangjie Wu
Yujun Shi
Yunpeng Chen
...
Shuning Chang
Wei Yu Wu
Yixiao Ge
Ying Shan
Mike Zheng Shou
DiffM
52
166
0
29 May 2023
Photoswap: Personalized Subject Swapping in Images
Jing Gu
Yilin Wang
Nanxuan Zhao
Tsu-jui Fu
Wei Xiong
...
Zhifei Zhang
He Zhang
Jianming Zhang
Hyun-Sun Jung
Xin Eric Wang
DiffM
26
37
0
29 May 2023
Autoencoding Conditional Neural Processes for Representation Learning
Victor Prokhorov
Ivan Titov
N. Siddharth
BDL
18
0
0
29 May 2023
CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion
Guangyao Zhai
Evin Pınar Örnek
Shun-cheng Wu
Yan Di
F. Tombari
Nassir Navab
Benjamin Busam
DiffM
32
12
0
25 May 2023
DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Ying Fan
Olivia Watkins
Yuqing Du
Hao Liu
Moonkyung Ryu
Craig Boutilier
Pieter Abbeel
Mohammad Ghavamzadeh
Kangwook Lee
Kimin Lee
46
135
0
25 May 2023
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
Weixi Feng
Wanrong Zhu
Tsu-jui Fu
Varun Jampani
Arjun Reddy Akula
Xuehai He
Sugato Basu
Qing Guo
William Yang Wang
MLLM
30
162
0
24 May 2023
MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation
Marco Bellagente
Manuel Brack
H. Teufel
Felix Friedrich
Bjorn Deiseroth
...
Koen Oostermeijer
Andres Felipe Cruz Salinas
P. Schramowski
Kristian Kersting
Samuel Weinbach
36
15
0
24 May 2023
I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors
Tuhin Chakrabarty
Arkadiy Saakyan
Olivia Winn
Artemis Panagopoulou
Yue Yang
Marianna Apidianaki
Smaranda Muresan
DiffM
33
41
0
24 May 2023
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
30
6
0
24 May 2023
Text-guided 3D Human Generation from 2D Collections
Tsu-jui Fu
Wenhan Xiong
Yixin Nie
Jingyu Liu
Barlas Ouguz
William Yang Wang
39
1
0
23 May 2023
Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models
Ruichen Wang
Zekang Chen
Chen Chen
Jiancang Ma
H. Lu
Xiaodong Lin
DiffM
52
65
0
23 May 2023
Training Priors Predict Text-To-Image Model Performance
Charles Lovering
Ellie Pavlick
CoGe
30
3
0
23 May 2023
If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection
Shyamgopal Karthik
Karsten Roth
Massimiliano Mancini
Zeynep Akata
36
20
0
22 May 2023
The CLIP Model is Secretly an Image-to-Prompt Converter
Yuxuan Ding
Chunna Tian
Haoxuan Ding
Lingqiao Liu
DiffM
22
14
0
22 May 2023
LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
Yujie Lu
Xianjun Yang
Xiujun Li
Qing Guo
William Yang Wang
EGVM
52
35
0
18 May 2023
Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners
Xuehai He
Weixi Feng
Tsu-jui Fu
Varun Jampani
Arjun Reddy Akula
P. Narayana
Sugato Basu
William Yang Wang
Qing Guo
DiffM
49
7
0
18 May 2023
Exploiting Diffusion Prior for Real-World Image Super-Resolution
Jianyi Wang
Zongsheng Yue
Shangchen Zhou
Kelvin C. K. Chan
Chen Change Loy
44
281
0
11 May 2023
Guided Image Synthesis via Initial Image Editing in Diffusion Model
Jiafeng Mao
Xueting Wang
Kiyoharu Aizawa
DiffM
37
52
0
05 May 2023
Generating images of rare concepts using pre-trained diffusion models
Dvir Samuel
Rami Ben-Ari
Simon Raviv
N. Darshan
Gal Chechik
135
38
0
27 Apr 2023
Training-Free Location-Aware Text-to-Image Synthesis
Jiafeng Mao
Xueting Wang
19
11
0
26 Apr 2023
Expressive Text-to-Image Generation with Rich Text
Songwei Ge
Taesung Park
Jun-Yan Zhu
Jia-Bin Huang
DiffM
79
79
0
13 Apr 2023
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
Jiazheng Xu
Xiao Liu
Yuchen Wu
Yuxuan Tong
Qinkai Li
Ming Ding
Jie Tang
Yuxiao Dong
46
313
0
12 Apr 2023
HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models
Eslam Mohamed Bakr
Pengzhan Sun
Xiaoqian Shen
Faizan Farooq Khan
Li Erran Li
Mohamed Elhoseiny
VLM
24
76
0
11 Apr 2023
Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
Qiucheng Wu
Yujian Liu
Handong Zhao
T. Bui
Zhe-nan Lin
Yang Zhang
Shiyu Chang
DiffM
42
44
0
07 Apr 2023
Training-Free Layout Control with Cross-Attention Guidance
Minghao Chen
Iro Laina
Andrea Vedaldi
DiffM
135
222
0
06 Apr 2023
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor
Vidit Goel
E. Peruzzo
Yi Ding
Dejia Xu
Xingqian Xu
N. Sebe
Trevor Darrell
Zhangyang Wang
Humphrey Shi
DiffM
27
6
0
30 Mar 2023
Text-to-Image Diffusion Models are Zero-Shot Classifiers
Kevin Clark
P. Jaini
DiffM
VLM
32
107
0
27 Mar 2023
Human Preference Score: Better Aligning Text-to-Image Models with Human Preference
Xiaoshi Wu
Keqiang Sun
Feng Zhu
Rui Zhao
Hongsheng Li
31
132
0
25 Mar 2023
CompoNeRF: Text-guided Multi-object Compositional NeRF with Editable 3D Scene Layout
Haotian Bai
Yiqi Lin
Hui Xiong
Sijia Li
H. Lu
Xiaodong Lin
Lin Wang
DiffM
45
42
0
24 Mar 2023
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
Yushi Hu
Benlin Liu
Jungo Kasai
Yizhong Wang
Mari Ostendorf
Ranjay Krishna
Noah A. Smith
EGVM
41
208
0
21 Mar 2023
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
Ligong Han
Yinxiao Li
Han Zhang
P. Milanfar
Dimitris N. Metaxas
Feng Yang
DiffM
41
269
0
20 Mar 2023
FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model
Jiwen Yu
Yinhuai Wang
Chen Zhao
Guohao Li
Jian Zhang
DiffM
24
168
0
17 Mar 2023
Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation
Y. Ma
Huan Yang
Wenjing Wang
Jianlong Fu
Jiaying Liu
17
65
0
16 Mar 2023
Cones: Concept Neurons in Diffusion Models for Customized Generation
Zhiheng Liu
Ruili Feng
Kai Zhu
Yifei Zhang
Kecheng Zheng
Yu Liu
Deli Zhao
Jingren Zhou
Yang Cao
DiffM
111
120
0
09 Mar 2023