ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.06720
  4. Cited By
Expressive Text-to-Image Generation with Rich Text

Expressive Text-to-Image Generation with Rich Text

13 April 2023
Songwei Ge
Taesung Park
Jun-Yan Zhu
Jia-Bin Huang
    DiffM
ArXivPDFHTML

Papers citing "Expressive Text-to-Image Generation with Rich Text"

50 / 86 papers shown
Title
Be Decisive: Noise-Induced Layouts for Multi-Subject Generation
Be Decisive: Noise-Induced Layouts for Multi-Subject Generation
Omer Dahary
Yehonathan Cohen
Or Patashnik
Kfir Aberman
Daniel Cohen-Or
DiffM
6
0
0
27 May 2025
Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models
Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models
Donghoon Kim
Minji Bae
Kyuhong Shim
B. Shim
52
0
0
13 May 2025
Compass Control: Multi Object Orientation Control for Text-to-Image Generation
Compass Control: Multi Object Orientation Control for Text-to-Image Generation
Rishubh Parihar
Vaibhav Agrawal
Sachidanand VS
R. V. Babu
DiffM
60
0
0
09 Apr 2025
Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model
Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model
Qi Mao
Lawrence Yunliang Chen
Yuchao Gu
Mike Zheng Shou
Ming-Hsuan Yang
DiffM
54
0
0
08 Apr 2025
DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation
Hongbin Lin
Zilu Guo
Yiming Zhang
Shuaicheng Niu
Yafeng Li
Ruiyi Zhang
Shuguang Cui
Zhen Li
DiffM
61
1
0
14 Mar 2025
Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models
Héctor Laria
Alexandra Gomez-Villa
Jiang Qin
Muhammad Atif Butt
Bogdan Raducanu
Javier Vázquez-Corral
Joost van de Weijer
Kai Wang
DiffM
77
0
0
12 Mar 2025
Tight Inversion: Image-Conditioned Inversion for Real Image Editing
Tight Inversion: Image-Conditioned Inversion for Real Image Editing
Edo Kadosh
Nir Goren
Or Patashnik
Daniel Garibi
Daniel Cohen-Or
DiffM
79
0
0
27 Feb 2025
VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis
VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis
Zhipeng Chen
Lan Yang
Yonggang Qi
Honggang Zhang
Kaiyue Pang
Ke Li
Yi-Zhe Song
DiffM
107
0
0
31 Dec 2024
Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects
Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects
Weimin Qiu
Jieke Wang
Meng Tang
DiffM
105
0
0
28 Nov 2024
Artificial Intelligence for Biomedical Video Generation
Artificial Intelligence for Biomedical Video Generation
Linyuan Li
Jianing Qiu
Anujit Saha
Lin Li
Poyuan Li
Mengxian He
Ziyu Guo
Wu Yuan
VGen
87
1
0
12 Nov 2024
Token Merging for Training-Free Semantic Binding in Text-to-Image
  Synthesis
Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis
Taihang Hu
Linxuan Li
Joost van de Weijer
Hongcheng Gao
Fahad Shahbaz Khan
Jian Yang
Ming-Ming Cheng
Kai Wang
Yaxing Wang
DiffM
74
7
0
11 Nov 2024
Improving image synthesis with diffusion-negative sampling
Improving image synthesis with diffusion-negative sampling
Alakh Desai
Nuno Vasconcelos
DiffM
42
0
0
08 Nov 2024
GrounDiT: Grounding Diffusion Transformers via Noisy Patch
  Transplantation
GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation
Phillip Y. Lee
Taehoon Yoon
Minhyuk Sung
82
6
1
27 Oct 2024
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
Rasoul Shafipour
David Harrison
Maxwell Horton
Jeffrey Marker
Houman Bedayat
Sachin Mehta
Mohammad Rastegari
Mahyar Najibi
Saman Naderiparizi
MQ
73
3
0
14 Oct 2024
Semantic Token Reweighting for Interpretable and Controllable Text
  Embeddings in CLIP
Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP
Eunji Kim
Kyuhong Shim
Simyung Chang
Sungroh Yoon
CLIP
53
0
0
11 Oct 2024
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through
  Data, Reward, and Conditional Guidance Design
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design
Jiachen Li
Qian Long
Jian Zheng
Xiaofeng Gao
Robinson Piramuthu
Wenhu Chen
William Yang Wang
VGen
57
23
0
08 Oct 2024
JVID: Joint Video-Image Diffusion for Visual-Quality and
  Temporal-Consistency in Video Generation
JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation
Hadrien Reynaud
Matthew Baugh
Mischa Dombrowski
Sarah Cechnicka
Qingjie Meng
Bernhard Kainz
VLM
44
0
0
21 Sep 2024
Anim-Director: A Large Multimodal Model Powered Agent for Controllable
  Animation Video Generation
Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation
Yunxin Li
Haoyuan Shi
Baotian Hu
Longyue Wang
Jiashun Zhu
Jinyi Xu
Zhen Zhao
Min Zhang
VGen
63
7
0
19 Aug 2024
EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos
EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos
Aashish Rai
Srinath Sridhar
DiffM
49
4
0
30 Jul 2024
Text2LiDAR: Text-guided LiDAR Point Cloud Generation via Equirectangular
  Transformer
Text2LiDAR: Text-guided LiDAR Point Cloud Generation via Equirectangular Transformer
Yang Wu
Kaihua Zhang
Jianjun Qian
Jin Xie
Jian Yang
DiffM
76
5
0
29 Jul 2024
Replication in Visual Diffusion Models: A Survey and Outlook
Replication in Visual Diffusion Models: A Survey and Outlook
Wenhao Wang
Yifan Sun
Zongxin Yang
Zhengdong Hu
Zhentao Tan
Yi Yang
123
7
0
07 Jul 2024
Boosting Consistency in Story Visualization with Rich-Contextual
  Conditional Diffusion Models
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
Fei Shen
Hu Ye
Sibo Liu
Jun Zhang
Cong Wang
Xiao Han
Wei Yang
97
37
0
02 Jul 2024
AnyMaker: Zero-shot General Object Customization via Decoupled
  Dual-Level ID Injection
AnyMaker: Zero-shot General Object Customization via Decoupled Dual-Level ID Injection
Lingjie Kong
Kai WU
Xiaobin Hu
Wenhui Han
Jinlong Peng
Chengming Xu
Donghao Luo
Jiangning Zhang
Chengjie Wang
Yanwei Fu
DiffM
43
0
0
17 Jun 2024
Crafting Parts for Expressive Object Composition
Crafting Parts for Expressive Object Composition
Harsh Rangwani
Aishwarya Agarwal
Kuldeep Kulkarni
R. Venkatesh Babu
Srikrishna Karanam
DiffM
66
2
0
14 Jun 2024
Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image
  Diffusion Models
Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models
Ziyi Wu
Yulia Rubanova
Rishabh Kabra
Drew A. Hudson
Igor Gilitschenski
Yusuf Aytar
Sjoerd van Steenkiste
Kelsey R. Allen
Thomas Kipf
VGen
DiffM
66
9
0
13 Jun 2024
AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image
  Generation
AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image Generation
Lianyu Pang
Jian Yin
Baoquan Zhao
Feize Wu
Fu Lee Wang
Qing Li
Xudong Mao
DiffM
62
1
0
07 Jun 2024
Coherent Zero-Shot Visual Instruction Generation
Coherent Zero-Shot Visual Instruction Generation
Quynh Phung
Songwei Ge
Jia-Bin Huang
57
2
0
06 Jun 2024
ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance
ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance
Jiannan Huang
Jun Hao Liew
Hanshu Yan
Yuyang Yin
Yao Zhao
Yunchao Wei
Yunchao Wei
DiffM
109
7
0
27 May 2024
LEAST: "Local" text-conditioned image style transfer
LEAST: "Local" text-conditioned image style transfer
Silky Singh
Surgan Jandial
Simra Shahid
Abhinav Java
55
0
0
25 May 2024
Compositional Text-to-Image Generation with Dense Blob Representations
Compositional Text-to-Image Generation with Dense Blob Representations
Weili Nie
Sifei Liu
Morteza Mardani
Chao Liu
Benjamin Eckart
Arash Vahdat
DiffM
91
19
0
14 May 2024
StyleMamba : State Space Model for Efficient Text-driven Image Style
  Transfer
StyleMamba : State Space Model for Efficient Text-driven Image Style Transfer
Zijia Wang
Zhi-Song Liu
Mamba
43
8
0
08 May 2024
MMTryon: Multi-Modal Multi-Reference Control for High-Quality Fashion
  Generation
MMTryon: Multi-Modal Multi-Reference Control for High-Quality Fashion Generation
Xujie Zhang
Ente Lin
Xiu Li
Yuxuan Luo
Michael C. Kampffmeyer
Xin Dong
Xiaodan Liang
56
12
0
01 May 2024
Editable Image Elements for Controllable Synthesis
Editable Image Elements for Controllable Synthesis
Jiteng Mu
Michael Gharbi
Richard Zhang
Eli Shechtman
Nuno Vasconcelos
Xiaolong Wang
Taesung Park
DiffM
60
9
0
24 Apr 2024
Customizing Text-to-Image Diffusion with Camera Viewpoint Control
Customizing Text-to-Image Diffusion with Camera Viewpoint Control
Nupur Kumari
Grace Su
Richard Zhang
Taesung Park
Eli Shechtman
Jun-Yan Zhu
DiffM
49
5
0
18 Apr 2024
ZeST: Zero-Shot Material Transfer from a Single Image
ZeST: Zero-Shot Material Transfer from a Single Image
Ta-Ying Cheng
Prafull Sharma
Andrew Markham
Niki Trigoni
Varun Jampani
49
10
0
09 Apr 2024
MC$^2$: Multi-concept Guidance for Customized Multi-concept Generation
MC2^22: Multi-concept Guidance for Customized Multi-concept Generation
Jiaxiu Jiang
Yabo Zhang
Kailai Feng
Xiaohe Wu
Wangmeng Zuo
DiffM
47
12
0
08 Apr 2024
AWOL: Analysis WithOut synthesis using Language
AWOL: Analysis WithOut synthesis using Language
Silvia Zuffi
Michael J. Black
43
2
0
03 Apr 2024
GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from
  a Single Image
GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image
Chong Bao
Yinda Zhang
Yuan Li
Xiyu Zhang
Bangbang Yang
Hujun Bao
Marc Pollefeys
Guofeng Zhang
Zhaopeng Cui
DiffM
60
5
0
02 Apr 2024
ReNoise: Real Image Inversion Through Iterative Noising
ReNoise: Real Image Inversion Through Iterative Noising
Daniel Garibi
Or Patashnik
Andrey Voynov
Hadar Averbuch-Elor
Daniel Cohen-Or
DiffM
56
54
0
21 Mar 2024
One-Step Image Translation with Text-to-Image Models
One-Step Image Translation with Text-to-Image Models
Gaurav Parmar
Taesung Park
Srinivasa Narasimhan
Jun-Yan Zhu
51
46
0
18 Mar 2024
Bridging Different Language Models and Generative Vision Models for
  Text-to-Image Generation
Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
Shihao Zhao
Shaozhe Hao
Bojia Zi
Huaizhe Xu
Kwan-Yee K. Wong
DiffM
VLM
75
8
0
12 Mar 2024
It's All About Your Sketch: Democratising Sketch Control in Diffusion
  Models
It's All About Your Sketch: Democratising Sketch Control in Diffusion Models
Subhadeep Koley
A. Bhunia
Deeptanshu Sekhri
Aneeshan Sain
Pinaki Nath Chowdhury
Tao Xiang
Yi-Zhe Song
DiffM
50
16
0
12 Mar 2024
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Pu Cao
Feng Zhou
Qing-Huang Song
Lu Yang
90
38
0
07 Mar 2024
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video
  Diffusion Models via Training-Free Unified Attention Control
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
Xuweiyi Chen
Tian Xia
Sihan Xu
VGen
DiffM
45
8
0
04 Mar 2024
Diffusion Model-Based Image Editing: A Survey
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
87
90
0
27 Feb 2024
Consolidating Attention Features for Multi-view Image Editing
Consolidating Attention Features for Multi-view Image Editing
Or Patashnik
Rinon Gal
Daniel Cohen-Or
Jun-Yan Zhu
Fernando de la Torre
37
5
0
22 Feb 2024
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image
  Generation
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
Chong Zeng
Yue Dong
Pieter Peers
Youkang Kong
Hongzhi Wu
Xin Tong
76
28
0
19 Feb 2024
Separable Multi-Concept Erasure from Diffusion Models
Separable Multi-Concept Erasure from Diffusion Models
Mengnan Zhao
Lihe Zhang
Tianhang Zheng
Yuqiu Kong
Baocai Yin
61
10
0
03 Feb 2024
A Survey on Data Augmentation in Large Model Era
A Survey on Data Augmentation in Large Model Era
Yue Zhou
Chenlu Guo
Xu Wang
Yi-Ju Chang
Yuan Wu
LM&MA
VLM
72
24
0
27 Jan 2024
FreeControl: Training-Free Spatial Control of Any Text-to-Image
  Diffusion Model with Any Condition
FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition
Sicheng Mo
Fangzhou Mu
Kuan Heng Lin
Yanli Liu
Bochen Guan
Yin Li
Bolei Zhou
DiffM
65
62
0
12 Dec 2023
12
Next