Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2301.07093
Cited By
GLIGEN: Open-Set Grounded Text-to-Image Generation
17 January 2023
Yuheng Li
Haotian Liu
Qingyang Wu
Fangzhou Mu
Jianwei Yang
Jianfeng Gao
Chunyuan Li
Yong Jae Lee
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"GLIGEN: Open-Set Grounded Text-to-Image Generation"
50 / 472 papers shown
Title
Fine-Grained Alignment and Noise Refinement for Compositional Text-to-Image Generation
Amir Mohammad Izadi
Seyed Mohsen Hosseini
Soroush Vafaie Tabar
Ali Abdollahi
Armin Saghafian
M. Baghshah
EGVM
40
0
0
09 Mar 2025
Underlying Semantic Diffusion for Effective and Efficient In-Context Learning
Zhong Ji
Weilong Cao
Yan Zhang
Yanwei Pang
Jungong Han
X. Li
DiffM
VLM
47
0
0
06 Mar 2025
DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance
Zhao Yang
Zezhong Qian
Xiaofan Li
Weixiang Xu
Gongpeng Zhao
Ruohong Yu
Lingsi Zhu
Longjun Liu
DiffM
VGen
65
1
0
05 Mar 2025
StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts
Zhaoxing Gan
Mengtian Li
Ruhua Chen
Zhongxia Ji
Sichen Guo
Huanling Hu
Guangnan Ye
Zuo Hu
DiffM
VGen
85
0
0
04 Mar 2025
VisAgent: Narrative-Preserving Story Visualization Framework
Seungkwon Kim
GyuTae Park
Sangyeon Kim
Seung-Hun Nam
40
0
0
04 Mar 2025
DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models
Zhendong Wang
Jianmin Bao
Shuyang Gu
Dong Chen
Wengang Zhou
Yiming Li
DiffM
53
0
0
03 Mar 2025
ToLo: A Two-Stage, Training-Free Layout-To-Image Generation Framework For High-Overlap Layouts
Linhao Huang
Jing Yu
DiffM
47
0
0
03 Mar 2025
Zero-Shot Head Swapping in Real-World Scenarios
S. Jeong
Taewoong Kang
Hyojin Jang
Jaegul Choo
34
0
0
02 Mar 2025
T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting
Yifei Qian
Zhongliang Guo
Bowen Deng
Chun Tong Lei
Shuai Zhao
Chun Pong Lau
Xiaopeng Hong
Michael P. Pound
DiffM
59
0
0
28 Feb 2025
C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation
Yuhao Li
Mirana Claire Angel
Salman Khan
Yu Zhu
Jinqiu Sun
Yanning Zhang
F. Khan
VGen
46
0
0
27 Feb 2025
Multi-Perspective Data Augmentation for Few-shot Object Detection
Anh-Khoa Nguyen Vu
Quoc-Truong Truong
Vinh-Tiep Nguyen
T. Ngo
Thanh-Toan Do
Tam V. Nguyen
74
1
0
25 Feb 2025
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
Yifan Pu
Yiming Zhao
Zhicong Tang
Ruihong Yin
Haoxing Ye
...
Ji Li
Xiu Li
Zheng Lian
Gao Huang
Baining Guo
DiffM
62
2
0
25 Feb 2025
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing
Xiangpeng Yang
Linchao Zhu
Hehe Fan
Yi Yang
DiffM
VGen
46
5
0
24 Feb 2025
Controllable Satellite-to-Street-View Synthesis with Precise Pose Alignment and Zero-Shot Environmental Control
Xianghui Ze
Zhenbo Song
Qiwei Wang
Jianfeng Lu
Yujiao Shi
53
0
0
05 Feb 2025
Parameter-Efficient Fine-Tuning for Foundation Models
Dan Zhang
Tao Feng
Lilong Xue
Yuandong Wang
Yuxiao Dong
J. Tang
46
8
0
23 Jan 2025
ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions
Shiyue Zhang
Zheng Chong
Xi Lu
Wenqing Zhang
Haoxiang Li
Xujie Zhang
Jiehui Huang
Xiao Dong
Xiaodan Liang
DiffM
42
0
0
21 Jan 2025
Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance
Jin Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
76
4
0
20 Jan 2025
Grounding Text-to-Image Diffusion Models for Controlled High-Quality Image Generation
Ahmad Süleyman
Göksel Biricik
52
2
0
15 Jan 2025
Enhancing Image Generation Fidelity via Progressive Prompts
Zhen Xiong
Yuqi Li
Chuanguang Yang
Tiao Tan
Zhihong Zhu
Siyuan Li
Yue Ma
45
1
0
13 Jan 2025
EditAR: Unified Conditional Generation with Autoregressive Models
Jiteng Mu
Nuno Vasconcelos
Xihuai Wang
DiffM
38
4
0
08 Jan 2025
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
Dongmin Park
Sebin Kim
Taehong Moon
Minkyu Kim
Kangwook Lee
Jaewoong Cho
DiffM
CoGe
64
2
0
08 Jan 2025
RealCustom++: Representing Images as Real-Word for Real-Time Customization
Zhendong Mao
Mengqi Huang
Fei Ding
Mingcong Liu
Qian He
Xiaojun Chang
DiffM
75
6
0
03 Jan 2025
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
Hao Fei
Shengqiong Wu
H. Zhang
Tat-Seng Chua
Shuicheng Yan
64
38
0
31 Dec 2024
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
Hui Zhang
Zuxuan Wu
Zhen Xing
Jie Shao
Yu-Gang Jiang
51
9
0
31 Dec 2024
DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers
Yuntao Chen
Yuqi Wang
Zhaoxiang Zhang
149
7
0
24 Dec 2024
MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing
Chuang Yang
Bingxuan Zhao
Qing Zhou
Qi Wang
83
1
0
18 Dec 2024
LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model
Xi Wang
Yiming Li
Heng Fang
Yichen Peng
H. Xie
Xi Yang
Chuntao Li
DiffM
72
0
0
16 Dec 2024
Mojito: Motion Trajectory and Intensity Control for Video Generation
Xuehai He
Shuohang Wang
Jianwei Yang
Xiaoxia Wu
Yali Wang
Kuan-Chieh Jackson Wang
Z. Zhan
Olatunji Ruwase
Yelong Shen
Qing Guo
VGen
86
1
0
12 Dec 2024
DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Q. He
Jinlong Peng
P. Xu
Boyuan Jiang
Xiaobin Hu
...
Yao Liu
Yuxiang Wang
Chengjie Wang
Xiaomeng Li
Jing Zhang
DiffM
122
1
0
04 Dec 2024
Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis in-the-Wild
Siyoon Jin
Jisu Nam
Jiyoung Kim
Dahyun Chung
Yeong-Seok Kim
Joonhyung Park
Heonjeong Chu
Seungryong Kim
DiffM
82
0
0
04 Dec 2024
SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models
Sabina Martyniak
Joanna Kaleta
Diego DallÁlba
Michał Naskręt
Szymon Płotka
Przemysław Korzeniowski
MedIm
75
0
0
03 Dec 2024
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving
Z. Wu
Jingcheng Ni
Xiaodong Wang
Yuxin Guo
Rui Chen
Lewei Lu
Jifeng Dai
Yuwen Xiong
74
6
0
02 Dec 2024
Improving Object Detection by Modifying Synthetic Data with Explainable AI
Nitish Mital
Simon Malzard
Richard Walters
Celso M. De Melo
Raghuveer Rao
Victoria Nockles
80
0
0
02 Dec 2024
DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models
Shwetha Ram
T. Neiman
Qianli Feng
Andrew Stuart
S. D. Tran
Trishul M. Chilimbi
77
1
0
28 Nov 2024
Training Data Synthesis with Difficulty Controlled Diffusion Model
Zerun Wang
Jiafeng Mao
Xueting Wang
Toshihiko Yamasaki
DiffM
80
0
0
27 Nov 2024
Relations, Negations, and Numbers: Looking for Logic in Generative Text-to-Image Models
C. Conwell
Rupert Tawiah-Quashie
T. Ullman
74
2
0
26 Nov 2024
Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory
Eric H. Jiang
Yasi Zhang
Zhi Zhang
Yixin Wan
Andrew Lizarraga
Shufan Li
Ying Nian Wu
DiffM
77
2
0
25 Nov 2024
Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
Y. Li
Fan Ma
Yi Yang
138
2
0
24 Nov 2024
AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks
Y. Li
Fan Ma
Yi Yang
DiffM
144
2
0
24 Nov 2024
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation
Datao Tang
Xiangyong Cao
Xuan Wu
Jialin Li
Jing Yao
Xueru Bai
Deyu Meng
Yin Li
Deyu Meng
DiffM
80
6
0
23 Nov 2024
LocRef-Diffusion:Tuning-Free Layout and Appearance-Guided Generation
Fan Deng
Yaguang Wu
Xinyang Yu
Xiangjun Huang
Jian Yang
Guangyu Yan
Qiang Xu
DiffM
89
0
0
22 Nov 2024
Generating Compositional Scenes via Text-to-image RGBA Instance Generation
Alessandro Fontanella
Petru-Daniel Tudosiu
Yongxin Yang
Shifeng Zhang
Sarah Parisot
36
2
0
16 Nov 2024
Boundary Attention Constrained Zero-Shot Layout-To-Image Generation
Huancheng Chen
Jingtao Li
Weiming Zhuang
H. Vikalo
Lingjuan Lyu
DiffM
36
0
0
15 Nov 2024
Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis
Taihang Hu
Linxuan Li
Joost van de Weijer
Hongcheng Gao
Fahad Shahbaz Khan
Jian Yang
Ming-Ming Cheng
Kai Wang
Yaxing Wang
DiffM
57
4
0
11 Nov 2024
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models
Nvidia
:
Yuval Atzmon
Maciej Bala
Yogesh Balaji
...
Ting-Chun Wang
Fangyin Wei
Xiaohui Zeng
Yu Zeng
Qinsheng Zhang
58
6
0
11 Nov 2024
Layout Control and Semantic Guidance with Attention Loss Backward for T2I Diffusion Model
Guandong Li
DiffM
28
0
0
11 Nov 2024
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement
Zhennan Chen
Yajie Li
Haofan Wang
Z. Chen
Zhengkai Jiang
Jun Yu Li
Qian Wang
Jian Yang
Ying Tai
DiffM
49
8
0
10 Nov 2024
Improving image synthesis with diffusion-negative sampling
Alakh Desai
Nuno Vasconcelos
DiffM
32
0
0
08 Nov 2024
SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation
Koichi Namekata
Sherwin Bahmani
Ziyi Wu
Yash Kant
Igor Gilitschenski
David B. Lindell
VGen
62
13
0
07 Nov 2024
Training-free Regional Prompting for Diffusion Transformers
Anthony Chen
Jianjin Xu
Wenzhao Zheng
Gaole Dai
Yuxiang Wang
Renrui Zhang
Haofan Wang
Shanghang Zhang
VLM
40
2
0
04 Nov 2024
Previous
1
2
3
4
5
...
8
9
10
Next