ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.07093
  4. Cited By
GLIGEN: Open-Set Grounded Text-to-Image Generation

GLIGEN: Open-Set Grounded Text-to-Image Generation

17 January 2023
Yuheng Li
Haotian Liu
Qingyang Wu
Fangzhou Mu
Jianwei Yang
Jianfeng Gao
Chunyuan Li
Yong Jae Lee
    VLM
ArXivPDFHTML

Papers citing "GLIGEN: Open-Set Grounded Text-to-Image Generation"

50 / 472 papers shown
Title
Fine-Grained Alignment and Noise Refinement for Compositional Text-to-Image Generation
Amir Mohammad Izadi
Seyed Mohsen Hosseini
Soroush Vafaie Tabar
Ali Abdollahi
Armin Saghafian
M. Baghshah
EGVM
40
0
0
09 Mar 2025
Underlying Semantic Diffusion for Effective and Efficient In-Context Learning
Zhong Ji
Weilong Cao
Yan Zhang
Yanwei Pang
Jungong Han
X. Li
DiffM
VLM
47
0
0
06 Mar 2025
DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance
Zhao Yang
Zezhong Qian
Xiaofan Li
Weixiang Xu
Gongpeng Zhao
Ruohong Yu
Lingsi Zhu
Longjun Liu
DiffM
VGen
65
1
0
05 Mar 2025
StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts
Zhaoxing Gan
Mengtian Li
Ruhua Chen
Zhongxia Ji
Sichen Guo
Huanling Hu
Guangnan Ye
Zuo Hu
DiffM
VGen
85
0
0
04 Mar 2025
VisAgent: Narrative-Preserving Story Visualization Framework
Seungkwon Kim
GyuTae Park
Sangyeon Kim
Seung-Hun Nam
40
0
0
04 Mar 2025
DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models
Zhendong Wang
Jianmin Bao
Shuyang Gu
Dong Chen
Wengang Zhou
Yiming Li
DiffM
53
0
0
03 Mar 2025
ToLo: A Two-Stage, Training-Free Layout-To-Image Generation Framework For High-Overlap Layouts
Linhao Huang
Jing Yu
DiffM
47
0
0
03 Mar 2025
Zero-Shot Head Swapping in Real-World Scenarios
Zero-Shot Head Swapping in Real-World Scenarios
S. Jeong
Taewoong Kang
Hyojin Jang
Jaegul Choo
34
0
0
02 Mar 2025
T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting
T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting
Yifei Qian
Zhongliang Guo
Bowen Deng
Chun Tong Lei
Shuai Zhao
Chun Pong Lau
Xiaopeng Hong
Michael P. Pound
DiffM
59
0
0
28 Feb 2025
C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation
C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation
Yuhao Li
Mirana Claire Angel
Salman Khan
Yu Zhu
Jinqiu Sun
Yanning Zhang
F. Khan
VGen
46
0
0
27 Feb 2025
Multi-Perspective Data Augmentation for Few-shot Object Detection
Multi-Perspective Data Augmentation for Few-shot Object Detection
Anh-Khoa Nguyen Vu
Quoc-Truong Truong
Vinh-Tiep Nguyen
T. Ngo
Thanh-Toan Do
Tam V. Nguyen
74
1
0
25 Feb 2025
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
Yifan Pu
Yiming Zhao
Zhicong Tang
Ruihong Yin
Haoxing Ye
...
Ji Li
Xiu Li
Zheng Lian
Gao Huang
Baining Guo
DiffM
62
2
0
25 Feb 2025
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing
Xiangpeng Yang
Linchao Zhu
Hehe Fan
Yi Yang
DiffM
VGen
46
5
0
24 Feb 2025
Controllable Satellite-to-Street-View Synthesis with Precise Pose Alignment and Zero-Shot Environmental Control
Controllable Satellite-to-Street-View Synthesis with Precise Pose Alignment and Zero-Shot Environmental Control
Xianghui Ze
Zhenbo Song
Qiwei Wang
Jianfeng Lu
Yujiao Shi
53
0
0
05 Feb 2025
Parameter-Efficient Fine-Tuning for Foundation Models
Parameter-Efficient Fine-Tuning for Foundation Models
Dan Zhang
Tao Feng
Lilong Xue
Yuandong Wang
Yuxiao Dong
J. Tang
46
8
0
23 Jan 2025
ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions
ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions
Shiyue Zhang
Zheng Chong
Xi Lu
Wenqing Zhang
Haoxiang Li
Xujie Zhang
Jiehui Huang
Xiao Dong
Xiaodan Liang
DiffM
42
0
0
21 Jan 2025
Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance
Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance
Jin Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
76
4
0
20 Jan 2025
Grounding Text-to-Image Diffusion Models for Controlled High-Quality Image Generation
Grounding Text-to-Image Diffusion Models for Controlled High-Quality Image Generation
Ahmad Süleyman
Göksel Biricik
52
2
0
15 Jan 2025
Enhancing Image Generation Fidelity via Progressive Prompts
Enhancing Image Generation Fidelity via Progressive Prompts
Zhen Xiong
Yuqi Li
Chuanguang Yang
Tiao Tan
Zhihong Zhu
Siyuan Li
Yue Ma
45
1
0
13 Jan 2025
EditAR: Unified Conditional Generation with Autoregressive Models
EditAR: Unified Conditional Generation with Autoregressive Models
Jiteng Mu
Nuno Vasconcelos
Xihuai Wang
DiffM
38
4
0
08 Jan 2025
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
Dongmin Park
Sebin Kim
Taehong Moon
Minkyu Kim
Kangwook Lee
Jaewoong Cho
DiffM
CoGe
64
2
0
08 Jan 2025
RealCustom++: Representing Images as Real-Word for Real-Time Customization
RealCustom++: Representing Images as Real-Word for Real-Time Customization
Zhendong Mao
Mengqi Huang
Fei Ding
Mingcong Liu
Qian He
Xiaojun Chang
DiffM
75
6
0
03 Jan 2025
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
Hao Fei
Shengqiong Wu
H. Zhang
Tat-Seng Chua
Shuicheng Yan
64
38
0
31 Dec 2024
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
Hui Zhang
Zuxuan Wu
Zhen Xing
Jie Shao
Yu-Gang Jiang
51
9
0
31 Dec 2024
DrivingGPT: Unifying Driving World Modeling and Planning with
  Multi-modal Autoregressive Transformers
DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers
Yuntao Chen
Yuqi Wang
Zhaoxiang Zhang
149
7
0
24 Dec 2024
MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing
MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing
Chuang Yang
Bingxuan Zhao
Qing Zhou
Qi Wang
83
1
0
18 Dec 2024
LineArt: A Knowledge-guided Training-free High-quality Appearance
  Transfer for Design Drawing with Diffusion Model
LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model
Xi Wang
Yiming Li
Heng Fang
Yichen Peng
H. Xie
Xi Yang
Chuntao Li
DiffM
72
0
0
16 Dec 2024
Mojito: Motion Trajectory and Intensity Control for Video Generation
Mojito: Motion Trajectory and Intensity Control for Video Generation
Xuehai He
Shuohang Wang
Jianwei Yang
Xiaoxia Wu
Yali Wang
Kuan-Chieh Jackson Wang
Z. Zhan
Olatunji Ruwase
Yelong Shen
Qing Guo
VGen
86
1
0
12 Dec 2024
DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Q. He
Jinlong Peng
P. Xu
Boyuan Jiang
Xiaobin Hu
...
Yao Liu
Yuxiang Wang
Chengjie Wang
Xiaomeng Li
Jing Zhang
DiffM
122
1
0
04 Dec 2024
Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis in-the-Wild
Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis in-the-Wild
Siyoon Jin
Jisu Nam
Jiyoung Kim
Dahyun Chung
Yeong-Seok Kim
Joonhyung Park
Heonjeong Chu
Seungryong Kim
DiffM
82
0
0
04 Dec 2024
SimuScope: Realistic Endoscopic Synthetic Dataset Generation through
  Surgical Simulation and Diffusion Models
SimuScope: Realistic Endoscopic Synthetic Dataset Generation through Surgical Simulation and Diffusion Models
Sabina Martyniak
Joanna Kaleta
Diego DallÁlba
Michał Naskręt
Szymon Płotka
Przemysław Korzeniowski
MedIm
75
0
0
03 Dec 2024
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for
  Autonomous Driving
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving
Z. Wu
Jingcheng Ni
Xiaodong Wang
Yuxin Guo
Rui Chen
Lewei Lu
Jifeng Dai
Yuwen Xiong
74
6
0
02 Dec 2024
Improving Object Detection by Modifying Synthetic Data with Explainable AI
Improving Object Detection by Modifying Synthetic Data with Explainable AI
Nitish Mital
Simon Malzard
Richard Walters
Celso M. De Melo
Raghuveer Rao
Victoria Nockles
80
0
0
02 Dec 2024
DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image
  Diffusion Models
DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models
Shwetha Ram
T. Neiman
Qianli Feng
Andrew Stuart
S. D. Tran
Trishul M. Chilimbi
77
1
0
28 Nov 2024
Training Data Synthesis with Difficulty Controlled Diffusion Model
Training Data Synthesis with Difficulty Controlled Diffusion Model
Zerun Wang
Jiafeng Mao
Xueting Wang
Toshihiko Yamasaki
DiffM
80
0
0
27 Nov 2024
Relations, Negations, and Numbers: Looking for Logic in Generative
  Text-to-Image Models
Relations, Negations, and Numbers: Looking for Logic in Generative Text-to-Image Models
C. Conwell
Rupert Tawiah-Quashie
T. Ullman
74
2
0
26 Nov 2024
Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian
  Theory
Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory
Eric H. Jiang
Yasi Zhang
Zhi Zhang
Yixin Wan
Andrew Lizarraga
Shufan Li
Ying Nian Wu
DiffM
77
2
0
25 Nov 2024
Imagine and Seek: Improving Composed Image Retrieval with an Imagined
  Proxy
Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
Y. Li
Fan Ma
Yi Yang
138
2
0
24 Nov 2024
AnySynth: Harnessing the Power of Image Synthetic Data Generation for
  Generalized Vision-Language Tasks
AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks
Y. Li
Fan Ma
Yi Yang
DiffM
144
2
0
24 Nov 2024
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation
Datao Tang
Xiangyong Cao
Xuan Wu
Jialin Li
Jing Yao
Xueru Bai
Deyu Meng
Yin Li
Deyu Meng
DiffM
80
6
0
23 Nov 2024
LocRef-Diffusion:Tuning-Free Layout and Appearance-Guided Generation
LocRef-Diffusion:Tuning-Free Layout and Appearance-Guided Generation
Fan Deng
Yaguang Wu
Xinyang Yu
Xiangjun Huang
Jian Yang
Guangyu Yan
Qiang Xu
DiffM
89
0
0
22 Nov 2024
Generating Compositional Scenes via Text-to-image RGBA Instance Generation
Alessandro Fontanella
Petru-Daniel Tudosiu
Yongxin Yang
Shifeng Zhang
Sarah Parisot
36
2
0
16 Nov 2024
Boundary Attention Constrained Zero-Shot Layout-To-Image Generation
Boundary Attention Constrained Zero-Shot Layout-To-Image Generation
Huancheng Chen
Jingtao Li
Weiming Zhuang
H. Vikalo
Lingjuan Lyu
DiffM
36
0
0
15 Nov 2024
Token Merging for Training-Free Semantic Binding in Text-to-Image
  Synthesis
Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis
Taihang Hu
Linxuan Li
Joost van de Weijer
Hongcheng Gao
Fahad Shahbaz Khan
Jian Yang
Ming-Ming Cheng
Kai Wang
Yaxing Wang
DiffM
57
4
0
11 Nov 2024
Edify Image: High-Quality Image Generation with Pixel Space Laplacian
  Diffusion Models
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models
Nvidia
:
Yuval Atzmon
Maciej Bala
Yogesh Balaji
...
Ting-Chun Wang
Fangyin Wei
Xiaohui Zeng
Yu Zeng
Qinsheng Zhang
58
6
0
11 Nov 2024
Layout Control and Semantic Guidance with Attention Loss Backward for
  T2I Diffusion Model
Layout Control and Semantic Guidance with Attention Loss Backward for T2I Diffusion Model
Guandong Li
DiffM
28
0
0
11 Nov 2024
Region-Aware Text-to-Image Generation via Hard Binding and Soft
  Refinement
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement
Zhennan Chen
Yajie Li
Haofan Wang
Z. Chen
Zhengkai Jiang
Jun Yu Li
Qian Wang
Jian Yang
Ying Tai
DiffM
49
8
0
10 Nov 2024
Improving image synthesis with diffusion-negative sampling
Improving image synthesis with diffusion-negative sampling
Alakh Desai
Nuno Vasconcelos
DiffM
32
0
0
08 Nov 2024
SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation
SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation
Koichi Namekata
Sherwin Bahmani
Ziyi Wu
Yash Kant
Igor Gilitschenski
David B. Lindell
VGen
62
13
0
07 Nov 2024
Training-free Regional Prompting for Diffusion Transformers
Training-free Regional Prompting for Diffusion Transformers
Anthony Chen
Jianjin Xu
Wenzhao Zheng
Gaole Dai
Yuxiang Wang
Renrui Zhang
Haofan Wang
Shanghang Zhang
VLM
40
2
0
04 Nov 2024
Previous
12345...8910
Next