ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.07093
  4. Cited By
GLIGEN: Open-Set Grounded Text-to-Image Generation

GLIGEN: Open-Set Grounded Text-to-Image Generation

17 January 2023
Yuheng Li
Haotian Liu
Qingyang Wu
Fangzhou Mu
Jianwei Yang
Jianfeng Gao
Chunyuan Li
Yong Jae Lee
    VLM
ArXivPDFHTML

Papers citing "GLIGEN: Open-Set Grounded Text-to-Image Generation"

50 / 472 papers shown
Title
Generate Anything Anywhere in Any Scene
Generate Anything Anywhere in Any Scene
Yuheng Li
Haotian Liu
Yangming Wen
Yong Jae Lee
DiffM
59
12
0
29 Jun 2023
Localized Text-to-Image Generation for Free via Cross Attention Control
Localized Text-to-Image Generation for Free via Cross Attention Control
Yutong He
Ruslan Salakhutdinov
J. Zico Kolter
DiffM
64
21
0
26 Jun 2023
A-STAR: Test-time Attention Segregation and Retention for Text-to-image
  Synthesis
A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis
Aishwarya Agarwal
Srikrishna Karanam
K. J. Joseph
Apoorv Saxena
Koustava Goswami
Balaji Vasan Srinivasan
VLM
DiffM
11
46
0
26 Jun 2023
Text-Anchored Score Composition: Tackling Condition Misalignment in
  Text-to-Image Diffusion Models
Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models
Luozhou Wang
Guibao Shen
Wenhang Ge
Guangyong Chen
Yijun Li
Yingke Chen
DiffM
38
4
0
26 Jun 2023
Zero-shot spatial layout conditioning for text-to-image diffusion models
Zero-shot spatial layout conditioning for text-to-image diffusion models
Guillaume Couairon
Marlene Careil
Matthieu Cord
Stéphane Lathuilière
Jakob Verbeek
VLM
16
63
0
23 Jun 2023
Continuous Layout Editing of Single Images with Diffusion Models
Continuous Layout Editing of Single Images with Diffusion Models
Zhiyuan Zhang
Zhitong Huang
J. Liao
DiffM
21
10
0
22 Jun 2023
DreamEdit: Subject-driven Image Editing
DreamEdit: Subject-driven Image Editing
Tianle Li
Max W.F. Ku
Cong Wei
Wenhu Chen
EGVM
24
25
0
22 Jun 2023
Grounded Text-to-Image Synthesis with Attention Refocusing
Grounded Text-to-Image Synthesis with Attention Refocusing
Quynh Phung
Songwei Ge
Jia-Bin Huang
DiffM
25
104
0
08 Jun 2023
Improving Tuning-Free Real Image Editing with Proximal Guidance
Improving Tuning-Free Real Image Editing with Proximal Guidance
Ligong Han
Song Wen
Qi Chen
Zhixing Zhang
Kunpeng Song
...
Qilong Zhangli
Jindong Jiang
Zhaoyang Xia
Akash Srivastava
Dimitris N. Metaxas
DiffM
24
56
0
08 Jun 2023
WOUAF: Weight Modulation for User Attribution and Fingerprinting in
  Text-to-Image Diffusion Models
WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Changhoon Kim
Kyle Min
Maitreya Patel
Sheng Cheng
Yezhou Yang
WIGM
24
28
0
07 Jun 2023
GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data
  Generation
GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation
Kai Chen
Enze Xie
Zhe Chen
Yibo Wang
Lanqing Hong
Zhenguo Li
Dit-Yan Yeung
DiffM
20
21
0
07 Jun 2023
A survey of Generative AI Applications
A survey of Generative AI Applications
Roberto Gozalo-Brizuela
Eduardo C. Garrido-Merchán
3DV
MedIm
21
80
0
05 Jun 2023
Efficient Text-Guided 3D-Aware Portrait Generation with Score
  Distillation Sampling on Distribution
Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution
Yiji Cheng
Fei Yin
Xiaoke Huang
Xintong Yu
Jiaxiang Liu
Shi Feng
Yujiu Yang
Yansong Tang
DiffM
26
4
0
03 Jun 2023
Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image
  Generation
Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation
Minghui Hu
Jianbin Zheng
Daqing Liu
Chuanxia Zheng
Chaoyue Wang
Dacheng Tao
Tat-Jen Cham
DiffM
25
9
0
01 Jun 2023
Make-Your-Video: Customized Video Generation Using Textual and
  Structural Guidance
Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Jinbo Xing
Menghan Xia
Yuxin Liu
Yuechen Zhang
Yong Zhang
...
Haoxin Chen
Xiaodong Cun
Xintao Wang
Ying Shan
T. Wong
VGen
DiffM
39
84
0
01 Jun 2023
Controllable Text-to-Image Generation with GPT-4
Controllable Text-to-Image Generation with GPT-4
Tianjun Zhang
Yi Zhang
Vibhav Vineet
Neel Joshi
Xin Wang
DiffM
18
42
0
29 May 2023
Photoswap: Personalized Subject Swapping in Images
Photoswap: Personalized Subject Swapping in Images
Jing Gu
Yilin Wang
Nanxuan Zhao
Tsu-jui Fu
Wei Xiong
...
Zhifei Zhang
He Zhang
Jianming Zhang
Hyun-Sun Jung
Xin Eric Wang
DiffM
26
37
0
29 May 2023
TaleCrafter: Interactive Story Visualization with Multiple Characters
TaleCrafter: Interactive Story Visualization with Multiple Characters
Yuan Gong
Youxin Pang
Xiaodong Cun
Menghan Xia
Yingqing He
...
Longyue Wang
Yong Zhang
Xintao Wang
Ying Shan
Yujiu Yang
DiffM
27
45
0
29 May 2023
InstructEdit: Improving Automatic Masks for Diffusion-based Image
  Editing With User Instructions
InstructEdit: Improving Automatic Masks for Diffusion-based Image Editing With User Instructions
Qian Wang
Biao Zhang
Michael Birsak
Peter Wonka
DiffM
30
31
0
29 May 2023
Text-to-image Editing by Image Information Removal
Text-to-image Editing by Image Information Removal
Zhongping Zhang
Jian Zheng
Jacob Zhiyuan Fang
Bryan A. Plummer
DiffM
21
12
0
27 May 2023
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Shihao Zhao
Dongdong Chen
Yen-Chun Chen
Jianmin Bao
Shaozhe Hao
Lu Yuan
Kwan-Yee K. Wong
27
234
0
25 May 2023
Towards Language-guided Interactive 3D Generation: LLMs as Layout
  Interpreter with Generative Feedback
Towards Language-guided Interactive 3D Generation: LLMs as Layout Interpreter with Generative Feedback
Yiqi Lin
Hao Wu
Ruichen Wang
H. Lu
Xiaodong Lin
Hui Xiong
Lin Wang
3DV
42
12
0
25 May 2023
Custom-Edit: Text-Guided Image Editing with Customized Diffusion Models
Custom-Edit: Text-Guided Image Editing with Customized Diffusion Models
Jooyoung Choi
Yunjey Choi
Yunji Kim
Junho Kim
Sung-Hoon Yoon
DiffM
28
52
0
25 May 2023
LayoutGPT: Compositional Visual Planning and Generation with Large
  Language Models
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
Weixi Feng
Wanrong Zhu
Tsu-jui Fu
Varun Jampani
Arjun Reddy Akula
Xuehai He
Sugato Basu
Qing Guo
William Yang Wang
MLLM
27
162
0
24 May 2023
Visual Programming for Text-to-Image Generation and Evaluation
Visual Programming for Text-to-Image Generation and Evaluation
Jaemin Cho
Abhaysinh Zala
Joey Tianyi Zhou
MLLM
21
50
0
24 May 2023
DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion
  Models
DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models
Sungnyun Kim
Junsoo Lee
Kibeom Hong
Daesik Kim
Namhyuk Ahn
DiffM
19
14
0
24 May 2023
Vision + Language Applications: A Survey
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
30
5
0
24 May 2023
DirecT2V: Large Language Models are Frame-Level Directors for Zero-Shot
  Text-to-Video Generation
DirecT2V: Large Language Models are Frame-Level Directors for Zero-Shot Text-to-Video Generation
Susung Hong
Junyoung Seo
Heeseong Shin
Sung‐Jin Hong
Seung Wook Kim
DiffM
VGen
25
34
0
23 May 2023
Compositional Text-to-Image Synthesis with Attention Map Control of
  Diffusion Models
Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models
Ruichen Wang
Zekang Chen
Chen Chen
Jiancang Ma
H. Lu
Xiaodong Lin
DiffM
52
65
0
23 May 2023
VisorGPT: Learning Visual Prior via Generative Pre-Training
VisorGPT: Learning Visual Prior via Generative Pre-Training
Jinheng Xie
Kai Ye
Yudong Li
Yuexiang Li
Kevin Qinghong Lin
Yefeng Zheng
Linlin Shen
Mike Zheng Shou
ViT
95
8
0
23 May 2023
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image
  Diffusion Models with Large Language Models
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models
Long Lian
Boyi Li
Adam Yala
Trevor Darrell
40
152
0
23 May 2023
LeftRefill: Filling Right Canvas based on Left Reference through
  Generalized Text-to-Image Diffusion Model
LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model
Chenjie Cao
Yunuo Cai
Qiaole Dong
Yikai Wang
Yanwei Fu
DiffM
40
14
0
19 May 2023
LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis
LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis
Chang-Shu Liu
Rui Li
Kaidong Zhang
Xin Luo
Dong Liu
DiffM
29
3
0
19 May 2023
UniControl: A Unified Diffusion Model for Controllable Visual Generation
  In the Wild
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
Can Qin
Shu Zhen Zhang
Ning Yu
Yihao Feng
Xinyi Yang
...
Caiming Xiong
Silvio Savarese
Stefano Ermon
Yun Fu
Ran Xu
23
118
0
18 May 2023
Let the Chart Spark: Embedding Semantic Context into Chart with
  Text-to-Image Generative Model
Let the Chart Spark: Embedding Semantic Context into Chart with Text-to-Image Generative Model
Shishi Xiao
Suizi Huang
Yue Lin
Yilin Ye
Weizhen Zeng
41
30
0
28 Apr 2023
Visual Instruction Tuning
Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
107
4,264
0
17 Apr 2023
Expressive Text-to-Image Generation with Rich Text
Expressive Text-to-Image Generation with Rich Text
Songwei Ge
Taesung Park
Jun-Yan Zhu
Jia-Bin Huang
DiffM
79
79
0
13 Apr 2023
Segment Everything Everywhere All at Once
Segment Everything Everywhere All at Once
Xueyan Zou
Jianwei Yang
Hao Zhang
Feng Li
Linjie Li
Jianfeng Wang
Lijuan Wang
Jianfeng Gao
Yong Jae Lee
MLLM
VLM
9
457
0
13 Apr 2023
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image
  Generation
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
Jaemin Cho
Linjie Li
Zhengyuan Yang
Zhe Gan
Lijuan Wang
Joey Tianyi Zhou
EGVM
11
5
0
13 Apr 2023
Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into
  3D, alleviate Janus problem and Beyond
Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond
Mohammadreza Armandpour
A. Sadeghian
Huangjie Zheng
Amir Sadeghian
Mingyuan Zhou
DiffM
18
123
0
11 Apr 2023
HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image
  Generation
HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation
Xu Ju
Ailing Zeng
Chenchen Zhao
Jianan Wang
Lei Zhang
Qian Xu
DiffM
25
86
0
09 Apr 2023
Harnessing the Spatial-Temporal Attention of Diffusion Models for
  High-Fidelity Text-to-Image Synthesis
Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
Qiucheng Wu
Yujian Liu
Handong Zhao
T. Bui
Zhe-nan Lin
Yang Zhang
Shiyu Chang
DiffM
42
44
0
07 Apr 2023
InstantBooth: Personalized Text-to-Image Generation without Test-Time
  Finetuning
InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
Jing Shi
Wei Xiong
Zhe-nan Lin
H. J. Jung
DiffM
124
279
0
06 Apr 2023
Training-Free Layout Control with Cross-Attention Guidance
Training-Free Layout Control with Cross-Attention Guidance
Minghao Chen
Iro Laina
Andrea Vedaldi
DiffM
135
222
0
06 Apr 2023
viz2viz: Prompt-driven stylized visualization generation using a
  diffusion model
viz2viz: Prompt-driven stylized visualization generation using a diffusion model
Jiaqi Wu
John Joon Young Chung
Eytan Adar
DiffM
6
12
0
04 Apr 2023
Reference-based Image Composition with Sketch via Structure-aware
  Diffusion Model
Reference-based Image Composition with Sketch via Structure-aware Diffusion Model
Kangyeol Kim
S. Park
Junsoo Lee
Jaegul Choo
DiffM
16
12
0
31 Mar 2023
Training-free Content Injection using h-space in Diffusion Models
Training-free Content Injection using h-space in Diffusion Models
Jaeseok Jeong
Mingi Kwon
Youngjung Uh
DiffM
26
24
0
27 Mar 2023
DreamBooth3D: Subject-Driven Text-to-3D Generation
DreamBooth3D: Subject-Driven Text-to-3D Generation
Amit Raj
S. Kaza
Ben Poole
Michael Niemeyer
Nataniel Ruiz
...
Kfir Aberman
Michael Rubinstein
Jonathan T. Barron
Yuanzhen Li
Varun Jampani
DiffM
24
220
0
23 Mar 2023
Evaluation of Sketch-Based and Semantic-Based Modalities for Mockup
  Generation
Evaluation of Sketch-Based and Semantic-Based Modalities for Mockup Generation
Tommaso Calò
Luigi De Russis
15
0
0
22 Mar 2023
Localizing Object-level Shape Variations with Text-to-Image Diffusion
  Models
Localizing Object-level Shape Variations with Text-to-Image Diffusion Models
Or Patashnik
Daniel Garibi
Idan Azuri
Hadar Averbuch-Elor
Daniel Cohen-Or
DiffM
34
110
0
20 Mar 2023
Previous
123...1089
Next