ResearchTrend.AI
GLIGEN: Open-Set Grounded Text-to-Image Generation

17 January 2023
Yuheng Li
Haotian Liu
Qingyang Wu
Fangzhou Mu
Jianwei Yang
Jianfeng Gao
Chunyuan Li
Yong Jae Lee
    VLM

Papers citing "GLIGEN: Open-Set Grounded Text-to-Image Generation"

50 / 472 papers shown
Title
Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation
Yingshan Chang
Yasi Zhang
Zhiyuan Fang
Yingnian Wu
Yonatan Bisk
Feng Gao
EGVM
42
5
0
25 Mar 2024
Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion
Hossein Souri
Arpit Bansal
Hamid Kazemi
Liam H. Fowl
Aniruddha Saha
Jonas Geiping
Andrew Gordon Wilson
Rama Chellappa
Tom Goldstein
Micah Goldblum
SILM
DiffM
21
1
0
25 Mar 2024
EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing
Xiangpeng Yang
Linchao Zhu
Hehe Fan
Yi Yang
DiffM
VGen
22
9
0
24 Mar 2024
Spectral Motion Alignment for Video Motion Transfer using Diffusion Models
Geon Yeong Park
Hyeonho Jeong
Sang Wan Lee
Jong Chul Ye
VGen
DiffM
37
10
0
22 Mar 2024
ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer
Hiroki Azuma
Yusuke Matsui
Atsuto Maki
VLM
39
1
0
20 Mar 2024
ReGround: Improving Textual and Spatial Grounding at No Cost
Yuseung Lee
Minhyuk Sung
DiffM
28
2
0
20 Mar 2024
Ground-A-Score: Scaling Up the Score Distillation for Multi-Attribute Editing
Hangeol Chang
Jinho Chang
Jong Chul Ye
DiffM
37
3
0
20 Mar 2024
DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Yibo Wang
Ruiyuan Gao
Kai Chen
Kaiqiang Zhou
Yingjie Cai
...
Zhenguo Li
Lihui Jiang
Dit-Yan Yeung
Qiang Xu
Kai Zhang
DiffM
118
21
0
20 Mar 2024
One-Step Image Translation with Text-to-Image Models
Gaurav Parmar
Taesung Park
Srinivasa Narasimhan
Jun-Yan Zhu
37
43
0
18 Mar 2024
DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing
Hyeonho Jeong
Jinho Chang
Geon Yeong Park
Jong Chul Ye
DiffM
VGen
29
13
0
18 Mar 2024
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models
Yang Yang
Wen Wang
Liang Peng
Chaotian Song
Yao Chen
...
Xiaolong Yang
Qinglin Lu
Deng Cai
Boxi Wu
Wei Liu
MoMe
72
25
0
18 Mar 2024
IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation
Yizhi Song
Zhifei Zhang
Zhe Lin
Scott D. Cohen
Brian L. Price
Jianming Zhang
Soo Ye Kim
He Zhang
Wei Xiong
Daniel G. Aliaga
DiffM
74
36
0
15 Mar 2024
Animate Your Motion: Turning Still Images into Dynamic Videos
Mingxiao Li
Bo Wan
Marie-Francine Moens
Tinne Tuytelaars
VGen
DiffM
35
4
0
15 Mar 2024
ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
43
1
0
15 Mar 2024
Desigen: A Pipeline for Controllable Design Template Generation
Haohan Weng
Danqing Huang
Yu Qiao
Zheng Hu
Chin-Yew Lin
Tong Zhang
Cheng Chen
DiffM
19
14
0
14 Mar 2024
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Jialu Li
Jaemin Cho
Yi-Lin Sung
Jaehong Yoon
Mohit Bansal
MoMe
DiffM
44
8
0
11 Mar 2024
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Guosheng Zhao
Xiaofeng Wang
Zheng Zhu
Xinze Chen
Guan Huang
Xiaoyi Bao
Xingang Wang
VGen
40
65
0
11 Mar 2024
DivCon: Divide and Conquer for Progressive Text-to-Image Generation
Yuhao Jia
Wenhan Tan
DiffM
57
1
0
11 Mar 2024
Audio-Synchronized Visual Animation
Lin Zhang
Shentong Mo
Yijing Zhang
Pedro Morgado
DiffM
43
19
0
08 Mar 2024
VLM-PL: Advanced Pseudo Labeling Approach for Class Incremental Object Detection via Vision-Language Model
Junsu Kim
Yunhoe Ku
Jihyeon Kim
Junuk Cha
Seungryul Baek
ObjD
VLM
37
12
0
08 Mar 2024
PrimeComposer: Faster Progressively Combined Diffusion for Image Composition with Attention Steering
Yibin Wang
Weizhong Zhang
Jianwei Zheng
Cheng Jin
DiffM
68
9
0
08 Mar 2024
Discriminative Probing and Tuning for Text-to-Image Generation
Leigang Qu
Wenjie Wang
Yongqi Li
Hanwang Zhang
Liqiang Nie
Tat-Seng Chua
36
7
0
07 Mar 2024
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Pu Cao
Feng Zhou
Qing-Huang Song
Lu Yang
72
35
0
07 Mar 2024
FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
Chao Xu
Yang Liu
Jiazheng Xing
Weida Wang
Mingze Sun
...
Tianxin Huang
Siyuan Li
Zhi-Qi Cheng
Ying Tai
Baigui Sun
CVBM
54
11
0
04 Mar 2024
RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
Mengqi Huang
Zhendong Mao
Mingcong Liu
Qian He
Yongdong Zhang
DiffM
45
22
0
01 Mar 2024
When ControlNet Meets Inexplicit Masks: A Case Study of ControlNet on its Contour-following Ability
Wenjie Xuan
Yufei Xu
Shanshan Zhao
Chaoyue Wang
Juhua Liu
Bo Du
Dacheng Tao
26
2
0
01 Mar 2024
Abductive Ego-View Accident Video Understanding for Safe Driving Perception
Jianwu Fang
Lei-lei Li
Junfei Zhou
Junbin Xiao
Hongkai Yu
Chen Lv
Jianru Xue
Tat-Seng Chua
34
14
0
01 Mar 2024
Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models
Ashkan Taghipour
Morteza Ghahremani
Bennamoun
Aref Miri Rekavandi
Hamid Laga
F. Boussaïd
DiffM
37
5
0
27 Feb 2024
SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection
Junsu Kim
Hoseong Cho
Jihyeon Kim
Yihalem Yimolal Tiruneh
Seungryul Baek
DiffM
43
20
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
EGVM
66
85
0
27 Feb 2024
Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormality Screening
Zhenrong Shen
Manman Fei
Xin Wang
Jiangdong Cai
Sheng Wang
Lichi Zhang
Qian Wang
MedIm
42
0
0
22 Feb 2024
Layout-to-Image Generation with Localized Descriptions using ControlNet with Cross-Attention Control
Denis Lukovnikov
Asja Fischer
DiffM
30
3
0
20 Feb 2024
FlashTex: Fast Relightable Mesh Texturing with LightControlNet
Kangle Deng
Timothy Omernick
Alexander Weiss
Deva Ramanan
Jun-Yan Zhu
Tinghui Zhou
Maneesh Agrawala
46
27
0
20 Feb 2024
RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
Xinchen Zhang
Ling Yang
Yaqi Cai
Zhaochen Yu
Kai-Ni Wang
...
Ye Tian
Minkai Xu
Yong Tang
Yujiu Yang
Bin Cui
DiffM
34
5
0
20 Feb 2024
MuLan: Multimodal-LLM Agent for Progressive and Interactive Multi-Object Diffusion
Sen Li
Ruochen Wang
Cho-Jui Hsieh
Minhao Cheng
Tianyi Zhou
MLLM
LM&Ro
40
3
0
20 Feb 2024
Dynamic and Super-Personalized Media Ecosystem Driven by Generative AI: Unpredictable Plays Never Repeating The Same
Sungjun Ahn
Hyun-Jeong Yim
Youngwan Lee
Sung-Ik Park
VGen
41
4
0
19 Feb 2024
Transfer learning with generative models for object detection on limited datasets
Matteo Paiano
Stefano Martina
Carlotta Giannelli
Filippo Caruso
42
3
0
09 Feb 2024
MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis
Dewei Zhou
You Li
Fan Ma
Zongxin Yang
Yi Yang
DiffM
20
57
0
08 Feb 2024
Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models
Senmao Li
J. Weijer
Taihang Hu
Fahad Shahbaz Khan
Qibin Hou
Yaxing Wang
Jian Yang
DiffM
47
27
0
08 Feb 2024
λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Maitreya Patel
Sangmin Jung
Chitta Baral
Yezhou Yang
VLM
31
28
0
07 Feb 2024
InstanceDiffusion: Instance-level Control for Image Generation
Xudong Wang
Trevor Darrell
Sai Saketh Rambhatla
Rohit Girdhar
Ishan Misra
VLM
DiffM
34
84
0
05 Feb 2024
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion
Shiyuan Yang
Liang Hou
Haibin Huang
Chongyang Ma
Pengfei Wan
Di Zhang
Xiaodong Chen
Jing Liao
VGen
DiffM
66
77
0
05 Feb 2024
Boximator: Generating Rich and Controllable Motions for Video Synthesis
Jiawei Wang
Yuchen Zhang
Jiaxin Zou
Yan Zeng
Guoqiang Wei
Liping Yuan
Hang Li
DiffM
VGen
32
43
0
02 Feb 2024
Spatial-Aware Latent Initialization for Controllable Image Generation
Wenqiang Sun
Tengtao Li
Zehong Lin
Jun Zhang
39
10
0
29 Jan 2024
A Survey on Data Augmentation in Large Model Era
Yue Zhou
Chenlu Guo
Xu Wang
Yi-Ju Chang
Yuan Wu
LM&MA
VLM
49
23
0
27 Jan 2024
BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models
Senthil Purushwalkam
Akash Gokul
Shafiq R. Joty
Nikhil Naik
DiffM
39
16
0
25 Jan 2024
ChatterBox: Multi-round Multimodal Referring and Grounding
Yunjie Tian
Tianren Ma
Lingxi Xie
Jihao Qiu
Xi Tang
Yuan Zhang
Jianbin Jiao
Qi Tian
Qixiang Ye
23
14
0
24 Jan 2024
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Jonghyun Lee
Hansam Cho
Youngjoon Yoo
Seoung Bum Kim
Yonghyun Jeong
DiffM
15
7
0
17 Jan 2024
Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Seung Hyun Lee
Yinxiao Li
Junjie Ke
Innfarn Yoo
Han Zhang
...
Junfeng He
Gang Li
Sangpil Kim
Irfan Essa
Feng Yang
EGVM
35
18
0
11 Jan 2024
Joint Generative Modeling of Scene Graphs and Images via Diffusion Models
Bicheng Xu
Qi Yan
Renjie Liao
Lele Wang
Leonid Sigal
DiffM
29
2
0
02 Jan 2024