ResearchTrend.AI
arXiv: 2301.07093
GLIGEN: Open-Set Grounded Text-to-Image Generation

17 January 2023
Yuheng Li
Haotian Liu
Qingyang Wu
Fangzhou Mu
Jianwei Yang
Jianfeng Gao
Chunyuan Li
Yong Jae Lee
    VLM

Papers citing "GLIGEN: Open-Set Grounded Text-to-Image Generation"

50 / 472 papers shown
Text Guided Image Editing with Automatic Concept Locating and Forgetting
Jia Li
Lijie Hu
Zhixian He
Jingfeng Zhang
Tianhang Zheng
Di Wang
DiffM
46
8
0
30 May 2024
SketchDeco: Decorating B&W Sketches with Colour
Chaitat Utintu
Pinaki Nath Chowdhury
Aneeshan Sain
Subhadeep Koley
A. Bhunia
Yi-Zhe Song
DiffM
34
3
0
29 May 2024
Multi-modal Generation via Cross-Modal In-Context Learning
Amandeep Kumar
Muzammal Naseer
Sanath Narayan
Rao Muhammad Anwer
Salman Khan
Hisham Cholakkal
MLLM
53
0
0
28 May 2024
SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance
Guibao Shen
Luozhou Wang
Jiantao Lin
Wenhang Ge
Chaozhe Zhang
...
Pengfei Wan
Zhong-ming Wang
Guangyong Chen
Yijun Li
Yingcong Chen
40
8
0
24 May 2024
ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models
Jingyuan Zhu
Shiyu Li
Yuxuan Liu
Ping-Chia Huang
Jiulong Shan
Huimin Ma
Jian Yuan
37
4
0
24 May 2024
FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation
Xuehai He
Jian Zheng
Jacob Zhiyuan Fang
Robinson Piramuthu
Mohit Bansal
Vicente Ordonez
Gunnar A. Sigurdsson
Nanyun Peng
Xin Eric Wang
DiffM
47
1
0
08 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGen
LM&Ro
87
37
0
06 May 2024
Customizing Text-to-Image Models with a Single Image Pair
Maxwell Jones
Sheng-Yu Wang
Nupur Kumari
David Bau
Jun-Yan Zhu
DiffM
25
19
0
02 May 2024
Improving Subject-Driven Image Synthesis with Subject-Agnostic Guidance
Kelvin C. K. Chan
Yang Zhao
Xuhui Jia
Ming-Hsuan Yang
Huisheng Wang
22
3
0
02 May 2024
Automated Virtual Product Placement and Assessment in Images using Diffusion Models
Mohammad Mahmudul Alam
Negin Sokhandan
Emmett Goodman
DiffM
24
0
0
02 May 2024
Lazy Layers to Make Fine-Tuned Diffusion Models More Traceable
Haozhe Liu
Wentian Zhang
Bing Li
Bernard Ghanem
Jürgen Schmidhuber
DiffM
WIGM
AAML
33
1
0
01 May 2024
DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing
Minghao Chen
Iro Laina
Andrea Vedaldi
3DGS
42
23
0
29 Apr 2024
Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation
Tianyidan Xie
Rui Ma
Qian Wang
Xiaoqian Ye
Feixuan Liu
Ying Tai
Zhenyu Zhang
Lanjun Wang
Zili Yi
DiffM
MLLM
47
2
0
29 Apr 2024
Exposing Text-Image Inconsistency Using Diffusion Models
Mingzhen Huang
Shan Jia
Zhou Zhou
Yan Ju
Jialing Cai
Siwei Lyu
46
7
0
28 Apr 2024
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Navve Wasserman
Noam Rotstein
Roy Ganz
Ron Kimmel
DiffM
39
15
0
28 Apr 2024
ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion
Ziyue Zhang
Mingbao Lin
Rongrong Ji
Yuxin Zhang
Rongrong Ji
DiffM
56
3
0
26 Apr 2024
Editable Image Elements for Controllable Synthesis
Jiteng Mu
Michael Gharbi
Richard Zhang
Eli Shechtman
Nuno Vasconcelos
Xiaolong Wang
Taesung Park
DiffM
53
9
0
24 Apr 2024
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
Qinghe Wang
Baolu Li
Xiaomin Li
Bing Cao
Liqian Ma
Huchuan Lu
Xu Jia
DiffM
42
6
0
24 Apr 2024
GLoD: Composing Global Contexts and Local Details in Image Generation
Moyuru Yamada
DiffM
21
1
0
23 Apr 2024
ControlMol: Adding Substruture Control To Molecule Diffusion Models
Zhengyang Qi
Zijing Liu
Jiying Zhang
He Cao
Yu Li
50
2
0
22 Apr 2024
Towards Better Text-to-Image Generation Alignment via Attention Modulation
Yihang Wu
Xiao Cao
Kaixin Li
Zitan Chen
Haonan Wang
Lei Meng
Zhiyong Huang
DiffM
32
5
0
22 Apr 2024
LTOS: Layout-controllable Text-Object Synthesis via Adaptive Cross-attention Fusions
Xiaoran Zhao
Tianhao Wu
Yu Lai
Zhiliang Tian
Zhen Huang
Yahui Liu
Zejiang He
Dongsheng Li
DiffM
38
1
0
21 Apr 2024
FilterPrompt: Guiding Image Transfer in Diffusion Models
Xi Wang
Yichen Peng
Heng Fang
Haoran Xie
Xi Yang
Chuntao Li
DiffM
40
0
0
20 Apr 2024
TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation
Tianyi Liang
Jiangqi Liu
Sicheng Song
Shiqi Jiang
Yifei Huang
Changbo Wang
Chenhui Li
42
0
0
18 Apr 2024
TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing
Sherry X. Chen
Yaron Vaxman
Elad Ben Baruch
David Asulin
Aviad Moreshet
Kuo-Chin Lien
Misha Sra
Pradeep Sen
39
3
0
17 Apr 2024
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
Han Lin
Jaemin Cho
Abhaysinh Zala
Mohit Bansal
DiffM
VGen
69
20
0
15 Apr 2024
Exploring Generative AI for Sim2Real in Driving Data Synthesis
Haonan Zhao
Yiting Wang
Thomas Bashford-Rogers
Valentina Donzella
Kurt Debattista
GAN
34
4
0
14 Apr 2024
COCONut: Modernizing COCO Segmentation
XueQing Deng
Qihang Yu
Peng Wang
Xiaohui Shen
Liang-Chieh Chen
42
16
0
12 Apr 2024
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Ming Li
Taojiannan Yang
Huafeng Kuang
Jie Wu
Zhaoning Wang
Xuefeng Xiao
Cheng Chen
35
63
0
11 Apr 2024
How is Visual Attention Influenced by Text Guidance? Database and Model
Yinan Sun
Xiongkuo Min
Huiyu Duan
Guangtao Zhai
112
4
0
11 Apr 2024
UMBRAE: Unified Multimodal Brain Decoding
Weihao Xia
Raoul de Charette
Cengiz Öztireli
Jing-Hao Xue
38
6
0
10 Apr 2024
Move Anything with Layered Scene Diffusion
Jiawei Ren
Mengmeng Xu
Jui-Chieh Wu
Ziwei Liu
Tao Xiang
Antoine Toisoul
29
9
0
10 Apr 2024
Identification of Fine-grained Systematic Errors via Controlled Scene Generation
Valentyn Boreiko
Matthias Hein
J. H. Metzen
35
1
0
10 Apr 2024
ZeST: Zero-Shot Material Transfer from a Single Image
Ta-Ying Cheng
Prafull Sharma
Andrew Markham
Niki Trigoni
Varun Jampani
41
9
0
09 Apr 2024
SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing
Jing Gu
Yilin Wang
Nanxuan Zhao
Wei Xiong
Qing Liu
Zhifei Zhang
He Zhang
Jianming Zhang
HyunJoon Jung
Xin Eric Wang
DiffM
32
8
0
08 Apr 2024
Mask-ControlNet: Higher-Quality Image Generation with An Additional Mask Prompt
Zhiqi Huang
Hui Xiong
Haoyu Wang
Longguang Wang
Zhiheng Li
DiffM
38
0
0
08 Apr 2024
ShoeModel: Learning to Wear on the User-specified Shoes via Diffusion Model
Binghui Chen
Wenyu Li
Yifeng Geng
Xuansong Xie
Wangmeng Zuo
DiffM
37
3
0
07 Apr 2024
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Shenghai Yuan
Jinfa Huang
Yujun Shi
Yongqi Xu
Ruijie Zhu
Bin Lin
Xinhua Cheng
Li-xin Yuan
Jiebo Luo
VGen
78
33
0
07 Apr 2024
BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion
Gwanghyun Kim
Hayeon Kim
H. Seo
Dong un Kang
Se Young Chun
43
4
0
06 Apr 2024
ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing
Alec Helbling
Seongmin Lee
Polo Chau
DiffM
19
1
0
05 Apr 2024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Dongzhi Jiang
Guanglu Song
Xiaoshi Wu
Renrui Zhang
Dazhong Shen
Zhuofan Zong
Yu Liu
Hongsheng Li
VLM
30
20
0
04 Apr 2024
Multi Positive Contrastive Learning with Pose-Consistent Generated Images
Sho Inayoshi
Aji Resindra Widya
Satoshi Ozaki
Junji Otsuka
Takeshi Ohashi
3DH
52
1
0
04 Apr 2024
Getting it Right: Improving Spatial Consistency in Text-to-Image Models
Agneet Chatterjee
Gabriela Ben-Melech Stan
Estelle Aflalo
Sayak Paul
Dhruba Ghosh
...
Ludwig Schmidt
Hanna Hajishirzi
Vasudev Lal
Chitta Baral
Yezhou Yang
EGVM
VLM
59
15
0
01 Apr 2024
PosterLlama: Bridging Design Ability of Langauge Model to Contents-Aware Layout Generation
Jaejung Seol
Seojun Kim
Jaejun Yoo
3DV
VLM
34
7
0
01 Apr 2024
Relation Rectification in Diffusion Model
Yinwei Wu
Xingyi Yang
Xinchao Wang
28
6
0
29 Mar 2024
SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control
Binyuan Huang
Yuqing Wen
Yucheng Zhao
Yaosi Hu
Yingfei Liu
...
Tiancai Wang
Chi Zhang
Chang Wen Chen
Zhenzhong Chen
Xiangyu Zhang
43
15
0
28 Mar 2024
Imperceptible Protection against Style Imitation from Diffusion Models
Namhyuk Ahn
Wonhyuk Ahn
Kiyoon Yoo
Daesik Kim
Seung-Hun Nam
WIGM
AAML
DiffM
49
5
0
28 Mar 2024
Attention Calibration for Disentangled Text-to-Image Personalization
Yanbing Zhang
Mengping Yang
Qin Zhou
Zhe Wang
31
15
0
27 Mar 2024
DODA: Adapting Object Detectors to Dynamic Agricultural Environments in Real-Time with Diffusion
Shuai Xiang
Pieter M. Blok
James Burridge
Haozhou Wang
Wei Guo
31
0
0
27 Mar 2024
Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation
Omer Dahary
Or Patashnik
Kfir Aberman
Daniel Cohen-Or
DiffM
37
28
0
25 Mar 2024