2301.07093
Cited By
GLIGEN: Open-Set Grounded Text-to-Image Generation
17 January 2023
Yuheng Li
Haotian Liu
Qingyang Wu
Fangzhou Mu
Jianwei Yang
Jianfeng Gao
Chunyuan Li
Yong Jae Lee
VLM
Papers citing
"GLIGEN: Open-Set Grounded Text-to-Image Generation"
50 / 472 papers shown
A Generalist FaceX via Learning Unified Facial Representation
Yue Han
Jiangning Zhang
Junwei Zhu
Xiangtai Li
Yanhao Ge
Wei Li
Chengjie Wang
Yong Liu
Xiaoming Liu
Ying Tai
DiffM
27
13
0
31 Dec 2023
TrailBlazer: Trajectory Control for Diffusion-Based Video Generation
W. Ma
J. P. Lewis
W. Kleijn
DiffM
VGen
24
34
0
31 Dec 2023
Generating Enhanced Negatives for Training Language-Based Object Detectors
Shiyu Zhao
Long Zhao
Vijay Kumar B.G
Yumin Suh
Dimitris N. Metaxas
Manmohan Chandraker
S. Schulter
ObjD
VLM
39
5
0
29 Dec 2023
iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views
Chin-Hsuan Wu
Yen-Chun Chen
Bolivar Solarte
Lu Yuan
Min Sun
24
9
0
28 Dec 2023
Semantic Guidance Tuning for Text-To-Image Diffusion Models
Hyun Kang
Dohae Lee
Myungjin Shin
In-Kwon Lee
29
1
0
26 Dec 2023
Diff-Oracle: Deciphering Oracle Bone Scripts with Controllable Diffusion Model
Jing Li
Qiuxia Wang
Siyuan Wang
Rui Zhang
Kaizhu Huang
Erik Cambria
47
0
0
21 Dec 2023
DreamDistribution: Learning Prompt Distribution for Diverse In-distribution Generation
Brian Nlong Zhao
Yuhang Xiao
Lyne Tchapmi
Xinyang Jiang
Yifan Yang
Dongsheng Li
Laurent Itti
Vibhav Vineet
Yunhao Ge
VLM
112
7
0
21 Dec 2023
SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process
Meng Wang
Henghui Ding
Jun Hao Liew
Jiajun Liu
Yao-Min Zhao
Yunchao Wei
DiffM
27
17
0
19 Dec 2023
MagicScroll: Nontypical Aspect-Ratio Image Generation for Visual Storytelling via Multi-Layered Semantic-Aware Denoising
Bingyuan Wang
Hengyu Meng
Zeyu Cai
Lanjiong Li
Yue Ma
Qifeng Chen
Zeyu Wang
DiffM
29
3
0
18 Dec 2023
Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models
Senmao Li
Taihang Hu
Fahad Shahbaz Khan
Linxuan Li
Shiqi Yang
Yaxing Wang
Ming-Ming Cheng
Jian Yang
DiffM
34
1
0
15 Dec 2023
SHAP-EDITOR: Instruction-guided Latent 3D Editing in Seconds
Minghao Chen
Junyu Xie
Iro Laina
Andrea Vedaldi
KELM
47
9
0
14 Dec 2023
The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization
Jiafeng Mao
Xueting Wang
Kiyoharu Aizawa
DiffM
57
3
0
13 Dec 2023
FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition
Sicheng Mo
Fangzhou Mu
Kuan Heng Lin
Yanli Liu
Bochen Guan
Yin Li
Bolei Zhou
DiffM
45
59
0
12 Dec 2023
Boosting Latent Diffusion with Flow Matching
Johannes S. Fischer
Ming Gui
Pingchuan Ma
Nick Stracke
S. A. Baumann
Bjorn Ommer
22
20
0
12 Dec 2023
NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image
Yoonwoo Jeong
Jinwoo Lee
Chiheon Kim
Minsu Cho
Doyup Lee
21
3
0
12 Dec 2023
InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models
Jiun Tian Hoe
Xudong Jiang
Chee Seng Chan
Yap-Peng Tan
Weipeng Hu
19
11
0
10 Dec 2023
ControlRoom3D: Room Generation using Semantic Proxy Rooms
Jonas Schult
Sam S. Tsai
Lukas Höllein
Bichen Wu
Jialiang Wang
...
Zijian He
Peizhao Zhang
Bastian Leibe
Peter Vajda
Ji Hou
30
31
0
08 Dec 2023
SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control
Jaskirat Singh
Jianming Zhang
Qing Liu
Cameron Smith
Zhe-nan Lin
Liang Zheng
DiffM
34
11
0
08 Dec 2023
ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations
Maitreya Patel
Changhoon Kim
Sheng Cheng
Chitta Baral
Yezhou Yang
VLM
27
18
0
07 Dec 2023
Gen2Det: Generate to Detect
Saksham Suri
Fanyi Xiao
Animesh Sinha
Sean Culatana
Raghuraman Krishnamoorthi
Chenchen Zhu
Abhinav Shrivastava
VLM
DiffM
29
9
0
07 Dec 2023
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Zhen Li
Mingdeng Cao
Xintao Wang
Zhongang Qi
Ming-Ming Cheng
Ying Shan
DiffM
48
187
0
07 Dec 2023
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation
Zhouxia Wang
Ziyang Yuan
Xintao Wang
Tianshui Chen
Menghan Xia
Ping Luo
Ying Shan
DiffM
VGen
35
196
0
06 Dec 2023
TokenCompose: Text-to-Image Diffusion with Token-level Supervision
Zirui Wang
Zhizhou Sha
Zheng Ding
Yilin Wang
Zhuowen Tu
DiffM
27
20
0
06 Dec 2023
Context Diffusion: In-Context Aware Image Generation
Ivona Najdenkoska
Animesh Sinha
Abhimanyu Dubey
Dhruv Mahajan
Vignesh Ramanathan
Filip Radenovic
DiffM
13
10
0
06 Dec 2023
LooseControl: Lifting ControlNet for Generalized Depth Conditioning
Shariq Farooq Bhat
Niloy J. Mitra
Peter Wonka
AI4CE
DiffM
36
34
0
05 Dec 2023
WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation
Jiachen Lu
Ze Huang
Zeyu Yang
Jiahui Zhang
Li Zhang
VGen
19
39
0
05 Dec 2023
Fine-grained Controllable Video Generation via Object Appearance and Context
Hsin-Ping Huang
Yu-Chuan Su
Deqing Sun
Lu Jiang
Xuhui Jia
Yukun Zhu
Ming-Hsuan Yang
DiffM
VGen
25
13
0
05 Dec 2023
Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
Zhuoran Yu
Chenchen Zhu
Sean Culatana
Raghuraman Krishnamoorthi
Fanyi Xiao
Yong Jae Lee
117
14
0
04 Dec 2023
X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
L. Ran
Xiaodong Cun
Jia-Wei Liu
Rui Zhao
Song Zijie
Xintao Wang
Jussi Keppo
Mike Zheng Shou
32
11
0
04 Dec 2023
StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
Jeongho Kim
Gyojung Gu
Minho Park
S. Park
Jaegul Choo
DiffM
17
87
0
04 Dec 2023
Meta ControlNet: Enhancing Task Adaptation via Meta Learning
Junjie Yang
Jinze Zhao
Peihao Wang
Zhangyang Wang
Yingbin Liang
31
2
0
03 Dec 2023
Diffusion Handles: Enabling 3D Edits for Diffusion Models by Lifting Activations to 3D
Karran Pandey
Paul Guerrero
Matheus Gadelha
Yannick Hold-Geoffroy
Karan Singh
Niloy Mitra
DiffM
21
32
0
02 Dec 2023
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models
Pengxiang Li
Kai Chen
Zhili Liu
Ruiyuan Gao
Lanqing Hong
Guo Zhou
Hua Yao
Dit-Yan Yeung
Huchuan Lu
Xu Jia
VGen
DiffM
22
0
0
01 Dec 2023
Exploiting Diffusion Prior for Generalizable Dense Prediction
Hsin-Ying Lee
Hung-Yu Tseng
Hsin-Ying Lee
Ming-Hsuan Yang
DiffM
MDE
37
18
0
30 Nov 2023
Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving
Yu-Quan Wang
Jiawei He
Lue Fan
Hongxin Li
Yuntao Chen
Zhaoxiang Zhang
VGen
59
116
0
29 Nov 2023
Unlocking Spatial Comprehension in Text-to-Image Diffusion Models
Mohammad Mahdi Derakhshani
Menglin Xia
Harkirat Singh Behl
Cees G. M. Snoek
Victor Rühle
19
2
0
28 Nov 2023
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Yutong Feng
Biao Gong
Di Chen
Yujun Shen
Yu Liu
Jingren Zhou
DiffM
26
43
0
28 Nov 2023
Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis
Xiaohui Chen
Yongfei Liu
Yingxiang Yang
Jianbo Yuan
Quanzeng You
Liping Liu
Hongxia Yang
DiffM
48
11
0
28 Nov 2023
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
DiffM
27
60
0
28 Nov 2023
Manifold Preserving Guided Diffusion
Yutong He
Naoki Murata
Chieh-Hsin Lai
Yuhta Takida
Toshimitsu Uesaka
...
Wei-Hsiang Liao
Yuki Mitsufuji
J. Zico Kolter
Ruslan Salakhutdinov
Stefano Ermon
DiffM
116
64
0
28 Nov 2023
Diffusion-TTA: Test-time Adaptation of Discriminative Models via Generative Feedback
Mihir Prabhudesai
Tsung-Wei Ke
Alexander C. Li
Deepak Pathak
Katerina Fragkiadaki
TTA
32
15
0
27 Nov 2023
Self-correcting LLM-controlled Diffusion Models
Tsung-Han Wu
Long Lian
Joseph E. Gonzalez
Boyi Li
Trevor Darrell
62
53
0
27 Nov 2023
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
Siteng Huang
Biao Gong
Yutong Feng
Xi Chen
Yu Fu
Yu Liu
Donglin Wang
DiffM
23
12
0
27 Nov 2023
FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax
Yu Lu
Linchao Zhu
Hehe Fan
Yi Yang
VGen
DiffM
25
13
0
27 Nov 2023
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation
Biao Gong
Siteng Huang
Yutong Feng
Shiwei Zhang
Yuyuan Li
Yu Liu
DiffM
29
11
0
27 Nov 2023
Highly Detailed and Temporal Consistent Video Stylization via Synchronized Multi-Frame Diffusion
M. Xie
Hanyuan Liu
Chengze Li
Tien-Tsin Wong
VGen
DiffM
30
0
0
24 Nov 2023
Paragraph-to-Image Generation with Information-Enriched Diffusion Model
Weijia Wu
Zhuang Li
Yefei He
Mike Zheng Shou
Chunhua Shen
Lele Cheng
Yan Li
Tingting Gao
Di Zhang
VLM
133
24
0
24 Nov 2023
LoCo: Locally Constrained Training-Free Layout-to-Image Synthesis
Peiang Zhao
Han Li
Ruiyang Jin
S. Kevin Zhou
DiffM
51
12
0
21 Nov 2023
SelfEval: Leveraging the discriminative nature of generative models for evaluation
Sai Saketh Rambhatla
Ishan Misra
EGVM
25
4
0
17 Nov 2023
Enhancing Object Coherence in Layout-to-Image Synthesis
Yibin Wang
Weizhong Zhang
Jianwei Zheng
Cheng Jin
DiffM
20
3
0
17 Nov 2023