2111.14822
Vector Quantized Diffusion Model for Text-to-Image Synthesis
29 November 2021
Shuyang Gu
Dong Chen
Jianmin Bao
Fang Wen
Bo Zhang
Dongdong Chen
Lu Yuan
B. Guo
DiffM
Papers citing
"Vector Quantized Diffusion Model for Text-to-Image Synthesis"
50 / 566 papers shown
Title
VisorGPT: Learning Visual Prior via Generative Pre-Training
Jinheng Xie
Kai Ye
Yudong Li
Yuexiang Li
Kevin Qinghong Lin
Yefeng Zheng
Linlin Shen
Mike Zheng Shou
ViT
95
8
0
23 May 2023
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation
Mengqi Huang
Zhendong Mao
Quan Wang
Yongdong Zhang
VGen
DiffM
68
21
0
23 May 2023
LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On
Davide Morelli
Alberto Baldrati
Giuseppe Cartella
Marcella Cornia
Marco Bertini
Rita Cucchiara
DiffM
68
100
0
22 May 2023
If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection
Shyamgopal Karthik
Karsten Roth
Massimiliano Mancini
Zeynep Akata
36
20
0
22 May 2023
MaGIC: Multi-modality Guided Image Completion
Yongsheng Yu
Hao Wang
Tiejian Luo
Hengrui Fan
Libo Zhang
16
12
0
19 May 2023
Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization
Mengqi Huang
Zhendong Mao
Zhuowei Chen
Yongdong Zhang
MQ
35
35
0
19 May 2023
Inspecting the Geographical Representativeness of Images from Text-to-Image Models
Aparna Basu
R. Venkatesh Babu
Danish Pruthi
DiffM
28
39
0
18 May 2023
TextDiffuser: Diffusion Models as Text Painters
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
48
112
0
18 May 2023
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models
Songwei Ge
Seungjun Nah
Guilin Liu
Tyler Poon
Andrew Tao
Bryan Catanzaro
David Jacobs
Jia-Bin Huang
Ming-Yu Liu
Yogesh Balaji
DiffM
VGen
42
252
0
17 May 2023
Exploiting Diffusion Prior for Real-World Image Super-Resolution
Jianyi Wang
Zongsheng Yue
Shangchen Zhou
Kelvin C. K. Chan
Chen Change Loy
44
281
0
11 May 2023
Exploring One-shot Semi-supervised Federated Learning with A Pre-trained Diffusion Model
Min Yang
Shangchao Su
Bin Li
Xiangyang Xue
DiffM
29
30
0
06 May 2023
Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive Text Generation
Kun Zhou
Yifan Li
Wayne Xin Zhao
Ji-Rong Wen
DiffM
26
14
0
06 May 2023
Multimodal Procedural Planning via Dual Text-Image Prompting
Yujie Lu
Pan Lu
Zhiyu Zoey Chen
Wanrong Zhu
Qing Guo
William Yang Wang
LM&Ro
62
43
0
02 May 2023
Long-Term Rhythmic Video Soundtracker
Jiashuo Yu
Yaohui Wang
Xinyuan Chen
Xiao Sun
Yu Qiao
DiffM
64
14
0
02 May 2023
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning
Cheng Lu
Huayu Chen
Jianfei Chen
Hang Su
Chongxuan Li
Jun Zhu
DiffM
OffRL
25
58
0
25 Apr 2023
Collaborative Diffusion for Multi-Modal Face Generation and Editing
Ziqi Huang
Kelvin C. K. Chan
Yuming Jiang
Ziwei Liu
DiffM
49
103
0
20 Apr 2023
Text2Performer: Text-Driven Human Video Generation
Yuming Jiang
Shuai Yang
Tong Liang Koh
Wayne Wu
Chen Change Loy
Ziwei Liu
DiffM
VGen
45
48
0
17 Apr 2023
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing
Ming Cao
Xintao Wang
Zhongang Qi
Ying Shan
Xiaohu Qie
Yinqiang Zheng
DiffM
33
428
0
17 Apr 2023
Soundini: Sound-Guided Diffusion for Natural Video Editing
Seung Hyun Lee
Si-Yeol Kim
Innfarn Yoo
Feng Yang
Donghyeon Cho
Youngseo Kim
Huiwen Chang
Jinkyu Kim
Sangpil Kim
VGen
DiffM
37
15
0
13 Apr 2023
SpectralDiff: A Generative Framework for Hyperspectral Image Classification with Diffusion Models
Ning Chen
Jun Yue
Leyuan Fang
Shaobo Xia
DiffM
25
58
0
12 Apr 2023
Binary Latent Diffusion
Ze Wang
Jiang Wang
Zicheng Liu
Qiang Qiu
29
13
0
10 Apr 2023
TC-VAE: Uncovering Out-of-Distribution Data Generative Factors
Cristian Meo
Anirudh Goyal
Justin Dauwels
DRL
CoGe
CML
27
1
0
08 Apr 2023
Training-Free Layout Control with Cross-Attention Guidance
Minghao Chen
Iro Laina
Andrea Vedaldi
DiffM
135
222
0
06 Apr 2023
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation
Mayu Otani
Riku Togashi
Yu Sawai
Ryosuke Ishigami
Yuta Nakashima
Esa Rahtu
J. Heikkilä
Shin'ichi Satoh
38
62
0
04 Apr 2023
Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models
Jaewoong Lee
Sang-Sub Jang
Jaehyeong Jo
Jaehong Yoon
Yunji Kim
Jin-Hwa Kim
Jung-Woo Ha
Sung Ju Hwang
DiffM
32
4
0
04 Apr 2023
Generative Diffusion Prior for Unified Image Restoration and Enhancement
Ben Fei
Zhaoyang Lyu
Liang Pan
Junzhe Zhang
Weidong Yang
Tian-jian Luo
Bo-Wen Zhang
Bo Dai
DiffM
37
177
0
03 Apr 2023
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models
Yuancheng Wang
Zeqian Ju
Xuejiao Tan
Lei He
Zhizheng Wu
Jiang Bian
Sheng Zhao
DiffM
19
47
0
03 Apr 2023
Diffusion Action Segmentation
Dao-jun Liu
Qiyue Li
A. Dinh
Ting Jiang
Mubarak Shah
Chan Xu
VGen
DiffM
19
68
0
31 Mar 2023
Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos
Kun Su
Kaizhi Qian
Eli Shlizerman
Antonio Torralba
Chuang Gan
VGen
AI4CE
35
20
0
29 Mar 2023
Fine-grained Audible Video Description
Xuyang Shen
Dong Li
Jinxing Zhou
Zhen Qin
Bowen He
...
Yuchao Dai
Lingpeng Kong
Meng Wang
Yu Qiao
Yiran Zhong
VGen
38
11
0
27 Mar 2023
Object Discovery from Motion-Guided Tokens
Zhipeng Bao
P. Tokmakov
Yu-xiong Wang
Adrien Gaidon
M. Hebert
OCL
43
20
0
27 Mar 2023
Seer: Language Instructed Video Prediction with Latent Diffusion Models
Xianfan Gu
Chuan Wen
Weirui Ye
Jiaming Song
Yang Gao
DiffM
VGen
21
40
0
27 Mar 2023
DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion
Sauradip Nag
Xiatian Zhu
Jiankang Deng
Yi-Zhe Song
Tao Xiang
DiffM
VGen
41
21
0
27 Mar 2023
MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer
Shanghua Gao
Pan Zhou
Ming-Ming Cheng
Shuicheng Yan
DiffM
145
155
0
25 Mar 2023
Enhancing Multiple Reliability Measures via Nuisance-extended Information Bottleneck
Jongheon Jeong
Sihyun Yu
Hankook Lee
Jinwoo Shin
AAML
44
0
0
24 Mar 2023
Conditional Image-to-Video Generation with Latent Flow Diffusion Models
Haomiao Ni
Changhao Shi
Kaican Li
Sharon X. Huang
Martin Renqiang Min
VGen
DiffM
21
164
0
24 Mar 2023
ReVersion: Diffusion-Based Relation Inversion from Images
Ziqi Huang
Tianxing Wu
Yuming Jiang
Kelvin C. K. Chan
Ziwei Liu
39
65
0
23 Mar 2023
LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
K. Pnvr
Bharat Singh
P. Ghosh
Behjat Siddiquie
David Jacobs
DiffM
35
29
0
22 Mar 2023
Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models
Lukas Höllein
Ang Cao
Andrew Owens
Justin Johnson
Matthias Nießner
DiffM
38
177
0
21 Mar 2023
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
Yushi Hu
Benlin Liu
Jungo Kasai
Yizhong Wang
Mari Ostendorf
Ranjay Krishna
Noah A. Smith
EGVM
41
208
0
21 Mar 2023
LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models
Junyi Zhang
Jiaqi Guo
Shizhao Sun
Jian-Guang Lou
Dongmei Zhang
DiffM
21
33
0
21 Mar 2023
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
Ligong Han
Yinxiao Li
Han Zhang
P. Milanfar
Dimitris N. Metaxas
Feng Yang
DiffM
41
269
0
20 Mar 2023
Diffusion-based Document Layout Generation
Liu He
Yijuan Lu
John Corring
D. Florêncio
Cha Zhang
DiffM
28
21
0
19 Mar 2023
3DQD: Generalized Deep 3D Shape Prior via Part-Discretized Diffusion Process
Yuhan Li
Yishun Dou
Xuanhong Chen
Bingbing Ni
Yilin Sun
Yutian Liu
Fuzhen Wang
DiffM
29
29
0
18 Mar 2023
FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model
Jiwen Yu
Yinhuai Wang
Chen Zhao
Guohao Li
Jian Zhang
DiffM
22
168
0
17 Mar 2023
Efficient Diffusion Training via Min-SNR Weighting Strategy
Tiankai Hang
Shuyang Gu
Chen Li
Jianmin Bao
Dong Chen
Han Hu
Xin Geng
B. Guo
24
150
0
16 Mar 2023
DIRE for Diffusion-Generated Image Detection
Zhendong Wang
Jianmin Bao
Wen-gang Zhou
Weilun Wang
Hezhen Hu
Hong Chen
Houqiang Li
19
193
0
16 Mar 2023
DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception
Jiayu Zou
Zheng Hua Zhu
Yun Ye
Xingang Wang
DiffM
23
20
0
15 Mar 2023
VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation
Zhengxiong Luo
Dayou Chen
Yingya Zhang
Yan Huang
Liangsheng Wang
Yujun Shen
Deli Zhao
Jinren Zhou
Tien-Ping Tan
DiffM
VGen
132
215
0
15 Mar 2023
LayoutDM: Discrete Diffusion Model for Controllable Layout Generation
Naoto Inoue
Kotaro Kikuchi
E. Simo-Serra
Mayu Otani
Kota Yamaguchi
DiffM
57
101
0
14 Mar 2023