Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.14822
Cited By
Vector Quantized Diffusion Model for Text-to-Image Synthesis
29 November 2021
Shuyang Gu
Dong Chen
Jianmin Bao
Fang Wen
Bo Zhang
Dongdong Chen
Lu Yuan
B. Guo
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Vector Quantized Diffusion Model for Text-to-Image Synthesis"
50 / 566 papers shown
Title
RGB-D-Fusion: Image Conditioned Depth Diffusion of Humanoid Subjects
Sascha Kirch
Valeria Olyunina
Jan Ondřej
Rafael Pagés
Sergio Martín
Clara Pérez-Molina
22
2
0
29 Jul 2023
Incrementally-Computable Neural Networks: Efficient Inference for Dynamic Inputs
Or Sharir
Anima Anandkumar
32
0
0
27 Jul 2023
Text2Layer: Layered Image Generation using Latent Diffusion Model
Xinyang Zhang
Wentian Zhao
Xin Lu
J. Chien
DiffM
19
11
0
19 Jul 2023
Grounded Object Centric Learning
Avinash Kori
Francesco Locatello
Fabio De Sousa Ribeiro
Francesca Toni
Ben Glocker
OCL
22
7
0
18 Jul 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
25
782
0
10 Jul 2023
MomentDiff: Generative Video Moment Retrieval from Random to Real
P. Li
Chen-Wei Xie
Hongtao Xie
Liming Zhao
Lei Zhang
Yun Zheng
Deli Zhao
Yongdong Zhang
DiffM
VGen
39
56
0
06 Jul 2023
Detecting Images Generated by Deep Diffusion Models using their Local Intrinsic Dimensionality
P. Lorenz
Ricard Durall
J. Keuper
DiffM
67
33
0
05 Jul 2023
SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection
Yuguang Shi
DiffM
40
0
0
05 Jul 2023
Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning
Xiang Li
Varun Belagali
Jinghuan Shang
Michael S. Ryoo
37
28
0
04 Jul 2023
JourneyDB: A Benchmark for Generative Image Understanding
Keqiang Sun
Junting Pan
Yuying Ge
Hao Li
Haodong Duan
...
Yi Wang
Jifeng Dai
Yu Qiao
Limin Wang
Hongsheng Li
54
102
0
03 Jul 2023
Evaluating the Robustness of Text-to-image Diffusion Models against Real-world Attacks
Hongcheng Gao
Hao Zhang
Yinpeng Dong
Zhijie Deng
AAML
33
21
0
16 Jun 2023
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
Xiaoshi Wu
Yiming Hao
Keqiang Sun
Yixiong Chen
Feng Zhu
Rui Zhao
Hongsheng Li
46
252
0
15 Jun 2023
Norm-guided latent space exploration for text-to-image generation
Dvir Samuel
Rami Ben-Ari
N. Darshan
Haggai Maron
Gal Chechik
DiffM
29
24
0
14 Jun 2023
TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement
Carl Doersch
Yi Yang
Mel Vecerík
Dilara Gokay
Ankush Gupta
Y. Aytar
João Carreira
Andrew Zisserman
31
148
0
14 Jun 2023
GenImage: A Million-Scale Benchmark for Detecting AI-Generated Image
Mingjian Zhu
Hanting Chen
Qiang Yan
Xu Huang
Guanyu Lin
Wei Li
Zhaopeng Tu
Hailin Hu
Jie Hu
Yunhe Wang
VLM
30
121
0
14 Jun 2023
Distribution Shift Inversion for Out-of-Distribution Prediction
Runpeng Yu
Songhua Liu
Xingyi Yang
Xinchao Wang
OODD
18
18
0
14 Jun 2023
GBSD: Generative Bokeh with Stage Diffusion
Jieren Deng
Xiaoxia Zhou
Hao Tian
Zhihong Pan
Derek Aguiar
DiffM
19
1
0
14 Jun 2023
Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation
Ruoyu Wang
Yongqi Yang
Zhihao Qian
Ye Zhu
Yuehua Wu
DiffM
30
13
0
14 Jun 2023
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
Chenpeng Du
Yiwei Guo
Feiyu Shen
Zhijun Liu
Zheng Liang
Xie Chen
Shuai Wang
Hui Zhang
K. Yu
DiffM
21
42
0
13 Jun 2023
Controlling Text-to-Image Diffusion by Orthogonal Finetuning
Zeju Qiu
Wei-yu Liu
Haiwen Feng
Yuxuan Xue
Yao Feng
Zhen Liu
Dan Zhang
Adrian Weller
Bernhard Schölkopf
DiffM
48
134
0
12 Jun 2023
Fast Diffusion Model
Zike Wu
Pan Zhou
Kenji Kawaguchi
Hanwang Zhang
DiffM
21
19
0
12 Jun 2023
Learning Image-Adaptive Codebooks for Class-Agnostic Image Restoration
Kechun Liu
Yitong Jiang
Inchang Choi
Liang Feng
26
13
0
10 Jun 2023
Boosting GUI Prototyping with Diffusion Models
Jialiang Wei
A. Courbis
Thomas Lambolais
Binbin Xu
P. Bernard
Gérard Dray
DiffM
24
22
0
09 Jun 2023
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process
Changyao Tian
Chenxin Tao
Jifeng Dai
Hao Li
Ziheng Li
Lewei Lu
Xiaogang Wang
Hongsheng Li
Gao Huang
Xizhou Zhu
DiffM
25
9
0
08 Jun 2023
Improving Tuning-Free Real Image Editing with Proximal Guidance
Ligong Han
Song Wen
Qi Chen
Zhixing Zhang
Kunpeng Song
...
Qilong Zhangli
Jindong Jiang
Zhaoyang Xia
Akash Srivastava
Dimitris N. Metaxas
DiffM
33
58
0
08 Jun 2023
Multi-Architecture Multi-Expert Diffusion Models
Yunsung Lee
Jin-Young Kim
Hyojun Go
Myeongho Jeong
Shinhyeok Oh
Seungtaek Choi
DiffM
31
29
0
08 Jun 2023
UniBoost: Unsupervised Unimodal Pre-training for Boosting Zero-shot Vision-Language Tasks
Yanan Sun
Zi-Qi Zhong
Qi Fan
Chi-Keung Tang
Yu-Wing Tai
VLM
33
4
0
07 Jun 2023
Designing a Better Asymmetric VQGAN for StableDiffusion
Zixin Zhu
Xuelu Feng
Dongdong Chen
Jianmin Bao
Le Wang
Yinpeng Chen
Lu Yuan
Gang Hua
DiffM
27
34
0
07 Jun 2023
A Comprehensive Survey on Generative Diffusion Models for Structured Data
Heejoon Koo
To Eun Kim
DiffM
MedIm
33
7
0
07 Jun 2023
Detector Guidance for Multi-Object Text-to-Image Generation
Luping Liu
Zijian Zhang
Yi Ren
Rongjie Huang
Xiang Yin
Zhou Zhao
DiffM
31
9
0
04 Jun 2023
Denoising Diffusion Semantic Segmentation with Mask Prior Modeling
Zeqiang Lai
Yuchen Duan
Jifeng Dai
Ziheng Li
Ying Fu
Hongsheng Li
Yu Qiao
Wen Wang
DiffM
36
17
0
02 Jun 2023
Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation
Minghui Hu
Jianbin Zheng
Daqing Liu
Chuanxia Zheng
Chaoyue Wang
Dacheng Tao
Tat-Jen Cham
DiffM
25
9
0
01 Jun 2023
Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Jinbo Xing
Menghan Xia
Yuxin Liu
Yuechen Zhang
Yong Zhang
...
Haoxin Chen
Xiaodong Cun
Xintao Wang
Ying Shan
T. Wong
VGen
DiffM
39
84
0
01 Jun 2023
AI Imagery and the Overton Window
Sarah K. Amer
14
5
0
31 May 2023
Control4D: Efficient 4D Portrait Editing with Text
Ruizhi Shao
Jingxiang Sun
Cheng Peng
Zerong Zheng
Boyao Zhou
Hongwen Zhang
Yebin Liu
DiffM
24
23
0
31 May 2023
Unsupervised Statistical Feature-Guided Diffusion Model for Sensor-based Human Activity Recognition
Si Zuo
Vitor Fortes Rey
Sungho Suh
S. Sigg
P. Lukowicz
DiffM
27
4
0
30 May 2023
DiffSketching: Sketch Control Image Synthesis with Diffusion Models
Qiang Wang
Di Kong
Fengyin Lin
Yonggang Qi
DiffM
33
14
0
30 May 2023
SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-driven Video Editing
Nazmul Karim
Umar Khalid
M. Joneidi
Chen Chen
Nazanin Rahnavard
DiffM
VGen
19
5
0
30 May 2023
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
Yuchao Gu
Xintao Wang
Jay Zhangjie Wu
Yujun Shi
Yunpeng Chen
...
Shuning Chang
Wei Yu Wu
Yixiao Ge
Ying Shan
Mike Zheng Shou
DiffM
52
166
0
29 May 2023
Photoswap: Personalized Subject Swapping in Images
Jing Gu
Yilin Wang
Nanxuan Zhao
Tsu-jui Fu
Wei Xiong
...
Zhifei Zhang
He Zhang
Jianming Zhang
Hyun-Sun Jung
Xin Eric Wang
DiffM
26
37
0
29 May 2023
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Fu Lee Wang
Wenshuo Chen
Guanglu Song
Han-Jia Ye
Yu Liu
Hongsheng Li
VGen
DiffM
45
88
0
29 May 2023
TaleCrafter: Interactive Story Visualization with Multiple Characters
Yuan Gong
Youxin Pang
Xiaodong Cun
Menghan Xia
Yingqing He
...
Longyue Wang
Yong Zhang
Xintao Wang
Ying Shan
Yujiu Yang
DiffM
27
45
0
29 May 2023
Conditional Score Guidance for Text-Driven Image-to-Image Translation
Hyunsoo Lee
Minsoo Kang
Bohyung Han
DiffM
18
14
0
29 May 2023
Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling
Tianqi Chen
Mingyuan Zhou
DiffM
60
7
0
28 May 2023
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Shihao Zhao
Dongdong Chen
Yen-Chun Chen
Jianmin Bao
Shaozhe Hao
Lu Yuan
Kwan-Yee K. Wong
27
234
0
25 May 2023
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models
Xingqian Xu
Jiayi Guo
Zhangyang Wang
Gao Huang
Irfan Essa
Humphrey Shi
VLM
DiffM
35
57
0
25 May 2023
GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes
Ibrahim Ethem Hamamci
Sezgin Er
Anjany Sekuboyina
Enis Simsar
A. Tezcan
...
Hadrien Reynaud
Sarthak Pati
Christian Bluethgen
M. K. Özdemir
Bjoern H. Menze
DiffM
MedIm
42
16
0
25 May 2023
Visual Programming for Text-to-Image Generation and Evaluation
Jaemin Cho
Abhaysinh Zala
Joey Tianyi Zhou
MLLM
26
50
0
24 May 2023
On the Generalization of Diffusion Model
Mingyang Yi
Jiacheng Sun
Zhenguo Li
22
18
0
24 May 2023
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
30
6
0
24 May 2023
Previous
1
2
3
...
10
11
12
7
8
9
Next