Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.00446
Cited By
Generating Diverse High-Fidelity Images with VQ-VAE-2
2 June 2019
Ali Razavi
Aaron van den Oord
Oriol Vinyals
DRL
BDL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Generating Diverse High-Fidelity Images with VQ-VAE-2"
50 / 1,128 papers shown
Title
Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
Qiucheng Wu
Yujian Liu
Handong Zhao
T. Bui
Zhe Lin
Yang Zhang
Shiyu Chang
DiffM
97
46
0
07 Apr 2023
Diffusion Models as Masked Autoencoders
Chen Wei
K. Mangalam
Po-Yao (Bernie) Huang
Yanghao Li
Haoqi Fan
Hu Xu
Huiyu Wang
Cihang Xie
Alan Yuille
Christoph Feichtenhofer
DiffM
SyDa
102
53
0
06 Apr 2023
DITTO-NeRF: Diffusion-based Iterative Text To Omni-directional 3D Model
H. Seo
Hayeon Kim
Gwanghyun Kim
S. Chun
DiffM
120
43
0
06 Apr 2023
GINA-3D: Learning to Generate Implicit Neural Assets in the Wild
Bokui Shen
Xinchen Yan
C. Qi
Mahyar Najibi
Boyang Deng
Leonidas Guibas
Yin Zhou
Drago Anguelov
3DV
103
21
0
04 Apr 2023
Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models
Jaewoong Lee
Sang-Sub Jang
Jaehyeong Jo
Jaehong Yoon
Yunji Kim
Jin-Hwa Kim
Jung-Woo Ha
Sung Ju Hwang
DiffM
87
4
0
04 Apr 2023
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation
Jiawei Liu
Weining Wang
Sihan Chen
Xinxin Zhu
Qingbin Liu
DiffM
VGen
84
14
0
29 Mar 2023
Implicit Diffusion Models for Continuous Super-Resolution
Sicheng Gao
Xuhui Liu
Bo-Wen Zeng
Sheng Xu
Yanjing Li
Xiaonan Luo
Jianzhuang Liu
Xiantong Zhen
Baochang Zhang
DiffM
120
232
0
29 Mar 2023
Object Discovery from Motion-Guided Tokens
Zhipeng Bao
P. Tokmakov
Yu-Xiong Wang
Adrien Gaidon
M. Hebert
OCL
100
23
0
27 Mar 2023
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
Pierre Fernandez
Guillaume Couairon
Hervé Jégou
Matthijs Douze
Teddy Furon
WIGM
137
198
0
27 Mar 2023
Learning Generative Models with Goal-conditioned Reinforcement Learning
Mariana Vargas Vieyra
Pierre Ménard
GAN
31
0
0
26 Mar 2023
Learning Versatile 3D Shape Generation with Improved AR Models
Simian Luo
Xuelin Qian
Yanwei Fu
Yinda Zhang
Ying Tai
Zhenyu Zhang
Chengjie Wang
Xiangyang Xue
108
3
0
26 Mar 2023
MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer
Shanghua Gao
Pan Zhou
Mingg-Ming Cheng
Shuicheng Yan
DiffM
242
171
0
25 Mar 2023
Variational Inference for Longitudinal Data Using Normalizing Flows
Clément Chadebec
S. Allassonnière
BDL
DRL
69
1
0
24 Mar 2023
DiffuScene: Denoising Diffusion Models for Generative Indoor Scene Synthesis
Jiapeng Tang
Yinyu Nie
Lev Markhasin
Angela Dai
Justus Thies
Matthias Nießner
DiffM
126
45
0
24 Mar 2023
High Fidelity Image Synthesis With Deep VAEs In Latent Space
Troy Luhman
Eric Luhman
DRL
3DV
75
7
0
23 Mar 2023
Continuous Indeterminate Probability Neural Network
T.A. Yang
BDL
65
0
0
23 Mar 2023
Posthoc Interpretation via Quantization
Francesco Paissan
Cem Subakan
Mirco Ravanelli
MQ
110
7
0
22 Mar 2023
LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
K. Pnvr
Bharat Singh
P. Ghosh
Behjat Siddiquie
David Jacobs
DiffM
91
29
0
22 Mar 2023
Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models
Lukas Höllein
Ang Cao
Andrew Owens
Justin Johnson
Matthias Nießner
DiffM
137
183
0
21 Mar 2023
Information-containing Adversarial Perturbation for Combating Facial Manipulation Systems
Yao Zhu
YueFeng Chen
Xiaodan Li
Rong Zhang
Xiang Tian
Bo Zheng
Yao-wu Chen
AAML
109
11
0
21 Mar 2023
FullFormer: Generating Shapes Inside Shapes
Tejaswini Medi
Jawad Tayyub
M. Sarmad
Frank Lindseth
Margret Keuper
63
1
0
20 Mar 2023
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
Peng Jin
Hao Li
Ze-Long Cheng
Kehan Li
Xiang Ji
Chang-rui Liu
Li-ming Yuan
Jie Chen
DiffM
VGen
100
58
0
17 Mar 2023
Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution
Zixi Tuo
Huan Yang
Jianlong Fu
Yujie Dun
Xueming Qian
VGen
64
3
0
17 Mar 2023
StylerDALLE: Language-Guided Style Transfer Using a Vector-Quantized Tokenizer of a Large-Scale Generative Model
Zipeng Xu
E. Sangineto
N. Sebe
DiffM
90
13
0
16 Mar 2023
DR2: Diffusion-based Robust Degradation Remover for Blind Face Restoration
Zhixin Wang
Xiaoyun Zhang
Ziying Zhang
Huangjie Zheng
Mingyuan Zhou
Ya Zhang
Yanfeng Wang
DiffM
88
75
0
13 Mar 2023
Regularized Vector Quantization for Tokenized Image Synthesis
Jiahui Zhang
Fangneng Zhan
Christian Theobalt
Shijian Lu
DiffM
MQ
104
33
0
11 Mar 2023
TrojDiff: Trojan Attacks on Diffusion Models with Diverse Targets
Weixin Chen
Basel Alomair
Yue Liu
DiffM
119
81
0
10 Mar 2023
Neural Vector Fields: Implicit Representation by Explicit Learning
Xianghui Yang
Guosheng Lin
Zhenghao Chen
Luping Zhou
AI4CE
111
18
0
08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
122
554
0
07 Mar 2023
Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding
Jiacheng Li
Longhui Wei
Zongyuan Zhan
Xinfu He
Siliang Tang
Qi Tian
Yueting Zhuang
61
4
0
07 Mar 2023
MOSO: Decomposing MOtion, Scene and Object for Video Prediction
M. Sun
Weining Wang
Xinxin Zhu
Jing Liu
102
14
0
07 Mar 2023
Sketch-based Medical Image Retrieval
Kazuma Kobayashi
Lin Gu
Ryuichiro Hataya
T. Mizuno
M. Miyake
...
Nobuji Kouno
Amina Bolatkan
Y. Kurose
Tatsuya Harada
Ryuji Hamamoto
73
8
0
07 Mar 2023
FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model
Rui Xue
Yanqing Liu
Lei He
Xuejiao Tan
Linquan Liu
Ed Lin
Sheng Zhao
120
7
0
06 Mar 2023
Synthetic ECG Signal Generation using Probabilistic Diffusion Models
Edmond Adib
Amanda Fernandez
Fatemeh Afghah
John J. Prevost
DiffM
108
44
0
04 Mar 2023
Co-Speech Gesture Synthesis using Discrete Gesture Token Learning
Shuhong Lu
Youngwoo Yoon
Andrew W. Feng
SLR
99
12
0
04 Mar 2023
StraIT: Non-autoregressive Generation with Stratified Image Transformer
Shengju Qian
Huiwen Chang
Yuanzhen Li
Zizhao Zhang
Jiaya Jia
Han Zhang
124
12
0
01 Mar 2023
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System
Chao Xue
Wen Liu
Shunxing Xie
Zhenfang Wang
Jiaxing Li
...
Shi-Yong Chen
Yibing Zhan
Jing Zhang
Chaoyue Wang
Dacheng Tao
112
2
0
01 Mar 2023
TextIR: A Simple Framework for Text-based Editable Image Restoration
Yun-Hao Bai
Cairong Wang
Shuzhao Xie
Chao Dong
Chun Yuan
Zhi Wang
DiffM
128
15
0
28 Feb 2023
Hierarchical Reinforcement Learning in Complex 3D Environments
Bernardo Avila-Pires
Feryal M. P. Behbahani
Hubert Soyer
Kyriacos Nikiforou
Thomas Keck
Satinder Singh
OffRL
90
0
0
28 Feb 2023
Improving Model Generalization by On-manifold Adversarial Augmentation in the Frequency Domain
Chang-rui Liu
Wenzhao Xiang
Yuan He
H. Xue
Shibao Zheng
Hang Su
83
4
0
28 Feb 2023
Denoising diffusion algorithm for inverse design of microstructures with fine-tuned nonlinear material properties
Nikolaos N. Vlassis
WaiChing Sun
AI4CE
DiffM
126
51
0
24 Feb 2023
Streamlining Multimodal Data Fusion in Wireless Communication and Sensor Networks
M. J. Bocus
Xiaoyang Wang
Robert Piechocki
58
1
0
24 Feb 2023
Learning Manifold Dimensions with Conditional Variational Autoencoders
Yijia Zheng
Tong He
Yixuan Qiu
David Wipf
DRL
106
22
0
23 Feb 2023
Entity-Level Text-Guided Image Manipulation
Yikai Wang
Jianan Wang
Guansong Lu
Hang Xu
Zhenguo Li
Wei Zhang
Yanwei Fu
VGen
78
3
0
22 Feb 2023
Speech Enhancement with Multi-granularity Vector Quantization
Xiaokang Zhao
Qiu-shi Zhu
Jie Zhang
69
0
0
16 Feb 2023
A Review of Uncertainty Estimation and its Application in Medical Imaging
K. Zou
Zhihao Chen
Xuedong Yuan
Xiaojing Shen
Meng Wang
Huazhu Fu
UQCV
134
92
0
16 Feb 2023
Understanding the Distillation Process from Deep Generative Models to Tractable Probabilistic Circuits
Xuejie Liu
Hoang Trung-Dung
Guy Van den Broeck
Yitao Liang
TPM
96
14
0
16 Feb 2023
Self-Organising Neural Discrete Representation Learning à la Kohonen
Kazuki Irie
Róbert Csordás
Jürgen Schmidhuber
SSL
91
1
0
15 Feb 2023
Vector Quantized Wasserstein Auto-Encoder
Tung-Long Vuong
Trung Le
He Zhao
Chuanxia Zheng
Mehrtash Harandi
Jianfei Cai
Dinh Q. Phung
DRL
73
20
0
12 Feb 2023
Trading Information between Latents in Hierarchical Variational Autoencoders
Tim Z. Xiao
Robert Bamler
DRL
BDL
52
6
0
09 Feb 2023
Previous
1
2
3
...
11
12
13
...
21
22
23
Next