Generating Diverse High-Fidelity Images with VQ-VAE-2


2 June 2019
Ali Razavi
Aaron van den Oord
Oriol Vinyals
    DRL, BDL

Papers citing "Generating Diverse High-Fidelity Images with VQ-VAE-2"

50 / 1,128 papers shown
Title
M6-UFC: Unifying Multi-Modal Controls for Conditional Image Synthesis via Non-Autoregressive Generative Transformers
Zhu Zhang
Jianxin Ma
Chang Zhou
Rui Men
Zhikang Li
Ming Ding
Jie Tang
Jingren Zhou
Hongxia Yang
125
47
0
29 May 2021
CogView: Mastering Text-to-Image Generation via Transformers
Ming Ding
Zhuoyi Yang
Wenyi Hong
Wendi Zheng
Chang Zhou
...
Junyang Lin
Xu Zou
Zhou Shao
Hongxia Yang
Jie Tang
ViT, VLM
173
784
0
26 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics
V. Jayaram
John Thickstun
DiffM
110
25
0
17 May 2021
Priors in Bayesian Deep Learning: A Review
Vincent Fortuin
UQCV, BDL
141
134
0
14 May 2021
High-Resolution Complex Scene Synthesis with Transformers
Manuel Jahn
Robin Rombach
Bjorn Ommer
ViT
90
37
0
13 May 2021
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
621
8,017
0
11 May 2021
Computer-Aided Design as Language
Yaroslav Ganin
Sergey Bartunov
Yujia Li
E. Keller
Stefano Saliceti
3DV
164
95
0
06 May 2021
OCTOPUS: Overcoming Performance and Privatization Bottlenecks in Distributed Learning
Shuo Wang
Surya Nepal
Kristen Moore
M. Grobler
Carsten Rudolph
A. Abuadbba
FedML
81
8
0
03 May 2021
Data Augmentation in High Dimensional Low Sample Size Setting Using a Geometry-Based Variational Autoencoder
Clément Chadebec
Elina Thibeau-Sutre
Ninon Burgos
S. Allassonnière
139
69
0
30 Apr 2021
GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions
Chenfei Wu
Lun Huang
Qianxi Zhang
Binyang Li
Lei Ji
Fan Yang
Guillermo Sapiro
Nan Duan
DiffM, VGen
128
245
0
30 Apr 2021
PANDA: Perceptually Aware Neural Detection of Anomalies
Jack W. Barker
T. Breckon
51
5
0
28 Apr 2021
Adaptive Appearance Rendering
Mengyao Zhai
Ruizhi Deng
Jiacheng Chen
Lei Chen
Zhiwei Deng
Greg Mori
48
1
0
24 Apr 2021
On Aliased Resizing and Surprising Subtleties in GAN Evaluation
Gaurav Parmar
Richard Y. Zhang
Jun-Yan Zhu
EGVM
153
77
0
22 Apr 2021
IB-DRR: Incremental Learning with Information-Back Discrete Representation Replay
Jian Jiang
Edoardo Cetin
Oya Celiktutan
63
9
0
21 Apr 2021
VideoGPT: Video Generation using VQ-VAE and Transformers
Wilson Yan
Yunzhi Zhang
Pieter Abbeel
A. Srinivas
ViT, VGen
353
514
0
20 Apr 2021
Geometry-Free View Synthesis: Transformers and no 3D Priors
Robin Rombach
Patrick Esser
Bjorn Ommer
ViT
119
95
0
15 Apr 2021
Spectrogram Inpainting for Interactive Generation of Instrument Sounds
Théis Bazin
Gaëtan Hadjeres
P. Esling
M. Malt
63
11
0
15 Apr 2021
Diamond in the rough: Improving image realism by traversing the GAN latent space
Jeffrey Wen
Fabian Benitez-Quiroz
Qianli Feng
Aleix M. Martinez
53
3
0
12 Apr 2021
Boltzmann Tuning of Generative Models
Victor Berger
Michele Sebag
62
0
0
12 Apr 2021
InfinityGAN: Towards Infinite-Pixel Image Synthesis
C. Lin
Hsin-Ying Lee
Yen-Chi Cheng
Sergey Tulyakov
Ming-Hsuan Yang
101
71
0
08 Apr 2021
Creativity and Machine Learning: A Survey
Giorgio Franceschelli
Mirco Musolesi
VLM, AI4CE
134
43
0
06 Apr 2021
Noise Estimation for Generative Diffusion Models
Robin San-Roman
Eliya Nachmani
Lior Wolf
DiffM
147
107
0
06 Apr 2021
Training Deep Normalizing Flow Models in Highly Incomplete Data Scenarios with Prior Regularization
Edgar A. Bernal
43
1
0
03 Apr 2021
A Closer Look at Fourier Spectrum Discrepancies for CNN-generated Images Detection
Keshigeyan Chandrasegaran
Ngoc-Trung Tran
Ngai-Man Cheung
96
83
0
31 Mar 2021
Symbolic Music Generation with Diffusion Models
Gautam Mittal
Jesse Engel
Curtis Hawthorne
Ian Simon
MGen, DiffM
122
194
0
30 Mar 2021
PixelTransformer: Sample Conditioned Signal Generation
Shubham Tulsiani
Abhinav Gupta
76
17
0
29 Mar 2021
AttrLostGAN: Attribute Controlled Image Synthesis from Reconfigurable Layout and Style
Stanislav Frolov
Avneesh Sharma
Jörn Hees
Tushar Karayil
Federico Raue
Andreas Dengel
93
15
0
25 Mar 2021
Decomposing Normal and Abnormal Features of Medical Images into Discrete Latent Codes for Content-Based Image Retrieval
Kazuma Kobayashi
Ryuichiro Hataya
Y. Kurose
M. Miyake
Masamichi Takahashi
Akiko Nakagawa
Tatsuya Harada
Ryuji Hamamoto
MedIm
101
19
0
23 Mar 2021
Paint by Word
A. Andonian
David Bau
Audrey Cui
YeonHwan Park
Ali Jahanian
Antonio Torralba
A. Oliva
DiffM
120
125
0
19 Mar 2021
Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE
Jialun Peng
Dong Liu
Songcen Xu
Houqiang Li
DiffM
72
196
0
18 Mar 2021
VDSM: Unsupervised Video Disentanglement with State-Space Modeling and Deep Mixtures of Experts
M. Vowels
Necati Cihan Camgöz
Richard Bowden
CoGe
87
8
0
12 Mar 2021
Variable-rate discrete representation learning
Sander Dieleman
C. Nash
Jesse Engel
Karen Simonyan
BDL, DRL
85
24
0
10 Mar 2021
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models
Sam Bond-Taylor
Adam Leach
Yang Long
Chris G. Willcocks
VLM, TPM
213
511
0
08 Mar 2021
Generating Images with Sparse Representations
C. Nash
Jacob Menick
Sander Dieleman
Peter W. Battaglia
108
211
0
05 Mar 2021
crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder
Kazuhiro Kobayashi
Wen-Chin Huang
Yi-Chiao Wu
Patrick Lumban Tobing
Tomoki Hayashi
Tomoki Toda
BDL, DRL
79
19
0
04 Mar 2021
Predicting Video with VQVAE
Jacob Walker
Ali Razavi
Aaron van den Oord
DRL
152
69
0
02 Mar 2021
Countering Malicious DeepFakes: Survey, Battleground, and Horizon
Felix Juefei Xu
Run Wang
Yihao Huang
Qing Guo
Lei Ma
Yang Liu
AAML
123
138
0
27 Feb 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
444
5,020
0
24 Feb 2021
Unsupervised Brain Anomaly Detection and Segmentation with Transformers
W. H. Pinaya
Petru-Daniel Tudosiu
Robert J. Gray
G. Rees
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
ViT, MedIm
79
61
0
23 Feb 2021
Anytime Sampling for Autoregressive Models via Ordered Autoencoding
Yilun Xu
Yang Song
Sahaj Garg
Linyuan Gong
Rui Shu
Aditya Grover
Stefano Ermon
DiffM
99
11
0
23 Feb 2021
FaceController: Controllable Attribute Editing for Face in the Wild
Zhi-liang Xu
Xiyu Yu
Zhibin Hong
Zhen Zhu
Junyu Han
Jingtuo Liu
Errui Ding
X. Bai
CVBM
75
44
0
23 Feb 2021
Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding
Yangjun Ruan
Karen Ullrich
Daniel de Souza Severo
James Townsend
Ashish Khisti
Arnaud Doucet
Alireza Makhzani
Chris J. Maddison
121
25
0
22 Feb 2021
Improved Denoising Diffusion Probabilistic Models
Alex Nichol
Prafulla Dhariwal
DiffM
381
3,749
0
18 Feb 2021
Preventing Oversmoothing in VAE via Generalized Variance Parameterization
Yuhta Takida
Wei-Hsiang Liao
Chieh-Hsin Lai
Toshimitsu Uesaka
Shusuke Takahashi
Yuki Mitsufuji
DRL
101
15
0
17 Feb 2021
Certifiably Robust Variational Autoencoders
Ben Barrett
A. Camuto
M. Willetts
Tom Rainforth
AAML, DRL
90
17
0
15 Feb 2021
Using Deep LSD to build operators in GANs latent space with meaning in real space
J. Q. Toledo-Marín
J. Glazier
GAN
124
3
0
09 Feb 2021
CNN with large memory layers
R. Karimov
Yury Malkov
Karim Iskakov
Victor Lempitsky
79
0
0
27 Jan 2021
Fast Non-line-of-sight Imaging with Two-step Deep Remapping
Dayu Zhu
W. Cai
49
0
0
26 Jan 2021
Adversarial Text-to-Image Synthesis: A Review
Stanislav Frolov
Tobias Hinz
Federico Raue
Jörn Hees
Andreas Dengel
EGVM
90
178
0
25 Jan 2021
Hierarchical disentangled representation learning for singing voice conversion
Naoya Takahashi
M. Singh
Yuki Mitsufuji
DRL
60
14
0
18 Jan 2021