ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.00446
  4. Cited By
Generating Diverse High-Fidelity Images with VQ-VAE-2

Generating Diverse High-Fidelity Images with VQ-VAE-2

2 June 2019
Ali Razavi
Aaron van den Oord
Oriol Vinyals
    DRLBDL
ArXiv (abs)PDFHTML

Papers citing "Generating Diverse High-Fidelity Images with VQ-VAE-2"

50 / 1,128 papers shown
Title
MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point
  Cloud Action Recognition
MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point Cloud Action Recognition
Xiaodong Chen
Wu Liu
Xinchen Liu
Yongdong Zhang
Jungong Han
Tao Mei
3DPC
94
13
0
01 Sep 2022
Large-Scale Auto-Regressive Modeling Of Street Networks
Large-Scale Auto-Regressive Modeling Of Street Networks
Michael Birsak
Tom Kelly
W. Para
Peter Wonka
GNNAI4TS
53
6
0
01 Sep 2022
Deep Generative Modeling on Limited Data with Regularization by
  Nontransferable Pre-trained Models
Deep Generative Modeling on Limited Data with Regularization by Nontransferable Pre-trained Models
Yong Zhong
Hongtao Liu
Xiaodong Liu
Fan Bao
Weiran Shen
Chongxuan Li
AI4CE
105
4
0
30 Aug 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
98
97
0
29 Aug 2022
Lossy Image Compression with Quantized Hierarchical VAEs
Lossy Image Compression with Quantized Hierarchical VAEs
Zhihao Duan
Ming Lu
Zhan Ma
Fengqing Zhu
99
45
0
27 Aug 2022
Discovering Transferable Forensic Features for CNN-generated Images
  Detection
Discovering Transferable Forensic Features for CNN-generated Images Detection
Keshigeyan Chandrasegaran
Ngoc-Trung Tran
Alexander Binder
Ngai-Man Cheung
AAML
94
28
0
24 Aug 2022
Dataset Condensation with Latent Space Knowledge Factorization and
  Sharing
Dataset Condensation with Latent Space Knowledge Factorization and Sharing
Haebeom Lee
Dong Bok Lee
Sung Ju Hwang
DD
73
39
0
21 Aug 2022
FaceOff: A Video-to-Video Face Swapping System
FaceOff: A Video-to-Video Face Swapping System
Aditya Agarwal
Bipasha Sen
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
PICVCVBM
86
2
0
21 Aug 2022
A Neural Approach to Spatio-Temporal Data Release with User-Level
  Differential Privacy
A Neural Approach to Spatio-Temporal Data Release with User-Level Differential Privacy
Ritesh Ahuja
Sepanta Zeighami
Gabriel Ghinita
Cyrus Shahabi
61
12
0
20 Aug 2022
Pathway to Future Symbiotic Creativity
Pathway to Future Symbiotic Creativity
Yi-Ting Guo
Qi-fei Liu
Jie Chen
Wei Xue
Jie Fu
...
Fernando Rosas
Jeffrey Shaw
Xing Wu
Jiji Zhang
Jianliang Xu
84
0
0
18 Aug 2022
Musika! Fast Infinite Waveform Music Generation
Musika! Fast Infinite Waveform Music Generation
Marco Pasini
Jan Schluter
MGen
57
31
0
18 Aug 2022
Teacher Guided Training: An Efficient Framework for Knowledge Transfer
Teacher Guided Training: An Efficient Framework for Knowledge Transfer
Manzil Zaheer
A. S. Rawat
Seungyeon Kim
Chong You
Himanshu Jain
Andreas Veit
Rob Fergus
Surinder Kumar
VLM
75
2
0
14 Aug 2022
Gradient Estimation for Binary Latent Variables via Gradient Variance
  Clipping
Gradient Estimation for Binary Latent Variables via Gradient Variance Clipping
Russell Z. Kunes
Mingzhang Yin
Max Land
Doron Haviv
Dana Peér
Simon Tavaré
BDL
103
3
0
12 Aug 2022
ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal
  Fashion Design
ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design
Xujie Zhang
Yuyang Sha
Michael C. Kampffmeyer
Zhenyu Xie
Zequn Jie
Chengwen Huang
Jianqing Peng
Xiaodan Liang
102
20
0
11 Aug 2022
Symbolic Music Loop Generation with Neural Discrete Representations
Symbolic Music Loop Generation with Neural Discrete Representations
Sangjun Han
H. Ihm
Moontae Lee
Woohyung Lim
98
9
0
11 Aug 2022
Model-Free Generative Replay for Lifelong Reinforcement Learning:
  Application to Starcraft-2
Model-Free Generative Replay for Lifelong Reinforcement Learning: Application to Starcraft-2
Z. Daniels
Aswin Raghavan
Jesse Hostetler
Abrar Rahman
Indranil Sur
M. Piacentino
Ajay Divakaran
CLLOffRL
100
13
0
09 Aug 2022
Hierarchical Residual Learning Based Vector Quantized Variational
  Autoencoder for Image Reconstruction and Generation
Hierarchical Residual Learning Based Vector Quantized Variational Autoencoder for Image Reconstruction and Generation
Mohammad Adiban
Kalin Stefanov
Sabato Marco Siniscalchi
G. Salvi
88
9
0
09 Aug 2022
DALLE-URBAN: Capturing the urban design expertise of large text to image
  transformers
DALLE-URBAN: Capturing the urban design expertise of large text to image transformers
Sachith Seneviratne
Damith A. Senanayake
Sanka Rasnayaka
Rajith Vidanaarachchi
Jason Thompson
ViT
111
22
0
03 Aug 2022
DSR -- A dual subspace re-projection network for surface anomaly
  detection
DSR -- A dual subspace re-projection network for surface anomaly detection
Vitjan Zavrtanik
Matej Kristan
D. Skočaj
125
115
0
02 Aug 2022
Rewriting Geometric Rules of a GAN
Rewriting Geometric Rules of a GAN
Sheng-Yu Wang
David Bau
Jun-Yan Zhu
107
36
0
28 Jul 2022
Exploiting Negative Preference in Content-based Music Recommendation
  with Contrastive Learning
Exploiting Negative Preference in Content-based Music Recommendation with Contrastive Learning
Minju Park
Kyogu Lee
47
16
0
28 Jul 2022
GAUDI: A Neural Architect for Immersive 3D Scene Generation
GAUDI: A Neural Architect for Immersive 3D Scene Generation
Miguel Angel Bautista
Pengsheng Guo
Samira Abnar
Walter A. Talbott
Alexander Toshev
...
Shuangfei Zhai
Hanlin Goh
Daniel Ulbricht
Afshin Dehghan
J. Susskind
SyDa3DGS
101
139
0
27 Jul 2022
Lighting (In)consistency of Paint by Text
Lighting (In)consistency of Paint by Text
Hany Farid
75
32
0
27 Jul 2022
Leveraging GAN Priors for Few-Shot Part Segmentation
Leveraging GAN Priors for Few-Shot Part Segmentation
M. Han
Heliang Zheng
Chaoyue Wang
Yong Luo
Han Hu
Bo Du
102
6
0
27 Jul 2022
Vector Quantized Image-to-Image Translation
Vector Quantized Image-to-Image Translation
Yu-Jie Chen
Shin-I Cheng
Wei-Chen Chiu
Hung-Yu Tseng
Hsin-Ying Lee
71
20
0
27 Jul 2022
Classifier-Free Diffusion Guidance
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
213
3,992
0
26 Jul 2022
Discrete Key-Value Bottleneck
Discrete Key-Value Bottleneck
Frederik Trauble
Anirudh Goyal
Nasim Rahaman
Michael C. Mozer
Kenji Kawaguchi
Yoshua Bengio
Bernhard Schölkopf
CLL
92
23
0
22 Jul 2022
InfiniteNature-Zero: Learning Perpetual View Generation of Natural
  Scenes from Single Images
InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images
Zhengqi Li
Qianqian Wang
Noah Snavely
Angjoo Kanazawa
VGen
113
63
0
22 Jul 2022
Few-shot Image Generation Using Discrete Content Representation
Few-shot Image Generation Using Discrete Content Representation
Y. Hong
Li Niu
Jianfu Zhang
Liqing Zhang
DiffM
84
11
0
22 Jul 2022
Unveiling the Latent Space Geometry of Push-Forward Generative Models
Unveiling the Latent Space Geometry of Push-Forward Generative Models
Thibaut Issenhuth
Ugo Tanielian
Jérémie Mary
David Picard
GANDRL
91
3
0
21 Jul 2022
Latent Discriminant deterministic Uncertainty
Latent Discriminant deterministic Uncertainty
Gianni Franchi
Xuanlong Yu
Andrei Bursuc
Emanuel Aldea
Séverine Dubuisson
David Filliat
UQCV
69
18
0
20 Jul 2022
Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Dongchao Yang
Jianwei Yu
Helin Wang
Wen Wang
Chao Weng
Yuexian Zou
Dong Yu
DiffM
111
306
0
20 Jul 2022
Comparing the latent space of generative models
Comparing the latent space of generative models
Andrea Asperti
Valerio Tonelli
DRL
84
13
0
14 Jul 2022
Collaborative Quantization Embeddings for Intra-Subject Prostate MR
  Image Registration
Collaborative Quantization Embeddings for Intra-Subject Prostate MR Image Registration
Ziyi Shen
Qianye Yang
Yuming Shen
F. Giganti
V. Stavrinides
...
M. Rusu
G. Sonn
Philip Torr
D. Barratt
Yipeng Hu
74
3
0
13 Jul 2022
The Free Energy Principle for Perception and Action: A Deep Learning
  Perspective
The Free Energy Principle for Perception and Action: A Deep Learning Perspective
Pietro Mazzaglia
Tim Verbelen
Ozan Çatal
Bart Dhoedt
DRLAI4CE
74
33
0
13 Jul 2022
Earthformer: Exploring Space-Time Transformers for Earth System
  Forecasting
Earthformer: Exploring Space-Time Transformers for Earth System Forecasting
Zhihan Gao
Xingjian Shi
Hao Wang
Yi Zhu
Yuyang Wang
Mu Li
Dit-Yan Yeung
AI4TS
100
159
0
12 Jul 2022
SkexGen: Autoregressive Generation of CAD Construction Sequences with
  Disentangled Codebooks
SkexGen: Autoregressive Generation of CAD Construction Sequences with Disentangled Codebooks
Xiang Xu
Karl D. D. Willis
Joseph G. Lambourne
Chin-Yi Cheng
P. Jayaraman
Yasutaka Furukawa
97
78
0
11 Jul 2022
End-to-End Binaural Speech Synthesis
End-to-End Binaural Speech Synthesis
Wen-Chin Huang
Dejan Marković
Alexander Richard
I. D. Gebru
Anjali Menon
65
9
0
08 Jul 2022
Text to Image Synthesis using Stacked Conditional Variational
  Autoencoders and Conditional Generative Adversarial Networks
Text to Image Synthesis using Stacked Conditional Variational Autoencoders and Conditional Generative Adversarial Networks
Haileleol Tibebu
Aadin Malik
V. D. Silva
GAN
58
7
0
06 Jul 2022
GLANCE: Global to Local Architecture-Neutral Concept-based Explanations
GLANCE: Global to Local Architecture-Neutral Concept-based Explanations
Avinash Kori
Ben Glocker
Francesca Toni
79
6
0
05 Jul 2022
Discrete Tree Flows via Tree-Structured Permutations
Discrete Tree Flows via Tree-Structured Permutations
Mai Elkady
Jim Lim
David I. Inouye
TPM
76
2
0
04 Jul 2022
TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of
  3D Human Motions and Texts
TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts
Chuan Guo
Xinxin Xuo
Sen Wang
Li Cheng
VGen
199
244
0
04 Jul 2022
TMGAN-PLC: Audio Packet Loss Concealment using Temporal Memory
  Generative Adversarial Network
TMGAN-PLC: Audio Packet Loss Concealment using Temporal Memory Generative Adversarial Network
Yuansheng Guan
Guochen Yu
Andong Li
C. Zheng
Jie Wang
131
9
0
04 Jul 2022
High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled
  Conditions
High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions
Sangyun Lee
Gyojung Gu
S. Park
Seunghwan Choi
Jaegul Choo
DiffM
182
138
0
28 Jun 2022
Perspective (In)consistency of Paint by Text
Perspective (In)consistency of Paint by Text
Hany Farid
DiffM
83
37
0
27 Jun 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
211
121
0
23 Jun 2022
Entropy-driven Sampling and Training Scheme for Conditional Diffusion
  Generation
Entropy-driven Sampling and Training Scheme for Conditional Diffusion Generation
Sheng-liang Li
Guangcong Zheng
Haibo Wang
Taiping Yao
Yang Chen
Shoudong Ding
Xi Li
DiffM
103
22
0
23 Jun 2022
Generative Modelling With Inverse Heat Dissipation
Generative Modelling With Inverse Heat Dissipation
Severi Rissanen
Markus Heinonen
Arno Solin
DiffM
179
121
0
21 Jun 2022
Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use
  Case
Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case
Clément Chadebec
Louis J. Vincent
S. Allassonnière
DRL
98
30
0
16 Jun 2022
DeepJSCC-Q: Constellation Constrained Deep Joint Source-Channel Coding
DeepJSCC-Q: Constellation Constrained Deep Joint Source-Channel Coding
Tze-Yang Tung
David Burth Kurka
Mikolaj Jankowski
Deniz Gunduz
73
80
0
16 Jun 2022
Previous
123...141516...212223
Next