ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.00446
  4. Cited By
Generating Diverse High-Fidelity Images with VQ-VAE-2

Generating Diverse High-Fidelity Images with VQ-VAE-2

2 June 2019
Ali Razavi
Aaron van den Oord
Oriol Vinyals
    DRLBDL
ArXiv (abs)PDFHTML

Papers citing "Generating Diverse High-Fidelity Images with VQ-VAE-2"

50 / 1,128 papers shown
Title
Master Face Attacks on Face Recognition Systems
Master Face Attacks on Face Recognition Systems
H. Nguyen
S´ebastien Marcel
Junichi Yamagishi
Isao Echizen
CVBM
68
18
0
08 Sep 2021
Learning Interpretable Representations of Entanglement in Quantum Optics
  Experiments using Deep Generative Models
Learning Interpretable Representations of Entanglement in Quantum Optics Experiments using Deep Generative Models
Daniel Flam-Shepherd
Tony C Wu
Xuemei Gu
Alba Cervera-Lierta
Mario Krenn
Alán Aspuru-Guzik
DRL
64
21
0
06 Sep 2021
Bilateral Denoising Diffusion Models
Bilateral Denoising Diffusion Models
Max W. Y. Lam
Jun Wang
Rongjie Huang
Jane Polak Scowcroft
Dong Yu
DiffM
103
43
0
26 Aug 2021
Improving Visual Quality of Unrestricted Adversarial Examples with
  Wavelet-VAE
Improving Visual Quality of Unrestricted Adversarial Examples with Wavelet-VAE
Wenzhao Xiang
Chang-rui Liu
Shibao Zheng
49
2
0
25 Aug 2021
ImageBART: Bidirectional Context with Multinomial Diffusion for
  Autoregressive Image Synthesis
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
Patrick Esser
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
119
162
0
19 Aug 2021
Transformers predicting the future. Applying attention in next-frame and
  time series forecasting
Transformers predicting the future. Applying attention in next-frame and time series forecasting
Radostin Cholakov
T. Kolev
AI4TS
55
17
0
18 Aug 2021
PixelSynth: Generating a 3D-Consistent Experience from a Single Image
PixelSynth: Generating a 3D-Consistent Experience from a Single Image
C. Rockwell
David Fouhey
Justin Johnson
VGen
148
86
0
12 Aug 2021
Sketch Your Own GAN
Sketch Your Own GAN
Sheng-Yu Wang
David Bau
Jun-Yan Zhu
GAN
113
73
0
05 Aug 2021
A Survey on Audio Synthesis and Audio-Visual Multimodal Processing
A Survey on Audio Synthesis and Audio-Visual Multimodal Processing
Zhaofeng Shi
73
7
0
01 Aug 2021
Cross-Camera Feature Prediction for Intra-Camera Supervised Person
  Re-identification across Distant Scenes
Cross-Camera Feature Prediction for Intra-Camera Supervised Person Re-identification across Distant Scenes
Wenhang Ge
Chunyan Pan
Ancong Wu
Hongwei Zheng
Weishi Zheng
74
26
0
29 Jul 2021
Improving Robot Localisation by Ignoring Visual Distraction
Improving Robot Localisation by Ignoring Visual Distraction
Oscar Alejandro Mendez Maldonado
M. Vowels
Richard Bowden
40
1
0
25 Jul 2021
Abstract Reasoning via Logic-guided Generation
Abstract Reasoning via Logic-guided Generation
Sihyun Yu
Sangwoo Mo
SungSoo Ahn
Jinwoo Shin
82
6
0
22 Jul 2021
Generative Models for Security: Attacks, Defenses, and Opportunities
Generative Models for Security: Attacks, Defenses, and Opportunities
L. A. Bauer
Vincent Bindschaedler
114
4
0
21 Jul 2021
Towards Privacy-preserving Explanations in Medical Image Analysis
Towards Privacy-preserving Explanations in Medical Image Analysis
H. Montenegro
W. Silva
Jaime S. Cardoso
50
7
0
20 Jul 2021
Data synthesis and adversarial networks: A review and meta-analysis in
  cancer imaging
Data synthesis and adversarial networks: A review and meta-analysis in cancer imaging
Richard Osuala
Kaisar Kushibar
Lidia Garrucho
Akis Linardos
Zuzanna Szafranowska
Stefan Klein
Ben Glocker
Oliver Díaz
Karim Lekadir
MedIm
127
45
0
20 Jul 2021
Learning De-identified Representations of Prosody from Raw Audio
Learning De-identified Representations of Prosody from Raw Audio
J. Weston
R. Lenain
U. Meepegama
E. Fristed
SSL
68
17
0
17 Jul 2021
SoundStream: An End-to-End Neural Audio Codec
SoundStream: An End-to-End Neural Audio Codec
Neil Zeghidour
Alejandro Luebs
Ahmed Omran
Jan Skoglund
Marco Tagliasacchi
AI4TS
120
806
0
07 Jul 2021
Detecting Outliers with Poisson Image Interpolation
Detecting Outliers with Poisson Image Interpolation
Jeremy Tan
Benjamin Hou
Thomas Day
J. Simpson
Daniel Rueckert
Bernhard Kainz
MedIm
68
52
0
06 Jul 2021
CoReD: Generalizing Fake Media Detection with Continual Representation
  using Distillation
CoReD: Generalizing Fake Media Detection with Continual Representation using Distillation
Minhan Kim
Shahroz Tariq
Simon S. Woo
CLL
106
50
0
06 Jul 2021
Long-Short Transformer: Efficient Transformers for Language and Vision
Long-Short Transformer: Efficient Transformers for Language and Vision
Chen Zhu
Ming-Yu Liu
Chaowei Xiao
Mohammad Shoeybi
Tom Goldstein
Anima Anandkumar
Bryan Catanzaro
ViTVLM
134
133
0
05 Jul 2021
Exploring the Latent Space of Autoencoders with Interventional Assays
Exploring the Latent Space of Autoencoders with Interventional Assays
Felix Leeb
Stefan Bauer
M. Besserve
Bernhard Schölkopf
DRL
123
18
0
30 Jun 2021
Out-of-distribution Generalization in the Presence of Nuisance-Induced
  Spurious Correlations
Out-of-distribution Generalization in the Presence of Nuisance-Induced Spurious Correlations
A. Puli
Lily H. Zhang
Eric K. Oermann
Rajesh Ranganath
OODOODD
87
49
0
29 Jun 2021
Dizygotic Conditional Variational AutoEncoder for Multi-Modal and
  Partial Modality Absent Few-Shot Learning
Dizygotic Conditional Variational AutoEncoder for Multi-Modal and Partial Modality Absent Few-Shot Learning
Yi Zhang
Sheng Huang
Xiao-song Peng
Dan Yang
86
9
0
28 Jun 2021
On Incorporating Inductive Biases into VAEs
On Incorporating Inductive Biases into VAEs
Ning Miao
Emile Mathieu
N. Siddharth
Yee Whye Teh
Tom Rainforth
CMLDRL
99
11
0
25 Jun 2021
NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image
  Generation
NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image Generation
Xiaohui Zeng
R. Urtasun
R. Zemel
Sanja Fidler
Renjie Liao
DiffM
45
2
0
25 Jun 2021
Handling Data Heterogeneity with Generative Replay in Collaborative
  Learning for Medical Imaging
Handling Data Heterogeneity with Generative Replay in Collaborative Learning for Medical Imaging
Liangqiong Qu
N. Balachandar
Miao Zhang
D. Rubin
MedIm
96
23
0
24 Jun 2021
Symmetric Wasserstein Autoencoders
Symmetric Wasserstein Autoencoders
S. Sun
Hong Guo
DiffMGAN
64
0
0
24 Jun 2021
VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive
  Learning
VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning
Hao Tan
Jie Lei
Thomas Wolf
Joey Tianyi Zhou
129
67
0
21 Jun 2021
Deep Generative Learning via Schrödinger Bridge
Deep Generative Learning via Schrödinger Bridge
Gefei Wang
Yuling Jiao
Qiang Xu
Yang Wang
Can Yang
DiffMOT
102
103
0
19 Jun 2021
Cascading Modular Network (CAM-Net) for Multimodal Image Synthesis
Cascading Modular Network (CAM-Net) for Multimodal Image Synthesis
Shichong Peng
Alireza Moazeni
Ke Li
GAN
76
0
0
16 Jun 2021
Discrete Auto-regressive Variational Attention Models for Text Modeling
Discrete Auto-regressive Variational Attention Models for Text Modeling
Xianghong Fang
Haoli Bai
Jian Li
Zenglin Xu
Michael Lyu
Irwin King
73
3
0
16 Jun 2021
Multi-Resolution Continuous Normalizing Flows
Multi-Resolution Continuous Normalizing Flows
Vikram S. Voleti
Chris Finlay
Adam M. Oberman
Christopher Pal
102
4
0
15 Jun 2021
BEiT: BERT Pre-Training of Image Transformers
BEiT: BERT Pre-Training of Image Transformers
Hangbo Bao
Li Dong
Songhao Piao
Furu Wei
ViT
447
2,858
0
15 Jun 2021
Divergence Frontiers for Generative Models: Sample Complexity,
  Quantization Effects, and Frontier Integrals
Divergence Frontiers for Generative Models: Sample Complexity, Quantization Effects, and Frontier Integrals
Lang Liu
Krishna Pillutla
Sean Welleck
Sewoong Oh
Yejin Choi
Zaïd Harchaoui
MQ
107
14
0
15 Jun 2021
Improved Transformer for High-Resolution GANs
Improved Transformer for High-Resolution GANs
Long Zhao
Zizhao Zhang
Ting Chen
Dimitris N. Metaxas
Han Zhang
ViT
135
96
0
14 Jun 2021
Non Gaussian Denoising Diffusion Models
Non Gaussian Denoising Diffusion Models
Eliya Nachmani
Robin San Roman
Lior Wolf
VLMDiffM
83
50
0
14 Jun 2021
CRASH: Raw Audio Score-based Generative Modeling for Controllable
  High-resolution Drum Sound Synthesis
CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis
Simon Rouard
Gaëtan Hadjeres
DiffM
49
43
0
14 Jun 2021
Inverting Adversarially Robust Networks for Image Synthesis
Inverting Adversarially Robust Networks for Image Synthesis
Renan A. Rojas-Gomez
Raymond A. Yeh
Minh Do
A. Nguyen
68
5
0
13 Jun 2021
D2C: Diffusion-Denoising Models for Few-shot Conditional Generation
D2C: Diffusion-Denoising Models for Few-shot Conditional Generation
Abhishek Sinha
Jiaming Song
Chenlin Meng
Stefano Ermon
VLMDiffM
140
121
0
12 Jun 2021
PriorGrad: Improving Conditional Denoising Diffusion Models with
  Data-Dependent Adaptive Prior
PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior
Sang-gil Lee
Heeseung Kim
Chaehun Shin
Xu Tan
Chang-Shu Liu
Qi Meng
Tao Qin
Wei Chen
Sung-Hoon Yoon
Tie-Yan Liu
DiffM
87
89
0
11 Jun 2021
Score-based Generative Modeling in Latent Space
Score-based Generative Modeling in Latent Space
Arash Vahdat
Karsten Kreis
Jan Kautz
DiffM
153
688
0
10 Jun 2021
Vector Quantized Models for Planning
Vector Quantized Models for Planning
Sherjil Ozair
Yazhe Li
Ali Razavi
Ioannis Antonoglou
Aaron van den Oord
Oriol Vinyals
OffRL
96
51
0
08 Jun 2021
Hierarchical Lovász Embeddings for Proposal-free Panoptic Segmentation
Hierarchical Lovász Embeddings for Proposal-free Panoptic Segmentation
Tommi Kerola
Jie Li
Atsushi Kanehira
Yasunori Kudo
Alexis Vallet
Adrien Gaidon
187
8
0
08 Jun 2021
NWT: Towards natural audio-to-video generation with representation
  learning
NWT: Towards natural audio-to-video generation with representation learning
Rayhane Mama
Marc S. Tyndel
Hashiam Kadhim
Cole Clifford
Ragavan Thurairatnam
VGen
112
12
0
08 Jun 2021
Interpretable agent communication from scratch (with a generic visual
  processor emerging on the side)
Interpretable agent communication from scratch (with a generic visual processor emerging on the side)
Roberto Dessì
Eugene Kharitonov
Marco Baroni
97
28
0
08 Jun 2021
On Training Sample Memorization: Lessons from Benchmarking Generative
  Modeling with a Large-scale Competition
On Training Sample Memorization: Lessons from Benchmarking Generative Modeling with a Large-scale Competition
C. Bai
Hsuan-Tien Lin
Colin Raffel
Wendy Kan
52
35
0
06 Jun 2021
Neural Distributed Source Coding
Neural Distributed Source Coding
Jay Whang
Alliot Nagle
Anish Acharya
Hyeji Kim
A. Dimakis
93
21
0
05 Jun 2021
On Perceptual Lossy Compression: The Cost of Perceptual Reconstruction
  and An Optimal Training Framework
On Perceptual Lossy Compression: The Cost of Perceptual Reconstruction and An Optimal Training Framework
Zeyu Yan
Fei Wen
R. Ying
Chao Ma
Peilin Liu
91
38
0
05 Jun 2021
DISSECT: Disentangled Simultaneous Explanations via Concept Traversals
DISSECT: Disentangled Simultaneous Explanations via Concept Traversals
Asma Ghandeharioun
Been Kim
Chun-Liang Li
Brendan Jou
B. Eoff
Rosalind W. Picard
AAML
110
54
0
31 May 2021
Cascaded Diffusion Models for High Fidelity Image Generation
Cascaded Diffusion Models for High Fidelity Image Generation
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans
319
1,246
0
30 May 2021
Previous
123...181920212223
Next