ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.00446
  4. Cited By
Generating Diverse High-Fidelity Images with VQ-VAE-2

Generating Diverse High-Fidelity Images with VQ-VAE-2

2 June 2019
Ali Razavi
Aaron van den Oord
Oriol Vinyals
    DRLBDL
ArXiv (abs)PDFHTML

Papers citing "Generating Diverse High-Fidelity Images with VQ-VAE-2"

50 / 1,128 papers shown
Title
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
Oran Gafni
Adam Polyak
Oron Ashual
Shelly Sheynin
Devi Parikh
Yaniv Taigman
DiffM
104
526
0
24 Mar 2022
Pixel VQ-VAEs for Improved Pixel Art Representation
Pixel VQ-VAEs for Improved Pixel Art Representation
Akash Saravanan
Matthew J. Guzdial
58
8
0
23 Mar 2022
Interpreting Class Conditional GANs with Channel Awareness
Interpreting Class Conditional GANs with Channel Awareness
Yin-Yin He
Zhiyi Zhang
Jiapeng Zhu
Yujun Shen
Qifeng Chen
GAN
69
1
0
21 Mar 2022
PublicCheck: Public Integrity Verification for Services of Run-time Deep
  Models
PublicCheck: Public Integrity Verification for Services of Run-time Deep Models
Shuo Wang
Sharif Abuadbba
Sidharth Agarwal
Kristen Moore
Ruoxi Sun
Minhui Xue
Surya Nepal
S. Çamtepe
S. Kanhere
HILM
89
7
0
21 Mar 2022
Unified Multivariate Gaussian Mixture for Efficient Neural Image
  Compression
Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression
Xiaosu Zhu
Jingkuan Song
Lianli Gao
Fengcai Zheng
Hengtao Shen
62
64
0
21 Mar 2022
ViewFormer: NeRF-free Neural Rendering from Few Images Using
  Transformers
ViewFormer: NeRF-free Neural Rendering from Few Images Using Transformers
Jonávs Kulhánek
Erik Derner
Torsten Sattler
Robert Babuvska
ViT
117
75
0
18 Mar 2022
Decouple-and-Sample: Protecting sensitive information in task agnostic
  data release
Decouple-and-Sample: Protecting sensitive information in task agnostic data release
Abhishek Singh
Ethan Garza
Ayush Chopra
Praneeth Vepakomma
Vivek Sharma
Ramesh Raskar
76
7
0
17 Mar 2022
AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation
AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation
Paritosh Mittal
Y. Cheng
Maneesh Singh
Shubham Tulsiani
135
230
0
17 Mar 2022
Look Outside the Room: Synthesizing A Consistent Long-Term 3D Scene
  Video from A Single Image
Look Outside the Room: Synthesizing A Consistent Long-Term 3D Scene Video from A Single Image
Xuanchi Ren
Xiaolong Wang
VGen
108
58
0
17 Mar 2022
Contrastive Learning with Positive-Negative Frame Mask for Music
  Representation
Contrastive Learning with Positive-Negative Frame Mask for Music Representation
Dongyu Yao
Zhou Zhao
Shengyu Zhang
Jieming Zhu
Yudong Zhu
Rui Zhang
Xiuqiang He
60
22
0
17 Mar 2022
Implicit Feature Decoupling with Depthwise Quantization
Implicit Feature Decoupling with Depthwise Quantization
Iordanis Fostiropoulos
Barry W. Boehm
55
2
0
15 Mar 2022
Style Transformer for Image Inversion and Editing
Style Transformer for Image Inversion and Editing
Xueqi Hu
Qiusheng Huang
Zhengyi Shi
Siyuan Li
Changxin Gao
Li Sun
Qingli Li
89
56
0
15 Mar 2022
A review of Generative Adversarial Networks for Electronic Health
  Records: applications, evaluation measures and data sources
A review of Generative Adversarial Networks for Electronic Health Records: applications, evaluation measures and data sources
Ghadeer O. Ghosheh
Jin Li
T. Zhu
103
42
0
14 Mar 2022
Semi-Discrete Normalizing Flows through Differentiable Tessellation
Semi-Discrete Normalizing Flows through Differentiable Tessellation
Ricky T. Q. Chen
Brandon Amos
Maximilian Nickel
90
10
0
14 Mar 2022
The Role of ImageNet Classes in Fréchet Inception Distance
The Role of ImageNet Classes in Fréchet Inception Distance
Tuomas Kynkaanniemi
Tero Karras
M. Aittala
Timo Aila
J. Lehtinen
EGVMVLM
183
213
0
11 Mar 2022
FlexIT: Towards Flexible Semantic Image Translation
FlexIT: Towards Flexible Semantic Image Translation
Guillaume Couairon
Asya Grechka
Jakob Verbeek
Holger Schwenk
Matthieu Cord
DiffM
114
38
0
09 Mar 2022
ChiTransformer:Towards Reliable Stereo from Cues
ChiTransformer:Towards Reliable Stereo from Cues
Qing Su
Shihao Ji
MDEViT
79
14
0
09 Mar 2022
Hierarchical Sketch Induction for Paraphrase Generation
Hierarchical Sketch Induction for Paraphrase Generation
Tom Hosking
Hao Tang
Mirella Lapata
BDL
123
32
0
07 Mar 2022
Show Me What and Tell Me How: Video Synthesis via Multimodal
  Conditioning
Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning
Ligong Han
Jian Ren
Hsin-Ying Lee
Francesco Barbieri
Kyle Olszewski
Shervin Minaee
Dimitris N. Metaxas
Sergey Tulyakov
DiffMVGen
145
41
0
04 Mar 2022
UVCGAN: UNet Vision Transformer cycle-consistent GAN for unpaired
  image-to-image translation
UVCGAN: UNet Vision Transformer cycle-consistent GAN for unpaired image-to-image translation
D. Torbunov
Yi Huang
Haiwang Yu
Jin-zhi Huang
Shinjae Yoo
Meifeng Lin
B. Viren
Yihui Ren
ViT
117
85
0
04 Mar 2022
Polarity Sampling: Quality and Diversity Control of Pre-Trained
  Generative Networks via Singular Values
Polarity Sampling: Quality and Diversity Control of Pre-Trained Generative Networks via Singular Values
Ahmed Imtiaz Humayun
Randall Balestriero
Richard Baraniuk
88
32
0
03 Mar 2022
Autoregressive Image Generation using Residual Quantization
Autoregressive Image Generation using Residual Quantization
Doyup Lee
Chiheon Kim
Saehoon Kim
Minsu Cho
Wook-Shin Han
VGen
298
378
0
03 Mar 2022
Variational Autoencoders Without the Variation
Variational Autoencoders Without the Variation
Gregory A. Daly
J. Fieldsend
G. Tabor
68
2
0
01 Mar 2022
CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP
CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP
Zihao Wang
Wei Liu
Qian He
Xin-ru Wu
Zili Yi
CLIPVLM
273
75
0
01 Mar 2022
Real-World Blind Super-Resolution via Feature Matching with Implicit
  High-Resolution Priors
Real-World Blind Super-Resolution via Feature Matching with Implicit High-Resolution Priors
Chaofeng Chen
Xinyu Shi
Yipeng Qin
Xiaoming Li
Xiaoguang Han
Taojiannan Yang
Shihui Guo
115
118
0
26 Feb 2022
Retriever: Learning Content-Style Representation as a Token-Level
  Bipartite Graph
Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph
Dacheng Yin
Xuanchi Ren
Chong Luo
Yuwang Wang
Zhiwei Xiong
Wenjun Zeng
114
13
0
24 Feb 2022
Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial
  Auto-Encoders
Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-Encoders
Huangjie Zheng
Pengcheng He
Weizhu Chen
Mingyuan Zhou
DiffM
98
46
0
19 Feb 2022
Voice Filter: Few-shot text-to-speech speaker adaptation using voice
  conversion as a post-processing module
Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module
Adam Gabry's
Goeric Huybrechts
M. Ribeiro
C. Chien
Julian Roth
Giulia Comini
Roberto Barra-Chicote
Bartek Perz
Jaime Lorenzo-Trueba
82
21
0
16 Feb 2022
NÜWA-LIP: Language Guided Image Inpainting with Defect-free VQGAN
NÜWA-LIP: Language Guided Image Inpainting with Defect-free VQGAN
Minheng Ni
Chenfei Wu
Haoyang Huang
Daxin Jiang
W. Zuo
Nan Duan
67
19
0
10 Feb 2022
Diffusion bridges vector quantized Variational AutoEncoders
Diffusion bridges vector quantized Variational AutoEncoders
Max H. Cohen
Guillaume Quispe
Sylvain Le Corff
Charles Ollion
Eric Moulines
DiffM
92
15
0
10 Feb 2022
MaskGIT: Masked Generative Image Transformer
MaskGIT: Masked Generative Image Transformer
Huiwen Chang
Han Zhang
Lu Jiang
Ce Liu
William T. Freeman
ViT
196
696
0
08 Feb 2022
DALL-Eval: Probing the Reasoning Skills and Social Biases of
  Text-to-Image Generation Models
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models
Jaemin Cho
Abhaysinh Zala
Joey Tianyi Zhou
ViT
260
193
0
08 Feb 2022
On the Pitfalls of Using the Residual Error as Anomaly Score
On the Pitfalls of Using the Residual Error as Anomaly Score
Felix Meissen
Benedikt Wiestler
Georgios Kaissis
Daniel Rueckert
UQCV
81
12
0
08 Feb 2022
VAEL: Bridging Variational Autoencoders and Probabilistic Logic
  Programming
VAEL: Bridging Variational Autoencoders and Probabilistic Logic Programming
Eleonora Misino
G. Marra
Emanuele Sansone
84
26
0
07 Feb 2022
Robust Vector Quantized-Variational Autoencoder
Chieh-Hsin Lai
Dongmian Zou
Gilad Lerman
DRL
126
6
0
04 Feb 2022
Posterior Matching for Arbitrary Conditioning
Posterior Matching for Arbitrary Conditioning
R. Strauss
Junier B. Oliva
CMLBDL
112
6
0
28 Jan 2022
From data to functa: Your data point is a function and you can treat it
  like one
From data to functa: Your data point is a function and you can treat it like one
Emilien Dupont
Hyunjik Kim
S. M. Ali Eslami
Danilo Jimenez Rezende
Dan Rosenbaum
TDI3DPC
284
159
0
28 Jan 2022
ShapeFormer: Transformer-based Shape Completion via Sparse
  Representation
ShapeFormer: Transformer-based Shape Completion via Sparse Representation
Xingguang Yan
Liqiang Lin
Niloy J. Mitra
Dani Lischinski
Daniel Cohen-Or
Hui Huang
ViT
197
118
0
25 Jan 2022
Parallel Neural Local Lossless Compression
Parallel Neural Local Lossless Compression
Mingtian Zhang
James Townsend
Ning Kang
David Barber
71
7
0
13 Jan 2022
Reproducible, incremental representation learning with Rosetta VAE
Reproducible, incremental representation learning with Rosetta VAE
Miles Martinez
John M. Pearson
DRL
49
1
0
13 Jan 2022
BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations
BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations
Daiqing Li
Huan Ling
Seung Wook Kim
Karsten Kreis
Adela Barriuso
Sanja Fidler
Antonio Torralba
148
107
0
12 Jan 2022
A Physics-Informed Vector Quantized Autoencoder for Data Compression of
  Turbulent Flow
A Physics-Informed Vector Quantized Autoencoder for Data Compression of Turbulent Flow
M. Momenifar
Enmao Diao
Vahid Tarokh
A. Bragg
AI4CE
66
4
0
10 Jan 2022
DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from
  Low-Dimensional Latents
DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents
Kushagra Pandey
Avideep Mukherjee
Piyush Rai
Abhishek Kumar
DiffM
159
121
0
02 Jan 2022
Evaluating Deep Music Generation Methods Using Data Augmentation
Evaluating Deep Music Generation Methods Using Data Augmentation
Toby Godwin
Georgios Rizos
Alice Baird
N. A. Futaisi
Vincent Brisse
Bjoern W. Schuller
MGen
34
1
0
31 Dec 2021
Multimodal Image Synthesis and Editing: The Generative AI Era
Multimodal Image Synthesis and Editing: The Generative AI Era
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
212
51
0
27 Dec 2021
Integrating Material Selection with Design Optimization via Neural
  Networks
Integrating Material Selection with Design Optimization via Neural Networks
A. Chandrasekhar
S. Sridhara
K. Suresh
AI4CE
65
6
0
23 Dec 2021
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
729
15,868
0
20 Dec 2021
GLIDE: Towards Photorealistic Image Generation and Editing with
  Text-Guided Diffusion Models
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
528
3,641
0
20 Dec 2021
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs
Zhisheng Xiao
Karsten Kreis
Arash Vahdat
DiffM
133
562
0
15 Dec 2021
Variational autoencoders in the presence of low-dimensional data:
  landscape and implicit bias
Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias
Frederic Koehler
Viraj Mehta
Chenghui Zhou
Andrej Risteski
DRL
89
13
0
13 Dec 2021
Previous
123...161718...212223
Next