ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.00446
  4. Cited By
Generating Diverse High-Fidelity Images with VQ-VAE-2

Generating Diverse High-Fidelity Images with VQ-VAE-2

2 June 2019
Ali Razavi
Aaron van den Oord
Oriol Vinyals
    DRLBDL
ArXiv (abs)PDFHTML

Papers citing "Generating Diverse High-Fidelity Images with VQ-VAE-2"

50 / 1,128 papers shown
Title
Discrete Contrastive Diffusion for Cross-Modal Music and Image
  Generation
Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation
Ye Zhu
Yuehua Wu
Kyle Olszewski
Jian Ren
Sergey Tulyakov
Yan Yan
DiffM
111
49
0
15 Jun 2022
PILC: Practical Image Lossless Compression with an End-to-end GPU
  Oriented Neural Framework
PILC: Practical Image Lossless Compression with an End-to-end GPU Oriented Neural Framework
Ning Kang
Shanzhao Qiu
Shifeng Zhang
Zhenguo Li
Shutao Xia
62
19
0
10 Jun 2022
Patch-based Object-centric Transformers for Efficient Video Generation
Patch-based Object-centric Transformers for Efficient Video Generation
Wilson Yan
Ryogo Okumura
Stephen James
Pieter Abbeel
DiffMViT
93
6
0
08 Jun 2022
Generating Long Videos of Dynamic Scenes
Generating Long Videos of Dynamic Scenes
Tim Brooks
Janne Hellsten
M. Aittala
Ting-Chun Wang
Timo Aila
J. Lehtinen
Xuan Li
Alexei A. Efros
Tero Karras
SyDa
104
114
0
07 Jun 2022
Intra-agent speech permits zero-shot task acquisition
Intra-agent speech permits zero-shot task acquisition
Chen Yan
Federico Carnevale
Petko Georgiev
Adam Santoro
Aurelia Guy
Alistair Muldal
Chia-Chun Hung
Josh Abramson
Timothy Lillicrap
Greg Wayne
LM&Ro
99
9
0
07 Jun 2022
Blended Latent Diffusion
Blended Latent Diffusion
Omri Avrahami
Ohad Fried
Dani Lischinski
DiffM
198
393
0
06 Jun 2022
Variable-rate hierarchical CPC leads to acoustic unit discovery in
  speech
Variable-rate hierarchical CPC leads to acoustic unit discovery in speech
Santiago Cuervo
Adrian Lañcucki
R. Marxer
Paweł Rychlikowski
J. Chorowski
SSL
87
13
0
05 Jun 2022
Recognition of Unseen Bird Species by Learning from Field Guides
Recognition of Unseen Bird Species by Learning from Field Guides
Andrés C. Rodríguez
Stefano Dáronco
Rodrigo Caye Daudt
Jan Dirk Wegner
Konrad Schindler
68
1
0
03 Jun 2022
Improving Diffusion Models for Inverse Problems using Manifold
  Constraints
Improving Diffusion Models for Inverse Problems using Manifold Constraints
Hyungjin Chung
Byeongsu Sim
Dohoon Ryu
J. C. Ye
DiffMMedIm
256
475
0
02 Jun 2022
PAGER: Progressive Attribute-Guided Extendable Robust Image Generation
PAGER: Progressive Attribute-Guided Extendable Robust Image Generation
Zohreh Azizi
C.-C. Jay Kuo
VLMDiffMGAN
89
9
0
01 Jun 2022
VALHALLA: Visual Hallucination for Machine Translation
VALHALLA: Visual Hallucination for Machine Translation
Yi Li
Yikang Shen
Yoon Kim
Chun-Fu Chen
Rogerio Feris
David D. Cox
Nuno Vasconcelos
MLLM
155
40
0
31 May 2022
Text2Human: Text-Driven Controllable Human Image Generation
Text2Human: Text-Driven Controllable Human Image Generation
Yuming Jiang
Shuai Yang
Haonan Qiu
Wayne Wu
Chen Change Loy
Ziwei Liu
DiffM
184
48
0
31 May 2022
From Keypoints to Object Landmarks via Self-Training Correspondence: A
  novel approach to Unsupervised Landmark Discovery
From Keypoints to Object Landmarks via Self-Training Correspondence: A novel approach to Unsupervised Landmark Discovery
Dimitrios Mallis
Enrique Sanchez
Matt Bell
Georgios Tzimiropoulos
SSL3DPC
110
7
0
31 May 2022
Unsupervised Image Representation Learning with Deep Latent Particles
Unsupervised Image Representation Learning with Deep Latent Particles
Tal Daniel
Aviv Tamar
OCLSSL
76
12
0
31 May 2022
Few-Shot Diffusion Models
Few-Shot Diffusion Models
Giorgio Giannone
Didrik Nielsen
Ole Winther
DiffM
245
51
0
30 May 2022
Improving VAE-based Representation Learning
Improving VAE-based Representation Learning
Mingtian Zhang
Tim Z. Xiao
Brooks Paige
David Barber
SSLDRL
80
10
0
28 May 2022
Video2StyleGAN: Disentangling Local and Global Variations in a Video
Video2StyleGAN: Disentangling Local and Global Variations in a Video
Rameen Abdal
Peihao Zhu
Niloy J. Mitra
Peter Wonka
VGen
85
7
0
27 May 2022
Scalable Multi-Agent Model-Based Reinforcement Learning
Scalable Multi-Agent Model-Based Reinforcement Learning
Vladimir Egorov
A. Shpilman
92
27
0
25 May 2022
Structured Uncertainty in the Observation Space of Variational
  Autoencoders
Structured Uncertainty in the Observation Space of Variational Autoencoders
James A. G. Langley
M. Monteiro
Charles Jones
Nick Pawlowski
Ben Glocker
CMLOODBDLDRL
75
2
0
25 May 2022
Emergent Communication through Metropolis-Hastings Naming Game with Deep
  Generative Models
Emergent Communication through Metropolis-Hastings Naming Game with Deep Generative Models
T. Taniguchi
Yuto Yoshida
Akira Taniguchi
Y. Hagiwara
MLLM
73
25
0
24 May 2022
M6-Fashion: High-Fidelity Multi-modal Image Generation and Editing
M6-Fashion: High-Fidelity Multi-modal Image Generation and Editing
Zhikang Li
Huiling Zhou
Shuai Bai
Peike Li
Chang Zhou
Hongxia Yang
83
4
0
24 May 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
765
6,107
0
23 May 2022
Transformer-based out-of-distribution detection for clinically safe
  segmentation
Transformer-based out-of-distribution detection for clinically safe segmentation
M. Graham
Petru-Daniel Tudosiu
P. Wright
W. H. Pinaya
J. U-King-im
...
H. Jäger
D. Werring
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
MedIm
93
21
0
21 May 2022
Self-Supervised Speech Representation Learning: A Review
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSLAI4TS
302
368
0
21 May 2022
Improvements to Self-Supervised Representation Learning for Masked Image
  Modeling
Improvements to Self-Supervised Representation Learning for Masked Image Modeling
Jia-ju Mao
Xuesong Yin
Yuan Chang
Honggu Zhou
SSL
52
1
0
21 May 2022
Deterministic training of generative autoencoders using invertible
  layers
Deterministic training of generative autoencoders using invertible layers
Gianluigi Silvestri
Daan Roos
L. Ambrogioni
TPM
84
2
0
19 May 2022
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed
  Stochastic Quantization
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization
Yuhta Takida
Takashi Shibuya
Wei-Hsiang Liao
Chieh-Hsin Lai
Junki Ohmura
Toshimitsu Uesaka
Naoki Murata
Shusuke Takahashi
Toshiyuki Kumakura
Yuki Mitsufuji
BDL
87
67
0
16 May 2022
VQFR: Blind Face Restoration with Vector-Quantized Dictionary and
  Parallel Decoder
VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder
Yuchao Gu
Xintao Wang
Liangbin Xie
Chao Dong
Gengyan Li
Ying Shan
Mingg-Ming Cheng
82
124
0
13 May 2022
Reduce Information Loss in Transformers for Pluralistic Image Inpainting
Reduce Information Loss in Transformers for Pluralistic Image Inpainting
Qiankun Liu
Zhentao Tan
Dongdong Chen
Qi Chu
Xiyang Dai
Yinpeng Chen
Mengchen Liu
Lu Yuan
Nenghai Yu
ViT
87
70
0
10 May 2022
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level
  Quality
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality
Xu Tan
Jiawei Chen
Haohe Liu
Jian Cong
Chen Zhang
...
Lei He
Frank Soong
Tao Qin
Sheng Zhao
Tie-Yan Liu
165
221
0
09 May 2022
An Analysis of Generative Methods for Multiple Image Inpainting
An Analysis of Generative Methods for Multiple Image Inpainting
C. Ballester
Aurélie Bugeau
Samuel Hurault
S. Parisotto
Patricia Vitoria
69
3
0
04 May 2022
End-to-End Visual Editing with a Generatively Pre-Trained Artist
End-to-End Visual Editing with a Generatively Pre-Trained Artist
A. Brown
Cheng-Yang Fu
Omkar M. Parkhi
Tamara L. Berg
Andrea Vedaldi
DiffM
89
8
0
03 May 2022
Subspace Diffusion Generative Models
Subspace Diffusion Generative Models
Bowen Jing
Gabriele Corso
Renato Berlinghieri
Tommi Jaakkola
DiffM
101
78
0
03 May 2022
Learning Discrete Structured Variational Auto-Encoder using Natural
  Evolution Strategies
Learning Discrete Structured Variational Auto-Encoder using Natural Evolution Strategies
Alon Berliner
Guy Rotman
Yossi Adi
Roi Reichart
Tamir Hazan
BDLDRL
82
4
0
03 May 2022
Can deep learning match the efficiency of human visual long-term memory
  in storing object details?
Can deep learning match the efficiency of human visual long-term memory in storing object details?
Emin Orhan
VLMOCL
131
0
0
27 Apr 2022
Semi-Parametric Neural Image Synthesis
Semi-Parametric Neural Image Synthesis
A. Blattmann
Robin Rombach
Kaan Oktay
Jonas Muller
Bjorn Ommer
DiffM
111
31
0
25 Apr 2022
PhysioGAN: Training High Fidelity Generative Model for Physiological
  Sensor Readings
PhysioGAN: Training High Fidelity Generative Model for Physiological Sensor Readings
M. Alzantot
L. Garcia
Mani B. Srivastava
52
1
0
25 Apr 2022
Learn from Unpaired Data for Image Restoration: A Variational Bayes
  Approach
Learn from Unpaired Data for Image Restoration: A Variational Bayes Approach
Dihan Zheng
Xiaowen Zhang
Kaisheng Ma
Chenglong Bao
DiffM
81
23
0
21 Apr 2022
Neural Space-filling Curves
Neural Space-filling Curves
Hanyu Wang
Kamal Gupta
Larry S. Davis
Abhinav Shrivastava
64
2
0
18 Apr 2022
Unconditional Image-Text Pair Generation with Multimodal Cross Quantizer
Unconditional Image-Text Pair Generation with Multimodal Cross Quantizer
Hyungyu Lee
Sungjin Park
Joonseok Lee
Edward Choi
72
2
0
15 Apr 2022
Diagnosing and Fixing Manifold Overfitting in Deep Generative Models
Diagnosing and Fixing Manifold Overfitting in Deep Generative Models
Gabriel Loaiza-Ganem
Brendan Leigh Ross
Jesse C. Cresswell
M. Volkovs
GANDRL
119
31
0
14 Apr 2022
Controllable Video Generation through Global and Local Motion Dynamics
Controllable Video Generation through Global and Local Motion Dynamics
A. Davtyan
Paolo Favaro
52
9
0
13 Apr 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLMDiffM
522
6,946
0
13 Apr 2022
ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise
  Semantic Alignment and Generation
ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation
Jianan Wang
Guansong Lu
Hang Xu
Zhenguo Li
Chunjing Xu
Yanwei Fu
114
17
0
09 Apr 2022
Simple and Effective Synthesis of Indoor 3D Scenes
Simple and Effective Synthesis of Indoor 3D Scenes
Jing Yu Koh
Harsh Agrawal
Dhruv Batra
Richard Tucker
Austin Waters
Honglak Lee
Yinfei Yang
Jason Baldridge
Peter Anderson
VGen3DV
143
30
0
06 Apr 2022
Autoregressive 3D Shape Generation via Canonical Mapping
Autoregressive 3D Shape Generation via Canonical Mapping
A. Cheng
Xueting Li
Sifei Liu
Min Sun
Ming-Hsuan Yang
3DPC
98
41
0
05 Apr 2022
High-Quality Pluralistic Image Completion via Code Shared VQGAN
High-Quality Pluralistic Image Completion via Code Shared VQGAN
Chuanxia Zheng
Guoxian Song
Tat-Jen Cham
Jianfei Cai
Dinh Q. Phung
Linjie Luo
VLM
85
10
0
05 Apr 2022
Cancer Subtyping via Embedded Unsupervised Learning on Transcriptomics
  Data
Cancer Subtyping via Embedded Unsupervised Learning on Transcriptomics Data
Ziwei Yang
Lingwei Zhu
Zheng Chen
Ming Huang
N. Ono
M. Altaf-Ul-Amin
Shigehiko Kanaya
20
2
0
02 Apr 2022
Quantized GAN for Complex Music Generation from Dance Videos
Quantized GAN for Complex Music Generation from Dance Videos
Ye Zhu
Kyle Olszewski
Yuehua Wu
Panos Achlioptas
Menglei Chai
Yan Yan
Sergey Tulyakov
MGen
118
46
0
01 Apr 2022
Generating High Fidelity Data from Low-density Regions using Diffusion
  Models
Generating High Fidelity Data from Low-density Regions using Diffusion Models
Vikash Sehwag
C. Hazirbas
Albert Gordo
Firat Ozgenel
Cristian Canton Ferrer
DiffM
119
71
0
31 Mar 2022
Previous
123...151617...212223
Next