ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.00446
  4. Cited By
Generating Diverse High-Fidelity Images with VQ-VAE-2

Generating Diverse High-Fidelity Images with VQ-VAE-2

2 June 2019
Ali Razavi
Aaron van den Oord
Oriol Vinyals
    DRLBDL
ArXiv (abs)PDFHTML

Papers citing "Generating Diverse High-Fidelity Images with VQ-VAE-2"

50 / 1,128 papers shown
Title
UbiPhysio: Support Daily Functioning, Fitness, and Rehabilitation with
  Action Understanding and Feedback in Natural Language
UbiPhysio: Support Daily Functioning, Fitness, and Rehabilitation with Action Understanding and Feedback in Natural Language
Chongyang Wang
Yuan Feng
L. Zhong
Siyi Zhu
Fangqiu Yi
...
Chen Liang
Yuntao wang
Chen-Jun He
Chun Yu
Yuanchun Shi
79
6
0
21 Aug 2023
TokenSplit: Using Discrete Speech Representations for Direct, Refined,
  and Transcript-Conditioned Speech Separation and Recognition
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition
Hakan Erdogan
Scott Wisdom
Xuankai Chang
Zalan Borsos
Marco Tagliasacchi
Neil Zeghidour
J. Hershey
85
11
0
21 Aug 2023
Strata-NeRF : Neural Radiance Fields for Stratified Scenes
Strata-NeRF : Neural Radiance Fields for Stratified Scenes
Ankit Dhiman
R. Srinath
Harsh Rangwani
Rishubh Parihar
Lokesh R. Boregowda
Srinath Sridhar
R. Venkatesh Babu
107
4
0
20 Aug 2023
An Efficient 1 Iteration Learning Algorithm for Gaussian Mixture Model
  And Gaussian Mixture Embedding For Neural Network
An Efficient 1 Iteration Learning Algorithm for Gaussian Mixture Model And Gaussian Mixture Embedding For Neural Network
Weiguo Lu
Xuan Wu
Deng Ding
Gangnan Yuan
BDL
66
1
0
18 Aug 2023
UAV-assisted Semantic Communication with Hybrid Action Reinforcement
  Learning
UAV-assisted Semantic Communication with Hybrid Action Reinforcement Learning
Tobias Heuer
Jun Zhao
Kwok-Yan Lam
Qing Yang
22
3
0
18 Aug 2023
Dual Associated Encoder for Face Restoration
Dual Associated Encoder for Face Restoration
Yu-Ju Tsai
Yu-Lun Liu
Lu Qi
Kelvin C. K. Chan
Ming-Hsuan Yang
65
12
0
14 Aug 2023
Neural Categorical Priors for Physics-Based Character Control
Neural Categorical Priors for Physics-Based Character Control
Qing Zhu
He Zhang
Mengting Lan
Lei Han
128
34
0
14 Aug 2023
LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Binbin Yang
Yinzheng Luo
Ziliang Chen
Guangrun Wang
Xiaodan Liang
Liang Lin
DiffM
102
15
0
13 Aug 2023
Controlling Character Motions without Observable Driving Source
Controlling Character Motions without Observable Driving Source
Weiyuan Li
Bin Dai
Ziyi Zhou
Qi Yao
Baoyuan Wang
VGen
53
1
0
11 Aug 2023
A Review of Change of Variable Formulas for Generative Modeling
A Review of Change of Variable Formulas for Generative Modeling
Ullrich Kothe
74
8
0
04 Aug 2023
Synthesising Rare Cataract Surgery Samples with Guided Diffusion Models
Synthesising Rare Cataract Surgery Samples with Guided Diffusion Models
Yannik Frisch
Moritz Fuchs
Antoine Pierre Sanner
F. A. Ucar
Marius Frenzel
Joana Wasielica-Poslednik
A. Gericke
F. Wagner
Thomas Dratsch
Anirban Mukhopadhyay
MedImDiffM
40
9
0
03 Aug 2023
DiffColor: Toward High Fidelity Text-Guided Image Colorization with
  Diffusion Models
DiffColor: Toward High Fidelity Text-Guided Image Colorization with Diffusion Models
Jianxin Lin
Peng Xiao
Yijun Wang
Rongsheng Zhang
Xiangxiang Zeng
DiffM
72
3
0
03 Aug 2023
Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment
  for Markup-to-Image Generation
Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image Generation
Guojin Zhong
Jin Yuan
Pan Wang
Kailun Yang
Weili Guan
Zhiyong Li
DiffM
71
7
0
02 Aug 2023
RGB-D-Fusion: Image Conditioned Depth Diffusion of Humanoid Subjects
RGB-D-Fusion: Image Conditioned Depth Diffusion of Humanoid Subjects
Sascha Kirch
Valeria Olyunina
Jan Ondřej
Rafael Pagés
Sergio Martín
Clara Pérez-Molina
80
2
0
29 Jul 2023
Online Clustered Codebook
Online Clustered Codebook
Chuanxia Zheng
Andrea Vedaldi
104
34
0
27 Jul 2023
GaitMorph: Transforming Gait by Optimally Transporting Discrete Codes
GaitMorph: Transforming Gait by Optimally Transporting Discrete Codes
Adrian Cosma
I. Radoi
107
3
0
27 Jul 2023
Learning Disentangled Discrete Representations
Learning Disentangled Discrete Representations
David Friede
Christian Reimers
Heiner Stuckenschmidt
Mathias Niepert
CoGeOCLOODDRL
92
0
0
26 Jul 2023
Deep Learning Approaches for Data Augmentation in Medical Imaging: A
  Review
Deep Learning Approaches for Data Augmentation in Medical Imaging: A Review
Aghiles Kebaili
J. Lapuyade-Lahorgue
S. Ruan
MedIm
92
157
0
24 Jul 2023
FABRIC: Personalizing Diffusion Models with Iterative Feedback
FABRIC: Personalizing Diffusion Models with Iterative Feedback
Dimitri von Rütte
Elisabetta Fedele
Jonathan Thomm
Lukas Wolf
71
13
0
19 Jul 2023
Towards Authentic Face Restoration with Iterative Diffusion Models and
  Beyond
Towards Authentic Face Restoration with Iterative Diffusion Models and Beyond
Yang Zhao
Tingbo Hou
Yu-Chuan Su
Xuhui Jia. Yandong Li
Matthias Grundmann
DiffM
62
18
0
18 Jul 2023
Diffusion Models Beat GANs on Image Classification
Diffusion Models Beat GANs on Image Classification
Soumik Mukhopadhyay
M. Gwilliam
Vatsal Agarwal
Namitha Padmanabhan
A. Swaminathan
Srinidhi Hegde
Dinesh Manocha
Abhinav Shrivastava
DiffM
165
48
1
17 Jul 2023
Image Captions are Natural Prompts for Text-to-Image Models
Image Captions are Natural Prompts for Text-to-Image Models
Shiye Lei
Hao Chen
Senyang Zhang
Bo Zhao
Dacheng Tao
VLM
117
23
0
17 Jul 2023
Abstracting Concept-Changing Rules for Solving Raven's Progressive
  Matrix Problems
Abstracting Concept-Changing Rules for Solving Raven's Progressive Matrix Problems
Fan Shi
Bin Li
Xiangyang Xue
LRM
87
10
0
15 Jul 2023
Augmented Co-Speech Gesture Generation: Including Form and Meaning
  Features to Guide Learning-Based Gesture Synthesis
Augmented Co-Speech Gesture Generation: Including Form and Meaning Features to Guide Learning-Based Gesture Synthesis
Hendric Voss
S. Kopp
SLR
85
4
0
13 Jul 2023
S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized
  Variational Autoencoder for Video Prediction
S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction
Mohammad Adiban
Kalin Stefanov
Sabato Marco Siniscalchi
G. Salvi
91
2
0
13 Jul 2023
Hierarchical Autoencoder-based Lossy Compression for Large-scale
  High-resolution Scientific Data
Hierarchical Autoencoder-based Lossy Compression for Large-scale High-resolution Scientific Data
Hieu Le
Jián Tao
AI4CE
69
2
0
09 Jul 2023
Sketch-A-Shape: Zero-Shot Sketch-to-3D Shape Generation
Sketch-A-Shape: Zero-Shot Sketch-to-3D Shape Generation
Aditya Sanghi
P. Jayaraman
Arianna Rampini
Joseph Lambourne
Hooman Shayani
Evan Atherton
Saeid Asgari Taghanaki
3DV
97
15
0
08 Jul 2023
MVDiffusion: Enabling Holistic Multi-view Image Generation with
  Correspondence-Aware Diffusion
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion
Shitao Tang
Fuyang Zhang
Jiacheng Chen
Peng Wang
Yasutaka Furukawa
160
158
0
03 Jul 2023
Hierarchical Neural Coding for Controllable CAD Model Generation
Hierarchical Neural Coding for Controllable CAD Model Generation
Xiang Xu
P. Jayaraman
Joseph G. Lambourne
Karl D. D. Willis
Yasutaka Furukawa
102
43
0
30 Jun 2023
BuildingsBench: A Large-Scale Dataset of 900K Buildings and Benchmark
  for Short-Term Load Forecasting
BuildingsBench: A Large-Scale Dataset of 900K Buildings and Benchmark for Short-Term Load Forecasting
Patrick Emami
A. Sahu
Peter Graf
AI4TS
129
15
0
30 Jun 2023
Symbol emergence as interpersonal cross-situational learning: the
  emergence of lexical knowledge with combinatoriality
Symbol emergence as interpersonal cross-situational learning: the emergence of lexical knowledge with combinatoriality
Y. Hagiwara
Kazuma Furukawa
Takafumi Horie
Akira Taniguchi
T. Taniguchi
72
0
0
27 Jun 2023
MotionGPT: Human Motion as a Foreign Language
MotionGPT: Human Motion as a Foreign Language
Biao Jiang
Xin Chen
Wen Liu
Jingyi Yu
Gang Yu
Tao Chen
MLLM
113
298
0
26 Jun 2023
Zero-shot spatial layout conditioning for text-to-image diffusion models
Zero-shot spatial layout conditioning for text-to-image diffusion models
Guillaume Couairon
Marlene Careil
Matthieu Cord
Stéphane Lathuilière
Jakob Verbeek
VLM
88
65
0
23 Jun 2023
Pushing the Limits of 3D Shape Generation at Scale
Pushing the Limits of 3D Shape Generation at Scale
Wang Yu
Xuelin Qian
Jingyang Huo
Tiejun Huang
Bo Zhao
Yanwei Fu
134
11
0
20 Jun 2023
A VAE Approach to Sample Multivariate Extremes
A VAE Approach to Sample Multivariate Extremes
N. Lafon
Philippe Naveau
Ronan Fablet
160
6
0
19 Jun 2023
Understanding Deep Generative Models with Generalized Empirical
  Likelihoods
Understanding Deep Generative Models with Generalized Empirical Likelihoods
Suman V. Ravuri
Mélanie Rey
S. Mohamed
M. Deisenroth
VLM
72
5
0
16 Jun 2023
Evaluating Data Attribution for Text-to-Image Models
Evaluating Data Attribution for Text-to-Image Models
Sheng-Yu Wang
Alexei A. Efros
Jun-Yan Zhu
Richard Y. Zhang
TDI
99
34
0
15 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large
  Language Models
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
98
7
0
14 Jun 2023
Unbiased Learning of Deep Generative Models with Structured Discrete
  Representations
Unbiased Learning of Deep Generative Models with Structured Discrete Representations
H. Bendekgey
Gabriel Hope
Erik B. Sudderth
OCLBDLDRL
66
1
0
14 Jun 2023
Discrete Graph Auto-Encoder
Discrete Graph Auto-Encoder
Yoann Boget
Magda Gregorova
Alexandros Kalousis
46
4
0
13 Jun 2023
Fast Diffusion Model
Fast Diffusion Model
Zike Wu
Pan Zhou
Kenji Kawaguchi
Hanwang Zhang
DiffM
97
22
0
12 Jun 2023
High-Fidelity Audio Compression with Improved RVQGAN
High-Fidelity Audio Compression with Improved RVQGAN
Rithesh Kumar
Prem Seetharaman
Alejandro Luebs
I. Kumar
Kundan Kumar
144
339
0
11 Jun 2023
HyP-NeRF: Learning Improved NeRF Priors using a HyperNetwork
HyP-NeRF: Learning Improved NeRF Priors using a HyperNetwork
Bipasha Sen
Gaurav Singh
Aditya Agarwal
Rohith Agaram
K. M. Krishna
Srinath Sridhar
AI4CE
114
14
0
09 Jun 2023
The Age of Synthetic Realities: Challenges and Opportunities
The Age of Synthetic Realities: Challenges and Opportunities
J. P. Cardenuto
Jing Yang
Rafael Padilha
Renjie Wan
Daniel Moreira
Haoliang Li
Shiqi Wang
Fernanda A. Andaló
Sébastien Marcel
Anderson de Rezende Rocha
DeLMO
124
30
0
09 Jun 2023
ADDP: Learning General Representations for Image Recognition and
  Generation with Alternating Denoising Diffusion Process
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process
Changyao Tian
Chenxin Tao
Jifeng Dai
Hao Li
Ziheng Li
Lewei Lu
Xiaogang Wang
Hongsheng Li
Gao Huang
Xizhou Zhu
DiffM
106
10
0
08 Jun 2023
Subject clustering by IF-PCA and several recent methods
Subject clustering by IF-PCA and several recent methods
Dieyi Chen
Jiashun Jin
Z. Ke
76
0
0
08 Jun 2023
Gradient-Informed Quality Diversity for the Illumination of Discrete
  Spaces
Gradient-Informed Quality Diversity for the Illumination of Discrete Spaces
Raphael Boige
Guillaume Richard
Jérémie Donà
Thomas Pierrot
Antoine Cully
99
6
0
08 Jun 2023
Designing a Better Asymmetric VQGAN for StableDiffusion
Designing a Better Asymmetric VQGAN for StableDiffusion
Zixin Zhu
Xuelu Feng
DongDong Chen
Jianmin Bao
Le Wang
Yinpeng Chen
Lu Yuan
Gang Hua
DiffM
101
35
0
07 Jun 2023
Coupled Variational Autoencoder
Coupled Variational Autoencoder
Xiaoran Hao
Patrick Shafto
BDLDRL
79
4
0
05 Jun 2023
Towards Learning Discrete Representations via Self-Supervision for
  Wearables-Based Human Activity Recognition
Towards Learning Discrete Representations via Self-Supervision for Wearables-Based Human Activity Recognition
H. Haresamudram
Irfan Essa
Thomas Ploetz
104
8
0
01 Jun 2023
Previous
123...91011...212223
Next