ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.00446
  4. Cited By
Generating Diverse High-Fidelity Images with VQ-VAE-2

Generating Diverse High-Fidelity Images with VQ-VAE-2

2 June 2019
Ali Razavi
Aaron van den Oord
Oriol Vinyals
    DRLBDL
ArXiv (abs)PDFHTML

Papers citing "Generating Diverse High-Fidelity Images with VQ-VAE-2"

50 / 1,128 papers shown
Title
Harnessing the Spatial-Temporal Attention of Diffusion Models for
  High-Fidelity Text-to-Image Synthesis
Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
Qiucheng Wu
Yujian Liu
Handong Zhao
T. Bui
Zhe Lin
Yang Zhang
Shiyu Chang
DiffM
97
46
0
07 Apr 2023
Diffusion Models as Masked Autoencoders
Diffusion Models as Masked Autoencoders
Chen Wei
K. Mangalam
Po-Yao (Bernie) Huang
Yanghao Li
Haoqi Fan
Hu Xu
Huiyu Wang
Cihang Xie
Alan Yuille
Christoph Feichtenhofer
DiffMSyDa
102
53
0
06 Apr 2023
DITTO-NeRF: Diffusion-based Iterative Text To Omni-directional 3D Model
DITTO-NeRF: Diffusion-based Iterative Text To Omni-directional 3D Model
H. Seo
Hayeon Kim
Gwanghyun Kim
S. Chun
DiffM
120
43
0
06 Apr 2023
GINA-3D: Learning to Generate Implicit Neural Assets in the Wild
GINA-3D: Learning to Generate Implicit Neural Assets in the Wild
Bokui Shen
Xinchen Yan
C. Qi
Mahyar Najibi
Boyang Deng
Leonidas Guibas
Yin Zhou
Drago Anguelov
3DV
103
21
0
04 Apr 2023
Text-Conditioned Sampling Framework for Text-to-Image Generation with
  Masked Generative Models
Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models
Jaewoong Lee
Sang-Sub Jang
Jaehyeong Jo
Jaehong Yoon
Yunji Kim
Jin-Hwa Kim
Jung-Woo Ha
Sung Ju Hwang
DiffM
87
4
0
04 Apr 2023
Sounding Video Generator: A Unified Framework for Text-guided Sounding
  Video Generation
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation
Jiawei Liu
Weining Wang
Sihan Chen
Xinxin Zhu
Qingbin Liu
DiffMVGen
84
14
0
29 Mar 2023
Implicit Diffusion Models for Continuous Super-Resolution
Implicit Diffusion Models for Continuous Super-Resolution
Sicheng Gao
Xuhui Liu
Bo-Wen Zeng
Sheng Xu
Yanjing Li
Xiaonan Luo
Jianzhuang Liu
Xiantong Zhen
Baochang Zhang
DiffM
120
232
0
29 Mar 2023
Object Discovery from Motion-Guided Tokens
Object Discovery from Motion-Guided Tokens
Zhipeng Bao
P. Tokmakov
Yu-Xiong Wang
Adrien Gaidon
M. Hebert
OCL
100
23
0
27 Mar 2023
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
Pierre Fernandez
Guillaume Couairon
Hervé Jégou
Matthijs Douze
Teddy Furon
WIGM
137
198
0
27 Mar 2023
Learning Generative Models with Goal-conditioned Reinforcement Learning
Learning Generative Models with Goal-conditioned Reinforcement Learning
Mariana Vargas Vieyra
Pierre Ménard
GAN
31
0
0
26 Mar 2023
Learning Versatile 3D Shape Generation with Improved AR Models
Learning Versatile 3D Shape Generation with Improved AR Models
Simian Luo
Xuelin Qian
Yanwei Fu
Yinda Zhang
Ying Tai
Zhenyu Zhang
Chengjie Wang
Xiangyang Xue
108
3
0
26 Mar 2023
MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer
MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer
Shanghua Gao
Pan Zhou
Mingg-Ming Cheng
Shuicheng Yan
DiffM
242
171
0
25 Mar 2023
Variational Inference for Longitudinal Data Using Normalizing Flows
Variational Inference for Longitudinal Data Using Normalizing Flows
Clément Chadebec
S. Allassonnière
BDLDRL
69
1
0
24 Mar 2023
DiffuScene: Denoising Diffusion Models for Generative Indoor Scene
  Synthesis
DiffuScene: Denoising Diffusion Models for Generative Indoor Scene Synthesis
Jiapeng Tang
Yinyu Nie
Lev Markhasin
Angela Dai
Justus Thies
Matthias Nießner
DiffM
126
45
0
24 Mar 2023
High Fidelity Image Synthesis With Deep VAEs In Latent Space
High Fidelity Image Synthesis With Deep VAEs In Latent Space
Troy Luhman
Eric Luhman
DRL3DV
75
7
0
23 Mar 2023
Continuous Indeterminate Probability Neural Network
Continuous Indeterminate Probability Neural Network
T.A. Yang
BDL
65
0
0
23 Mar 2023
Posthoc Interpretation via Quantization
Posthoc Interpretation via Quantization
Francesco Paissan
Cem Subakan
Mirco Ravanelli
MQ
110
7
0
22 Mar 2023
LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
K. Pnvr
Bharat Singh
P. Ghosh
Behjat Siddiquie
David Jacobs
DiffM
91
29
0
22 Mar 2023
Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models
Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models
Lukas Höllein
Ang Cao
Andrew Owens
Justin Johnson
Matthias Nießner
DiffM
137
183
0
21 Mar 2023
Information-containing Adversarial Perturbation for Combating Facial
  Manipulation Systems
Information-containing Adversarial Perturbation for Combating Facial Manipulation Systems
Yao Zhu
YueFeng Chen
Xiaodan Li
Rong Zhang
Xiang Tian
Bo Zheng
Yao-wu Chen
AAML
109
11
0
21 Mar 2023
FullFormer: Generating Shapes Inside Shapes
FullFormer: Generating Shapes Inside Shapes
Tejaswini Medi
Jawad Tayyub
M. Sarmad
Frank Lindseth
Margret Keuper
63
1
0
20 Mar 2023
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
Peng Jin
Hao Li
Ze-Long Cheng
Kehan Li
Xiang Ji
Chang-rui Liu
Li-ming Yuan
Jie Chen
DiffMVGen
100
58
0
17 Mar 2023
Learning Data-Driven Vector-Quantized Degradation Model for Animation
  Video Super-Resolution
Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution
Zixi Tuo
Huan Yang
Jianlong Fu
Yujie Dun
Xueming Qian
VGen
64
3
0
17 Mar 2023
StylerDALLE: Language-Guided Style Transfer Using a Vector-Quantized
  Tokenizer of a Large-Scale Generative Model
StylerDALLE: Language-Guided Style Transfer Using a Vector-Quantized Tokenizer of a Large-Scale Generative Model
Zipeng Xu
E. Sangineto
N. Sebe
DiffM
90
13
0
16 Mar 2023
DR2: Diffusion-based Robust Degradation Remover for Blind Face
  Restoration
DR2: Diffusion-based Robust Degradation Remover for Blind Face Restoration
Zhixin Wang
Xiaoyun Zhang
Ziying Zhang
Huangjie Zheng
Mingyuan Zhou
Ya Zhang
Yanfeng Wang
DiffM
88
75
0
13 Mar 2023
Regularized Vector Quantization for Tokenized Image Synthesis
Regularized Vector Quantization for Tokenized Image Synthesis
Jiahui Zhang
Fangneng Zhan
Christian Theobalt
Shijian Lu
DiffMMQ
104
33
0
11 Mar 2023
TrojDiff: Trojan Attacks on Diffusion Models with Diverse Targets
TrojDiff: Trojan Attacks on Diffusion Models with Diverse Targets
Weixin Chen
Basel Alomair
Yue Liu
DiffM
119
81
0
10 Mar 2023
Neural Vector Fields: Implicit Representation by Explicit Learning
Neural Vector Fields: Implicit Representation by Explicit Learning
Xianghui Yang
Guosheng Lin
Zhenghao Chen
Luping Zhou
AI4CE
111
18
0
08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of
  Generative AI from GAN to ChatGPT
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
122
554
0
07 Mar 2023
Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding
Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding
Jiacheng Li
Longhui Wei
Zongyuan Zhan
Xinfu He
Siliang Tang
Qi Tian
Yueting Zhuang
61
4
0
07 Mar 2023
MOSO: Decomposing MOtion, Scene and Object for Video Prediction
MOSO: Decomposing MOtion, Scene and Object for Video Prediction
M. Sun
Weining Wang
Xinxin Zhu
Jing Liu
102
14
0
07 Mar 2023
Sketch-based Medical Image Retrieval
Sketch-based Medical Image Retrieval
Kazuma Kobayashi
Lin Gu
Ryuichiro Hataya
T. Mizuno
M. Miyake
...
Nobuji Kouno
Amina Bolatkan
Y. Kurose
Tatsuya Harada
Ryuji Hamamoto
73
8
0
07 Mar 2023
FoundationTTS: Text-to-Speech for ASR Customization with Generative
  Language Model
FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model
Rui Xue
Yanqing Liu
Lei He
Xuejiao Tan
Linquan Liu
Ed Lin
Sheng Zhao
120
7
0
06 Mar 2023
Synthetic ECG Signal Generation using Probabilistic Diffusion Models
Synthetic ECG Signal Generation using Probabilistic Diffusion Models
Edmond Adib
Amanda Fernandez
Fatemeh Afghah
John J. Prevost
DiffM
108
44
0
04 Mar 2023
Co-Speech Gesture Synthesis using Discrete Gesture Token Learning
Co-Speech Gesture Synthesis using Discrete Gesture Token Learning
Shuhong Lu
Youngwoo Yoon
Andrew W. Feng
SLR
99
12
0
04 Mar 2023
StraIT: Non-autoregressive Generation with Stratified Image Transformer
StraIT: Non-autoregressive Generation with Stratified Image Transformer
Shengju Qian
Huiwen Chang
Yuanzhen Li
Zizhao Zhang
Jiaya Jia
Han Zhang
124
12
0
01 Mar 2023
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge
  Collaborative AutoML System
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System
Chao Xue
Wen Liu
Shunxing Xie
Zhenfang Wang
Jiaxing Li
...
Shi-Yong Chen
Yibing Zhan
Jing Zhang
Chaoyue Wang
Dacheng Tao
112
2
0
01 Mar 2023
TextIR: A Simple Framework for Text-based Editable Image Restoration
TextIR: A Simple Framework for Text-based Editable Image Restoration
Yun-Hao Bai
Cairong Wang
Shuzhao Xie
Chao Dong
Chun Yuan
Zhi Wang
DiffM
128
15
0
28 Feb 2023
Hierarchical Reinforcement Learning in Complex 3D Environments
Hierarchical Reinforcement Learning in Complex 3D Environments
Bernardo Avila-Pires
Feryal M. P. Behbahani
Hubert Soyer
Kyriacos Nikiforou
Thomas Keck
Satinder Singh
OffRL
90
0
0
28 Feb 2023
Improving Model Generalization by On-manifold Adversarial Augmentation
  in the Frequency Domain
Improving Model Generalization by On-manifold Adversarial Augmentation in the Frequency Domain
Chang-rui Liu
Wenzhao Xiang
Yuan He
H. Xue
Shibao Zheng
Hang Su
83
4
0
28 Feb 2023
Denoising diffusion algorithm for inverse design of microstructures with
  fine-tuned nonlinear material properties
Denoising diffusion algorithm for inverse design of microstructures with fine-tuned nonlinear material properties
Nikolaos N. Vlassis
WaiChing Sun
AI4CEDiffM
126
51
0
24 Feb 2023
Streamlining Multimodal Data Fusion in Wireless Communication and Sensor
  Networks
Streamlining Multimodal Data Fusion in Wireless Communication and Sensor Networks
M. J. Bocus
Xiaoyang Wang
Robert Piechocki
58
1
0
24 Feb 2023
Learning Manifold Dimensions with Conditional Variational Autoencoders
Learning Manifold Dimensions with Conditional Variational Autoencoders
Yijia Zheng
Tong He
Yixuan Qiu
David Wipf
DRL
106
22
0
23 Feb 2023
Entity-Level Text-Guided Image Manipulation
Entity-Level Text-Guided Image Manipulation
Yikai Wang
Jianan Wang
Guansong Lu
Hang Xu
Zhenguo Li
Wei Zhang
Yanwei Fu
VGen
78
3
0
22 Feb 2023
Speech Enhancement with Multi-granularity Vector Quantization
Speech Enhancement with Multi-granularity Vector Quantization
Xiaokang Zhao
Qiu-shi Zhu
Jie Zhang
69
0
0
16 Feb 2023
A Review of Uncertainty Estimation and its Application in Medical
  Imaging
A Review of Uncertainty Estimation and its Application in Medical Imaging
K. Zou
Zhihao Chen
Xuedong Yuan
Xiaojing Shen
Meng Wang
Huazhu Fu
UQCV
134
92
0
16 Feb 2023
Understanding the Distillation Process from Deep Generative Models to
  Tractable Probabilistic Circuits
Understanding the Distillation Process from Deep Generative Models to Tractable Probabilistic Circuits
Xuejie Liu
Hoang Trung-Dung
Guy Van den Broeck
Yitao Liang
TPM
96
14
0
16 Feb 2023
Self-Organising Neural Discrete Representation Learning à la Kohonen
Self-Organising Neural Discrete Representation Learning à la Kohonen
Kazuki Irie
Róbert Csordás
Jürgen Schmidhuber
SSL
91
1
0
15 Feb 2023
Vector Quantized Wasserstein Auto-Encoder
Vector Quantized Wasserstein Auto-Encoder
Tung-Long Vuong
Trung Le
He Zhao
Chuanxia Zheng
Mehrtash Harandi
Jianfei Cai
Dinh Q. Phung
DRL
73
20
0
12 Feb 2023
Trading Information between Latents in Hierarchical Variational
  Autoencoders
Trading Information between Latents in Hierarchical Variational Autoencoders
Tim Z. Xiao
Robert Bamler
DRLBDL
52
6
0
09 Feb 2023
Previous
123...111213...212223
Next