Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.00446
Cited By
Generating Diverse High-Fidelity Images with VQ-VAE-2
2 June 2019
Ali Razavi
Aaron van den Oord
Oriol Vinyals
DRL
BDL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Generating Diverse High-Fidelity Images with VQ-VAE-2"
50 / 1,128 papers shown
Title
Towards image compression with perfect realism at ultra-low bitrates
Marlene Careil
Matthew Muckley
Jakob Verbeek
Stéphane Lathuilière
DiffM
90
57
0
16 Oct 2023
Towards Open-World Co-Salient Object Detection with Generative Uncertainty-aware Group Selective Exchange-Masking
Yang Wu
Shenglong Hu
Huihui Song
Kaihua Zhang
Bo Liu
Dong Liu
78
0
0
16 Oct 2023
Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space
Hengrui Zhang
Jiani Zhang
Balasubramaniam Srinivasan
Zhengyuan Shen
Xiao Qin
Christos Faloutsos
Huzefa Rangwala
George Karypis
DiffM
111
111
0
14 Oct 2023
Unified High-binding Watermark for Unconditional Image Generation Models
Ruinan Ma
Yu-an Tan
Shangbo Wu
Tian Chen
Yajie Wang
Yuan-zhang Li
AAML
DiffM
WIGM
79
1
0
14 Oct 2023
MAC: ModAlity Calibration for Object Detection
Yutian Lei
Jun Liu
Dong Huang
ObjD
54
1
0
14 Oct 2023
LL-VQ-VAE: Learnable Lattice Vector-Quantization For Efficient Representations
Ahmed Khalil
Robert Piechocki
Raúl Santos-Rodríguez
59
2
0
13 Oct 2023
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
Yueming Lyu
Kang Zhao
Bo Peng
H. Chen
Yue Jiang
Yingya Zhang
Jing Dong
Caifeng Shan
85
2
0
12 Oct 2023
Improving Compositional Text-to-image Generation with Large Vision-Language Models
Song Wen
Guian Fang
Renrui Zhang
Peng Gao
Hao Dong
Dimitris N. Metaxas
92
18
0
10 Oct 2023
EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders
Gulcin Baykal
M. Kandemir
Gözde B. Ünal
71
13
0
09 Oct 2023
Locality-Aware Generalizable Implicit Neural Representation
Doyup Lee
Chiheon Kim
Minsu Cho
Wook-Shin Han
89
12
0
09 Oct 2023
Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers
Shiyue Cao
Yueqin Yin
Lianghua Huang
Yu Liu
Xin Zhao
Deli Zhao
Kaiqi Huang
ViT
104
19
0
09 Oct 2023
Learning Energy-Based Prior Model with Diffusion-Amortized MCMC
Peiyu Yu
Y. Zhu
Sirui Xie
Xiaojian Ma
Ruiqi Gao
Song-Chun Zhu
Ying Nian Wu
DiffM
87
13
0
05 Oct 2023
Soft Convex Quantization: Revisiting Vector Quantization with Convex Optimization
Tanmay Gautam
Reid Pryzant
Ziyi Yang
Chenguang Zhu
Somayeh Sojoudi
MQ
58
4
0
04 Oct 2023
Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models
Huaijin Pi
Sida Peng
Minghui Yang
Xiaowei Zhou
Hujun Bao
DiffM
114
36
0
03 Oct 2023
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Tangjie Lv
Changjie Fan
Zhipeng Hu
114
32
0
03 Oct 2023
Generating 3D Brain Tumor Regions in MRI using Vector-Quantization Generative Adversarial Networks
Meng Zhou
Matthias W. Wagner
U. Tabori
C. Hawkins
B. Ertl-Wagner
Farzad Khalvati
MedIm
117
5
0
02 Oct 2023
A Comprehensive Review of Generative AI in Healthcare
Yasin Shokrollahi
Sahar Yarmohammadtoosky
Matthew M. Nikahd
Pengfei Dong
Xianqi Li
Linxia Gu
MedIm
AI4CE
99
20
0
01 Oct 2023
Transformer-VQ: Linear-Time Transformers via Vector Quantization
Albert Mohwald
115
17
0
28 Sep 2023
Identity-preserving Editing of Multiple Facial Attributes by Learning Global Edit Directions and Local Adjustments
Najmeh Mohammadbagheri
Fardin Ayar
A. Nickabadi
R. Safabakhsh
CVBM
GAN
75
4
0
25 Sep 2023
GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER
Mingzhen Sun
Weining Wang
Zihan Qin
Jiahui Sun
Si-Qing Chen
Qingbin Liu
DiffM
71
3
0
23 Sep 2023
How to train your VAE
Mariano Rivera
DRL
55
1
0
22 Sep 2023
Attentive VQ-VAE
A. Hoyos
Mariano Rivera
72
0
0
20 Sep 2023
Tree-Structured Shading Decomposition
Chen Geng
Hong-Xing Yu
Sharon Zhang
Maneesh Agrawala
Jiajun Wu
78
2
0
13 Sep 2023
Adapt and Diffuse: Sample-adaptive Reconstruction via Latent Diffusion Models
Zalan Fabian
Berk Tınaz
Mahdi Soltanolkotabi
DiffM
103
6
0
12 Sep 2023
NExT-GPT: Any-to-Any Multimodal LLM
Shengqiong Wu
Hao Fei
Leigang Qu
Wei Ji
Tat-Seng Chua
MLLM
131
507
0
11 Sep 2023
DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data Augmentation
Z. Zang
Hao Luo
Kaidi Wang
Panpan Zhang
F. Wang
Stan. Z Li
Yang You
100
5
0
10 Sep 2023
MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask
Yupeng Zhou
Daquan Zhou
Zuo-Liang Zhu
Yaxing Wang
Qibin Hou
Jiashi Feng
80
12
0
08 Sep 2023
A Two-Stage Training Framework for Joint Speech Compression and Enhancement
Jiayi Huang
Zeyu Yan
Wenbin Jiang
Fei Wen
71
1
0
08 Sep 2023
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
Zigang Geng
Binxin Yang
Tiankai Hang
Chen Li
Shuyang Gu
...
Jianmin Bao
Zheng Zhang
Han Hu
DongDong Chen
Baining Guo
DiffM
VLM
131
107
0
07 Sep 2023
Addressing the Blind Spots in Spoken Language Processing
Amit Moryossef
66
0
0
06 Sep 2023
EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation
Nikolai Korber
Eduard Kromer
Andreas Siebert
S. Hauke
Daniel Mueller-Gritschneder
Björn Schuller
DiffM
VLM
96
5
0
06 Sep 2023
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
L. Yu
Bowen Shi
Ramakanth Pasunuru
Benjamin Muller
O. Yu. Golovneva
...
Yaniv Taigman
Maryam Fazel-Zarandi
Asli Celikyilmaz
Luke Zettlemoyer
Armen Aghajanyan
MLLM
116
142
0
05 Sep 2023
Probabilistic Precision and Recall Towards Reliable Evaluation of Generative Models
Dogyun Park
Suhyun Kim
EGVM
58
4
0
04 Sep 2023
Neural Vector Fields: Generalizing Distance Vector Fields by Codebooks and Zero-Curl Regularization
Xianghui Yang
Guosheng Lin
Zhenghao Chen
Luping Zhou
108
2
0
04 Sep 2023
DiverseMotion: Towards Diverse Human Motion Generation via Discrete Diffusion
Yunhong Lou
Linchao Zhu
Yaxiong Wang
Xiaohan Wang
Yezhou Yang
DiffM
80
27
0
04 Sep 2023
Diffusion Models with Deterministic Normalizing Flow Priors
Mohsen Zand
Ali Etemad
Michael A. Greenspan
DiffM
161
3
0
03 Sep 2023
Few shot font generation via transferring similarity guided global style and quantization local style
Wei Pan
Anna Zhu
Xinyu Zhou
Brian Kenji Iwana
Shilin Li
81
13
0
02 Sep 2023
CityDreamer: Compositional Generative Model of Unbounded 3D Cities
Haozhe Xie
Zhaoxi Chen
Fangzhou Hong
Ziwei Liu
156
43
0
01 Sep 2023
QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning
Haohan Guo
Fenglong Xie
Jiawen Kang
Yujia Xiao
Xixin Wu
Helen M. Meng
97
3
0
31 Aug 2023
StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation
Yuhan Wang
Liming Jiang
Chen Change Loy
VGen
101
15
0
31 Aug 2023
Ten Years of Generative Adversarial Nets (GANs): A survey of the state-of-the-art
Tanujit Chakraborty
Ujjwal Reddy K S
Shraddha M. Naik
Madhurima Panja
B. Manvitha
129
75
0
30 Aug 2023
MSFlow: Multi-Scale Flow-based Framework for Unsupervised Anomaly Detection
Yixuan Zhou
Xing Xu
Jingkuan Song
Fumin Shen
Hengtao Shen
AI4CE
123
22
0
29 Aug 2023
C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model
Longbin Ji
Pengfei Wei
Yi Ren
Jinglin Liu
Chen Zhang
Xiang Yin
DiffM
72
3
0
29 Aug 2023
MEMORY-VQ: Compression for Tractable Internet-Scale Memory
Yury Zemlyanskiy
Michiel de Jong
Luke Vilnis
Santiago Ontañón
William W. Cohen
Sumit Sanghai
Joshua Ainslie
RALM
MQ
82
0
0
28 Aug 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
Jing Liu
303
31
0
27 Aug 2023
VQ-Font: Few-Shot Font Generation with Structure-Aware Enhancement and Quantization
Mingshuai Yao
Yabo Zhang
Xianhui Lin
Xiaoming Li
W. Zuo
58
10
0
27 Aug 2023
Arbitrary Distributions Mapping via SyMOT-Flow: A Flow-based Approach Integrating Maximum Mean Discrepancy and Optimal Transport
Zhe Xiong
Qiaoqiao Ding
Xiaoqun Zhang
OOD
116
0
0
26 Aug 2023
A Survey of AI Music Generation Tools and Models
Yueyue Zhu
Jared Baca
Banafsheh Rekabdar
Reza Rawassizadeh
MGen
115
18
0
24 Aug 2023
High-quality Image Dehazing with Diffusion Model
Huikang Yu
Jie Huang
Kai Zheng
Fengmei Zhao
DiffM
89
13
0
23 Aug 2023
Efficient Transfer Learning in Diffusion Models via Adversarial Noise
Xiyu Wang
Baijiong Lin
Daochang Liu
Chang Xu
DiffM
99
3
0
23 Aug 2023
Previous
1
2
3
...
8
9
10
...
21
22
23
Next