1906.00446
Cited By
Generating Diverse High-Fidelity Images with VQ-VAE-2
2 June 2019
Ali Razavi
Aaron van den Oord
Oriol Vinyals
DRL
BDL
Papers citing
"Generating Diverse High-Fidelity Images with VQ-VAE-2"
50 / 1,107 papers shown
Title
A Demographic-Conditioned Variational Autoencoder for fMRI Distribution Sampling and Removal of Confounds
Anton Orlichenko
Gang Qu
Ziyu Zhou
Anqi Liu
Hong-Wen Deng
Zhengming Ding
Julia M. Stephen
Tony W. Wilson
Vince D. Calhoun
Yu-Ping Wang
22
0
0
13 May 2024
Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data
Mahdi Morafah
M. Reisser
Bill Lin
Christos Louizos
FedML
34
5
0
13 May 2024
Generating Human Motion in 3D Scenes from Text Descriptions
Zhi Cen
Huaijin Pi
Sida Peng
Zehong Shen
Minghui Yang
Shuai Zhu
Hujun Bao
Xiaowei Zhou
55
19
0
13 May 2024
MAxPrototyper: A Multi-Agent Generation System for Interactive User Interface Prototyping
Mingyue Yuan
Jieshan Chen
Aaron Quigley
LLMAG
51
5
0
12 May 2024
Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation
Shengyuan Liu
Bo Wang
Ye Ma
Te Yang
Xipeng Cao
Quan Chen
Han Li
Di Dong
Peng Jiang
EGVM
44
2
0
11 May 2024
Controllable Image Generation With Composed Parallel Token Prediction
Jamie Stirling
Noura Al-Moubayed
38
0
0
10 May 2024
Detecting music deepfakes is easy but actually hard
Darius Afchar
Gabriel Meseguer-Brocal
Romain Hennequin
63
6
0
07 May 2024
MVDiff: Scalable and Flexible Multi-View Diffusion for 3D Object Reconstruction from Single-View
Emmanuelle Bourigault
Pauline Bourigault
37
2
0
06 May 2024
Generated Contents Enrichment
Mahdi Naseri
Jiayan Qiu
Zhou Wang
37
0
0
06 May 2024
Towards Real-world Video Face Restoration: A New Benchmark
Ziyan Chen
Jingwen He
Xinqi Lin
Yu Qiao
Chao Dong
48
4
0
30 Apr 2024
Assessing Image Quality Using a Simple Generative Representation
Simon Raviv
Gal Chechik
55
0
0
28 Apr 2024
TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation
Sai Kumar Dwivedi
Yu Sun
Priyanka Patel
Yao Feng
Michael J. Black
3DH
46
28
0
25 Apr 2024
HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression
Lei Lu
Yanyue Xie
Wei Jiang
Wei Wang
Xue Lin
Yanzhi Wang
45
4
0
20 Apr 2024
Lazy Diffusion Transformer for Interactive Image Editing
Yotam Nitzan
Zongze Wu
Richard Zhang
Eli Shechtman
Daniel Cohen-Or
Taesung Park
Michael Gharbi
43
9
0
18 Apr 2024
MIDGET: Music Conditioned 3D Dance Generation
Jinwu Wang
Wei Mao
Miaomiao Liu
40
0
0
18 Apr 2024
Large Language Models: From Notes to Musical Form
Lilac Atassi
35
0
0
18 Apr 2024
Octopus v3: Technical Report for On-device Sub-billion Multimodal AI Agent
Wei Chen
Zhiyuan Li
LLMAG
30
5
0
17 Apr 2024
Closely Interactive Human Reconstruction with Proxemics and Physics-Guided Adaption
Buzhen Huang
Chen Li
Chongyang Xu
Liang Pan
Yangang Wang
Gim Hee Lee
33
5
0
17 Apr 2024
Personalized Heart Disease Detection via ECG Digital Twin Generation
Yaojun Hu
Jintai Chen
Lianting Hu
Dantong Li
Jiahuan Yan
Haochao Ying
Huiying Liang
Jian Wu
36
4
0
17 Apr 2024
MaSkel: A Model for Human Whole-body X-rays Generation from Human Masking Images
Yingjie Xi
Boyuan Cheng
Jingyao Cai
Jian Jun Zhang
Xiaosong Yang
MedIm
42
0
0
13 Apr 2024
Adapting LLaMA Decoder to Vision Transformer
Jiahao Wang
Wenqi Shao
Yonghong Tian
Chengyue Wu
Yong Liu
Taiqiang Wu
Kaipeng Zhang
Songyang Zhang
Kai-xiang Chen
Ping Luo
MLLM
40
4
0
10 Apr 2024
Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance
Dazhong Shen
Guanglu Song
Zeyue Xue
Fu-Yun Wang
Yu Liu
DiffM
38
11
0
08 Apr 2024
Gull: A Generative Multifunctional Audio Codec
Yi Luo
Jianwei Yu
Hangting Chen
Rongzhi Gu
Chao Weng
AuLLM
46
3
0
07 Apr 2024
Do We Really Need a Complex Agent System? Distill Embodied Agent into a Single Model
Zhonghan Zhao
Ke Ma
Wenhao Chai
Xuan Wang
Kewei Chen
Dongxu Guo
Yanting Zhang
Hongwei Wang
Gaoang Wang
45
16
0
06 Apr 2024
SemGrasp: Semantic Grasp Generation via Language Aligned Discretization
Kailin Li
Jingbo Wang
Lixin Yang
Cewu Lu
Bo Dai
48
16
0
04 Apr 2024
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
Keyu Tian
Yi-Xin Jiang
Zehuan Yuan
Bingyue Peng
Liwei Wang
VGen
55
260
0
03 Apr 2024
CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech
Jaehyeon Kim
Keon Lee
Seungjun Chung
Jaewoong Cho
74
41
0
03 Apr 2024
MotionChain: Conversational Motion Controllers via Multimodal Prompts
Biao Jiang
Xin Chen
C. Zhang
Fukun Yin
Zhuoyuan Li
Gang Yu
Jiayuan Fan
VGen
LRM
35
10
0
02 Apr 2024
Variational Autoencoders for exteroceptive perception in reinforcement learning-based collision avoidance
T. N. Larsen
Eirik Runde Barlaug
Adil Rasheed
DRL
28
1
0
31 Mar 2024
Transformer based Pluralistic Image Completion with Reduced Information Loss
Qiankun Liu
Yuqi Jiang
Zhentao Tan
DongDong Chen
Ying Fu
Qi Chu
Gang Hua
Nenghai Yu
ViT
73
11
0
31 Mar 2024
Towards Variable and Coordinated Holistic Co-Speech Motion Generation
Yifei Liu
Qiong Cao
Yandong Wen
Huaiguang Jiang
Changxing Ding
SLR
71
14
0
30 Mar 2024
Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting
Haiwei Chen
Yajie Zhao
DiffM
24
2
0
27 Mar 2024
Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery
Siddharth Tourani
Ahmed Alwheibi
Arif Mahmood
Muhammad Haris Khan
DiffM
46
1
0
24 Mar 2024
BEND: Bagging Deep Learning Training Based on Efficient Neural Network Diffusion
Jia Wei
Xingjun Zhang
Witold Pedrycz
DiffM
26
0
0
23 Mar 2024
Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting
Alicia Durrer
J. Wolleb
Florentin Bieder
Paul Friedrich
L. Melie-García
...
Ozgur Yaldizli
Cristina Granziera
Bjoern H. Menze
Philippe C. Cattin
Florian Kofler
MedIm
21
6
0
21 Mar 2024
Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model
Jiajie Yang
40
0
0
19 Mar 2024
SC-Diff: 3D Shape Completion with Latent Diffusion Models
Juan D. Galvis
Xingxing Zuo
Simon Schaefer
Stefan Leutenegger
DiffM
44
3
0
19 Mar 2024
GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning
Xiaojie Li
Yibo Yang
Hefei Ling
Jianlong Wu
Yue Yu
Guohao Li
Min Zhang
SSL
36
6
0
18 Mar 2024
Artifact Feature Purification for Cross-domain Detection of AI-generated Images
Zheling Meng
Bo Peng
Jing Dong
Tieniu Tan
89
4
0
17 Mar 2024
Boosting Flow-based Generative Super-Resolution Models via Learned Prior
Li-Yuan Tsao
Yi-Chen Lo
Chia-Che Chang
Hao-Wei Chen
Roy Tseng
Chien Feng
Chun-Yi Lee
SupR
34
4
0
16 Mar 2024
Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling
Baoquan Zhang
Huaibin Wang
Chuyao Luo
Xutao Li
Guotao Liang
Yunming Ye
Xiaochen Qi
Yao He
40
11
0
15 Mar 2024
UniCode: Learning a Unified Codebook for Multimodal Large Language Models
Sipeng Zheng
Bohan Zhou
Yicheng Feng
Ye Wang
Zongqing Lu
VLM
MLLM
46
7
0
14 Mar 2024
Beyond Text: Frozen Large Language Models in Visual Signal Comprehension
Lei Zhu
Fangyun Wei
Yanye Lu
MLLM
VLM
52
18
0
12 Mar 2024
Discriminative Probing and Tuning for Text-to-Image Generation
Leigang Qu
Wenjie Wang
Yongqi Li
Hanwang Zhang
Liqiang Nie
Tat-Seng Chua
46
7
0
07 Mar 2024
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference
Benjamin Eysenbach
Vivek Myers
Ruslan Salakhutdinov
Sergey Levine
AI4TS
43
8
0
06 Mar 2024
Shapley Values-Powered Framework for Fair Reward Split in Content Produced by GenAI
Alex Glinsky
Alexey Sokolsky
35
0
0
05 Mar 2024
Large Convolutional Model Tuning via Filter Subspace
Wei Chen
Zichen Miao
Qiang Qiu
59
3
0
01 Mar 2024
Unified Generation, Reconstruction, and Representation: Generalized Diffusion with Adaptive Latent Encoding-Decoding
Guangyi Liu
Yu Wang
Zeyu Feng
Qiyu Wu
Liping Tang
...
Shuguang Cui
Julian McAuley
Zichao Yang
Eric P. Xing
Zhiting Hu
DiffM
80
3
0
29 Feb 2024
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Hany Hamed
Subin Kim
Dongyeong Kim
Jaesik Yoon
Sungjin Ahn
47
4
0
29 Feb 2024
Latent Neural PDE Solver: a reduced-order modelling framework for partial differential equations
Zijie Li
Saurabh Patil
Francis Ogoke
Dule Shu
Wilson Zhen
Michael Schneier
John R. Buchanan
A. Farimani
AI4CE
45
5
0
27 Feb 2024