1711.00937
Neural Discrete Representation Learning
2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
Papers citing "Neural Discrete Representation Learning"
50 / 3,267 papers shown
Title
Textless Speech-to-Speech Translation on Real Data
Ann Lee
Hongyu Gong
Paul-Ambroise Duquenne
Holger Schwenk
Peng-Jen Chen
...
Sravya Popuri
Yossi Adi
J. Pino
Jiatao Gu
Wei-Ning Hsu
122
150
0
15 Dec 2021
Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias
Frederic Koehler
Viraj Mehta
Chenghui Zhou
Andrej Risteski
DRL
87
13
0
13 Dec 2021
Technical Language Supervision for Intelligent Fault Diagnosis in Process Industry
Karl Lowenmark
C. Taal
S. Schnabel
Marcus Liwicki
Fredrik Sandin
52
7
0
11 Dec 2021
Discrete neural representations for explainable anomaly detection
Stanislaw Szymanowicz
James Charles
R. Cipolla
AAML
AI4TS
FAtt
84
20
0
10 Dec 2021
Fully Context-Aware Image Inpainting with a Learned Semantic Pyramid
Wendong Zhang
Yunbo Wang
Bingbing Ni
Xiaokang Yang
90
14
0
08 Dec 2021
Emulating Spatio-Temporal Realizations of Three-Dimensional Isotropic Turbulence via Deep Sequence Learning Models
M. Momenifar
Enmao Diao
Vahid Tarokh
A. Bragg
AI4CE
42
4
0
07 Dec 2021
General Facial Representation Learning in a Visual-Linguistic Manner
Yinglin Zheng
Hao Yang
Ting Zhang
Jianmin Bao
Dongdong Chen
Yangyu Huang
Lu Yuan
Dong Chen
Ming Zeng
Fang Wen
CVBM
220
176
0
06 Dec 2021
Joint Learning of Localized Representations from Medical Images and Reports
Philipp Muller
Georgios Kaissis
Cong Zou
Daniel Munich
223
87
0
06 Dec 2021
Make It Move: Controllable Image-to-Video Generation with Text Descriptions
Yaosi Hu
Chong Luo
Zhenzhong Chen
VGen
78
89
0
06 Dec 2021
Conditional Deep Hierarchical Variational Autoencoder for Voice Conversion
K. Akuzawa
Kotaro Onishi
Keisuke Takiguchi
Kohki Mametani
K. Mori
BDL
DRL
78
7
0
06 Dec 2021
Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation
Minghui Hu
Yujie Wang
Tat-Jen Cham
Jianfei Yang
P. N. Suganthan
DiffM
63
43
0
03 Dec 2021
Video-Text Pre-training with Learned Regions
Rui Yan
Mike Zheng Shou
Yixiao Ge
Alex Jinpeng Wang
Xudong Lin
Guanyu Cai
Jinhui Tang
103
24
0
02 Dec 2021
Exploration into Translation-Equivariant Image Quantization
W. Shin
Gyubok Lee
Jiyoung Lee
Eun-Young Lyou
Joonseok Lee
Edward Choi
95
7
0
01 Dec 2021
The Exponentially Tilted Gaussian Prior for Variational Autoencoders
Griffin Floto
Stefan Kremer
Mihai Nica
DRL
44
1
0
30 Nov 2021
Diffusion Autoencoders: Toward a Meaningful and Decodable Representation
Konpat Preechakul
Nattanat Chatthee
Suttisak Wizadwongsa
Supasorn Suwajanakorn
SyDa
DiffM
131
434
0
30 Nov 2021
Synthetic weather radar using hybrid quantum-classical machine learning
Graham Enos
M. Reagor
Maxwell P. Henderson
Christina Young
K. Horton
Mandy Birch
C. Rigetti
43
10
0
30 Nov 2021
Deep Auto-encoder with Neural Response
Xuming Ran
Jie Zhang
Ziyu Ye
Haiyan Wu
Qi Xu
Huihui Zhou
Quanying Liu
63
7
0
30 Nov 2021
EdiBERT, a generative model for image editing
Thibaut Issenhuth
Ugo Tanielian
Jérémie Mary
David Picard
DiffM
104
12
0
30 Nov 2021
Vector Quantized Diffusion Model for Text-to-Image Synthesis
Shuyang Gu
Dong Chen
Jianmin Bao
Fang Wen
Bo Zhang
Dongdong Chen
Lu Yuan
B. Guo
DiffM
229
800
0
29 Nov 2021
Blended Diffusion for Text-driven Editing of Natural Images
Omri Avrahami
Dani Lischinski
Ohad Fried
DiffM
225
958
0
29 Nov 2021
Transfer Learning with Jukebox for Music Source Separation
W. Z. E. Amri
Oliver Tautz
Helge J. Ritter
Andrew Melnik
82
7
0
28 Nov 2021
Learning Physical Concepts in Cyber-Physical Systems: A Case Study
Henrik S. Steude
Alexander Windmann
Oliver Niggemann
AI4CE
72
1
0
28 Nov 2021
LAFITE: Towards Language-Free Training for Text-to-Image Generation
Yufan Zhou
Ruiyi Zhang
Changyou Chen
Chunyuan Li
Chris Tensmeyer
Tong Yu
Jiuxiang Gu
Jinhui Xu
Tong Sun
VLM
105
168
0
27 Nov 2021
Nonequilibrium Monte Carlo for unfreezing variables in hard combinatorial optimization
Masoud Mohseni
D. Eppens
J. Strümpfer
Raffaele Marino
Vasil S. Denchev
A. Ho
Sergei V. Isakov
Sergio Boixo
F. Ricci-Tersenghi
Hartmut Neven
51
20
0
26 Nov 2021
A model of semantic completion in generative episodic memory
Zahra Fayyaz
Aya Altamimi
Sen Cheng
Laurenz Wiskott
57
22
0
26 Nov 2021
Learning source-aware representations of music in a discrete latent space
Jinsung Kim
Yeong-Seok Jeong
Woosung Choi
Jaehwa Chung
Soonyoung Jung
BDL
DRL
50
0
0
26 Nov 2021
Uncertainty Aware Proposal Segmentation for Unknown Object Detection
Yimeng Li
Jana Kosecka
UQCV
111
19
0
25 Nov 2021
Layered Controllable Video Generation
Jiahui Huang
Yuhe Jin
K. M. Yi
Leonid Sigal
VGen
81
11
0
24 Nov 2021
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers
Xiaoyi Dong
Jianmin Bao
Ting Zhang
Dongdong Chen
Weiming Zhang
Lu Yuan
Dong Chen
Fang Wen
Nenghai Yu
Baining Guo
ViT
153
246
0
24 Nov 2021
Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes
Sam Bond-Taylor
P. Hessey
Hiroshi Sasaki
T. Breckon
Chris G. Willcocks
DiffM
126
72
0
24 Nov 2021
VIOLET : End-to-End Video-Language Transformers with Masked Visual-token Modeling
Tsu-Jui Fu
Linjie Li
Zhe Gan
Kevin Qinghong Lin
Wenjie Wang
Lijuan Wang
Zicheng Liu
VLM
154
221
0
24 Nov 2021
Non-Intrusive Binaural Speech Intelligibility Prediction from Discrete Latent Representations
Alex F. McKinney
Benjamin Cauchi
114
3
0
24 Nov 2021
Octree Transformer: Autoregressive 3D Shape Generation on Hierarchically Structured Sequences
Moritz Ibing
Gregor Kobsik
Leif Kobbelt
95
37
0
24 Nov 2021
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
Chenfei Wu
Jian Liang
Lei Ji
Fan Yang
Yuejian Fang
Daxin Jiang
Nan Duan
ViT
VGen
88
296
0
24 Nov 2021
Variational Learning for Unsupervised Knowledge Grounded Dialogs
Mayank Mishra
Dhiraj Madan
Gaurav Pandey
Danish Contractor
73
3
0
23 Nov 2021
Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance
Heeseung Kim
Sungwon Kim
Sungroh Yoon
DiffM
BDL
134
112
0
23 Nov 2021
Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks
Linus Ericsson
Henry Gouk
Timothy M. Hospedales
SSL
95
15
0
22 Nov 2021
L-Verse: Bidirectional Generation Between Image and Text
Taehoon Kim
Gwangmo Song
Sihaeng Lee
Sangyun Kim
Yewon Seo
Soonyoung Lee
S. Kim
Honglak Lee
Kyunghoon Bae
161
26
0
22 Nov 2021
Discrete Representations Strengthen Vision Transformer Robustness
Chengzhi Mao
Lu Jiang
Mostafa Dehghani
Carl Vondrick
Rahul Sukthankar
Irfan Essa
ViT
102
43
0
20 Nov 2021
SimMIM: A Simple Framework for Masked Image Modeling
Zhenda Xie
Zheng Zhang
Yue Cao
Yutong Lin
Jianmin Bao
Zhuliang Yao
Qi Dai
Han Hu
263
1,376
0
18 Nov 2021
Modular Networks Prevent Catastrophic Interference in Model-Based Multi-Task Reinforcement Learning
Robin Schiewer
Laurenz Wiskott
19
3
0
15 Nov 2021
Symbolic Music Loop Generation with VQ-VAE
Sangjun Han
H. Ihm
Woohyung Lim
MGen
61
1
0
15 Nov 2021
Textless Speech Emotion Conversion using Discrete and Decomposed Representations
Felix Kreuk
Adam Polyak
Jade Copet
Eugene Kharitonov
Tu Nguyen
M. Rivière
Wei-Ning Hsu
Abdel-rahman Mohamed
Emmanuel Dupoux
Yossi Adi
119
34
0
14 Nov 2021
Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion
Chao Xie
Yi-Chiao Wu
Patrick Lumban Tobing
Wen-Chin Huang
Tomoki Toda
59
11
0
13 Nov 2021
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
839
7,885
0
11 Nov 2021
Entropy optimized semi-supervised decomposed vector-quantized variational autoencoder model based on transfer learning for multiclass text classification and generation
Shivani Malhotra
Vinay Kumar
A. K. Agarwal
DRL
35
0
0
10 Nov 2021
Attention Approximates Sparse Distributed Memory
Trenton Bricken
Cengiz Pehlevan
93
35
0
10 Nov 2021
Learning from Multiple Time Series: A Deep Disentangled Approach to Diversified Time Series Forecasting
Ling-Hao Chen
Weiqiu Chen
Binqing Wu
Youdong Zhang
Bo Wen
Chenghu Yang
AI4TS
43
4
0
09 Nov 2021
Development of a robust cascaded architecture for intelligent robot grasping using limited labelled data
Priya Shukla
V. Kushwaha
G. C. Nandi
42
4
0
06 Nov 2021
WaveFake: A Data Set to Facilitate Audio Deepfake Detection
Joel Frank
Lea Schonherr
DiffM
204
131
0
04 Nov 2021