Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1711.00937
Cited By
v1
v2 (latest)
Neural Discrete Representation Learning
2 November 2017
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neural Discrete Representation Learning"
50 / 3,267 papers shown
Title
End-to-End Text-to-Speech using Latent Duration based on VQ-VAE
Yusuke Yasuda
Xin Wang
Junichi Yamagishi
75
17
0
19 Oct 2020
CLAR: Contrastive Learning of Auditory Representations
Haider Al-Tahan
Y. Mohsenzadeh
SSL
198
56
0
19 Oct 2020
Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders
Masha Itkina
Boris Ivanovic
Ransalu Senanayake
Mykel J. Kochenderfer
Marco Pavone
128
18
0
19 Oct 2020
CQ-VAE: Coordinate Quantized VAE for Uncertainty Estimation with Application to Disk Shape Analysis from Lumbar Spine MRI Images
Linchen Qian
Jiasong Chen
Timur Urakov
Weiyong Gu
Liang Liang
25
3
0
17 Oct 2020
Latent Vector Recovery of Audio GANs
Andrew Keyes
N. Bayat
Vahid Reza Khazaie
Y. Mohsenzadeh
13
3
0
16 Oct 2020
The NeteaseGames System for Voice Conversion Challenge 2020 with Vector-quantization Variational Autoencoder and WaveNet
Haitong Zhang
DRL
38
4
0
15 Oct 2020
Smaller World Models for Reinforcement Learning
Jan Robine
Tobias Uelwer
Stefan Harmeling
DRL
59
3
0
12 Oct 2020
Deep Sequence Learning for Video Anticipation: From Discrete and Deterministic to Continuous and Stochastic
S. Aliakbarian
AI4TS
40
0
0
09 Oct 2020
Event Representation with Sequential, Semi-Supervised Discrete Variables
Mehdi Rezaee
Francis Ferraro
BDL
DRL
59
14
0
09 Oct 2020
FastVC: Fast Voice Conversion with non-parallel data
Oriol Barbany
Milos Cernak
51
7
0
08 Oct 2020
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
Shogo Seki
DiffM
124
21
0
06 Oct 2020
A Contrastive Learning Approach for Training Variational Autoencoder Priors
J. Aneja
Alex Schwing
Jan Kautz
Arash Vahdat
DRL
130
83
0
06 Oct 2020
The Academia Sinica Systems of Voice Conversion for VCC2020
Yu-Huai Peng
Cheng-Hung Hu
A. Kang
Hung-Shin Lee
Pin-Yuan Chen
Yu Tsao
Hsin-Min Wang
68
2
0
06 Oct 2020
Implicit Rank-Minimizing Autoencoder
Li Jing
Jure Zbontar
Yann LeCun
SSL
DRL
83
49
0
01 Oct 2020
The Utility of Decorrelating Colour Spaces in Vector Quantised Variational Autoencoders
A. Akbarinia
Raquel Gil-Rodríguez
Alban Flachot
Matteo Toscani
DRL
20
0
0
30 Sep 2020
Controllable Text Generation with Focused Variation
Lei Shu
Alexandros Papangelis
Yi-Chia Wang
Gokhan Tur
Hu Xu
Zhaleh Feizollahi
Bing-Quan Liu
Piero Molino
94
11
0
25 Sep 2020
A Unifying Review of Deep and Shallow Anomaly Detection
Lukas Ruff
Jacob R. Kauffmann
Robert A. Vandermeulen
G. Montavon
Wojciech Samek
Marius Kloft
Thomas G. Dietterich
Klaus-Robert Muller
UQCV
152
806
0
24 Sep 2020
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers
Jaemin Cho
Jiasen Lu
Dustin Schwenk
Hannaneh Hajishirzi
Aniruddha Kembhavi
VLM
MLLM
95
102
0
23 Sep 2020
Generative Model without Prior Distribution Matching
Cong Geng
Jia Wang
Lixing Chen
Zhiyong Gao
GAN
392
1
0
23 Sep 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
280
1,472
0
21 Sep 2020
Target Conditioning for One-to-Many Generation
Marie-Anne Lachaux
Armand Joulin
Guillaume Lample
56
13
0
21 Sep 2020
DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition
Chaoyou Fu
Xiang Wu
Yibo Hu
Huaibo Huang
Ran He
CVBM
78
86
0
20 Sep 2020
Discond-VAE: Disentangling Continuous Factors from the Discrete
Jaewoong Choi
Geonho Hwang
Myung-joo Kang
CoGe
CML
56
4
0
17 Sep 2020
Recurrent autoencoder with sequence-aware encoding
Robert Susik
AI4TS
31
6
0
15 Sep 2020
Brain2Word: Decoding Brain Activity for Language Generation
Nicolas Affolter
Béni Egressy
Damian Pascual
Roger Wattenhofer
107
24
0
10 Sep 2020
not-so-BigGAN: Generating High-Fidelity Images on Small Compute with Wavelet-based Super-Resolution
Seung-Jun Han
Akash Srivastava
C. Hurwitz
P. Sattigeri
David D. Cox
63
8
0
09 Sep 2020
Multilinear Latent Conditioning for Generating Unseen Attribute Combinations
Markos Georgopoulos
Grigorios G. Chrysos
Maja Pantic
Yannis Panagakis
GAN
DRL
68
17
0
09 Sep 2020
Deep data compression for approximate ultrasonic image formation
G. Pilikos
L. Horchens
K. Batenburg
Tristan van Leeuwen
F. Lucka
41
8
0
04 Sep 2020
End-to-End Learning of Neuromorphic Wireless Systems for Low-Power Edge Artificial Intelligence
N. Skatchkovsky
Hyeryung Jang
Osvaldo Simeone
62
24
0
03 Sep 2020
Stochastic Graph Recurrent Neural Network
Tijin Yan
Hongwei Zhang
Zirui Li
Yuanqing Xia
GNN
BDL
33
5
0
01 Sep 2020
GIF: Generative Interpretable Faces
Partha Ghosh
Pravir Singh Gupta
Roy Uziel
Anurag Ranjan
Michael J. Black
Timo Bolkart
CVBM
AI4CE
114
77
0
31 Aug 2020
Hierarchical Timbre-Painting and Articulation Generation
Michael Michelashvili
Lior Wolf
90
12
0
30 Aug 2020
Dynamical Variational Autoencoders: A Comprehensive Review
Laurent Girin
Simon Leglaive
Xiaoyu Bie
Julien Diard
Thomas Hueber
Xavier Alameda-Pineda
BDL
154
223
0
28 Aug 2020
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion
Yi Zhao
Wen-Chin Huang
Xiaohai Tian
Junichi Yamagishi
Rohan Kumar Das
Tomi Kinnunen
Zhenhua Ling
Tomoki Toda
96
211
0
28 Aug 2020
Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
107
20
0
27 Aug 2020
asya: Mindful verbal communication using deep learning
Ē. Urtāns
Ariel Tabaks
VLM
121
1
0
20 Aug 2020
Unsupervised Acoustic Unit Representation Learning for Voice Conversion using WaveNet Auto-encoders
Mingjie Chen
Thomas Hain
SSL
DRL
54
15
0
16 Aug 2020
Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
Shamane Siriwardhana
Andrew Reis
Rivindu Weerasekera
Suranga Nanayakkara
92
112
0
15 Aug 2020
Audio- and Gaze-driven Facial Animation of Codec Avatars
Alexander Richard
Colin S. Lea
Shugao Ma
Juergen Gall
Fernando de la Torre
Yaser Sheikh
CVBM
79
81
0
11 Aug 2020
Unsupervised Learning For Sequence-to-sequence Text-to-speech For Low-resource Languages
Haitong Zhang
Yue Lin
60
30
0
11 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
152
329
0
09 Aug 2020
Learning Sampling in Financial Statement Audits using Vector Quantised Autoencoder Neural Networks
Marco Schreyer
Timur Sattarov
Anita Gierbl
Bernd Reimer
Damian Borth
DRL
37
3
0
06 Aug 2020
Optimal Variance Control of the Score Function Gradient Estimator for Importance Weighted Bounds
Valentin Liévin
Andrea Dittadi
Anders Christensen
Ole Winther
DRL
61
6
0
05 Aug 2020
Timbre latent space: exploration and creative aspects
Antoine Caillon
Adrien Bitton
Brice Gatinet
P. Esling
67
1
0
04 Aug 2020
Learning from Few Samples: A Survey
Nihar Bendre
Hugo Terashima-Marín
Peyman Najafirad
VLM
BDL
87
54
0
30 Jul 2020
Foveation for Segmentation of Ultra-High Resolution Images
Chen Jin
Ryutaro Tanno
Moucheng Xu
T. Mertzanidou
Daniel C. Alexander
AI4TS
53
4
0
29 Jul 2020
dMelodies: A Music Dataset for Disentanglement Learning
Ashis Pati
Siddharth Gururani
Alexander Lerch
CoGe
DRL
66
10
0
29 Jul 2020
Privacy-preserving Voice Analysis via Disentangled Representations
Ranya Aloufi
Hamed Haddadi
David E. Boyle
DRL
132
58
0
29 Jul 2020
Generative networks as inverse problems with fractional wavelet scattering networks
Jiasong Wu
Jing Zhang
Fuzhi Wu
Youyong Kong
Guanyu Yang
L. Senhadji
H. Shu
GAN
54
1
0
28 Jul 2020
Unsupervised Subword Modeling Using Autoregressive Pretraining and Cross-Lingual Phone-Aware Modeling
Siyuan Feng
O. Scharenborg
SSL
72
4
0
25 Jul 2020
Previous
1
2
3
...
59
60
61
...
64
65
66
Next