1609.03499
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Papers citing "WaveNet: A Generative Model for Raw Audio"
50 / 3,082 papers shown
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
234
1,799
0
20 Sep 2018
Neural Speech Synthesis with Transformer Network
Naihan Li
Shujie Liu
Yanqing Liu
Sheng Zhao
Ming-Yuan Liu
M. Zhou
72
102
0
19 Sep 2018
HashTran-DNN: A Framework for Enhancing Robustness of Deep Neural Networks against Adversarial Malware Samples
Deqiang Li
Ramesh Baral
Tao Li
Han Wang
Qianmu Li
Shouhuai Xu
AAML
63
21
0
18 Sep 2018
A Multi-Stage Algorithm for Acoustic Physical Model Parameters Estimation
L. Gabrielli
Stefano Tomassetti
S. Squartini
C. Zinato
Stefano Guaiana
25
1
0
14 Sep 2018
Investigation of Multimodal Features, Classifiers and Fusion Methods for Emotion Recognition
Zheng Lian
Ya Li
J. Tao
Jian Huang
57
22
0
13 Sep 2018
PhaseLink: A Deep Learning Approach to Seismic Phase Association
Zachary E. Ross
Yisong Yue
Men‐Andrin Meier
E. Hauksson
T. Heaton
54
155
0
08 Sep 2018
Constrained Generation of Semantically Valid Graphs via Regularizing Variational Autoencoders
Tengfei Ma
Jie Chen
Cao Xiao
140
210
0
07 Sep 2018
Self-Supervised Generation of Spatial Audio for 360 Video
Pedro Morgado
Nuno Vasconcelos
Timothy R. Langlois
Oliver Wang
MDE
66
174
0
07 Sep 2018
Super-Resolution Perception for Industrial Sensor Data
Jinjin Gu
Haoyu Chen
Guolong Liu
Gaoqi Liang
Xinlei Wang
Junhua Zhao
32
3
0
06 Sep 2018
Recurrent World Models Facilitate Policy Evolution
David R Ha
Jürgen Schmidhuber
SyDa
TPM
137
959
0
04 Sep 2018
Handwriting styles: benchmarks and evaluation metrics
Omar Mohammed
Gérard Bailly
D. Pellier
24
3
0
04 Sep 2018
InteriorNet: Mega-scale Multi-sensor Photo-realistic Indoor Scenes Dataset
Wenbin Li
Sajad Saeedi
J. McCormac
R. Clark
Dimos Tzoumanikas
Qing Ye
Yuzhong Huang
Ruiming Tang
Stefan Leutenegger
3DV
83
225
0
03 Sep 2018
Spherical Latent Spaces for Stable Variational Autoencoders
Jiacheng Xu
Greg Durrett
BDL
DRL
99
195
0
31 Aug 2018
Dissecting Contextual Word Embeddings: Architecture and Representation
Matthew E. Peters
Mark Neumann
Luke Zettlemoyer
Wen-tau Yih
113
434
0
27 Aug 2018
Smoothed Dilated Convolutions for Improved Dense Prediction
Zhengyang Wang
Shuiwang Ji
75
166
0
27 Aug 2018
Attentive Sequence to Sequence Translation for Localizing Clips of Interest by Natural Language Descriptions
Ke Ning
Linchao Zhu
Ming Cai
Yi Yang
Di Xie
Leilei Gan
30
2
0
27 Aug 2018
Deep Learning: Computational Aspects
Nicholas G. Polson
Vadim Sokolov
PINN
BDL
AI4CE
58
14
0
26 Aug 2018
Semi-Autoregressive Neural Machine Translation
Chunqi Wang
Ji Zhang
Haiqing Chen
85
89
0
26 Aug 2018
Semantic-Unit-Based Dilated Convolution for Multi-Label Text Classification
Junyang Lin
Qi Su
Pengcheng Yang
Shuming Ma
Xu Sun
VLM
68
60
0
26 Aug 2018
Voice Conversion with Conditional SampleRNN
Cong Zhou
Michael Horgan
Vivek Kumar
Cristina Vasco
Dan Darcy
47
20
0
24 Aug 2018
Scalable Population Synthesis with Deep Generative Modeling
S. Borysov
Jeppe Rich
Francisco Câmara Pereira
41
58
0
21 Aug 2018
Machine Learning for Spatiotemporal Sequence Forecasting: A Survey
Xingjian Shi
Dit-Yan Yeung
AI4TS
70
88
0
21 Aug 2018
Fast Spectrogram Inversion using Multi-head Convolutional Neural Networks
Sercan O. Arik
Heewoo Jun
G. Diamos
76
108
0
20 Aug 2018
Peptide-Spectra Matching from Weak Supervision
S. Schoenholz
Sean Hackett
Laura Deming
E. Melamud
Navdeep Jaitly
...
Jonathon J. O’Brien
George E. Dahl
Bryson D. Bennett
Andrew M. Dai
D. Koller
OOD
28
10
0
20 Aug 2018
A study on speech enhancement using exponent-only floating point quantized neural network (EOFP-QNN)
Y. Hsu
Yu-Chen Lin
Szu-Wei Fu
Yu Tsao
Tei-Wei Kuo
MQ
65
15
0
17 Aug 2018
LARNN: Linear Attention Recurrent Neural Network
Guillaume Chevalier
HAI
AIMat
28
11
0
16 Aug 2018
Investigation of Using Disentangled and Interpretable Representations for One-shot Cross-lingual Voice Conversion
Seyed Hamidreza Mohammadi
Taehwan Kim
DRL
80
15
0
15 Aug 2018
A Simple Convolutional Generative Network for Next Item Recommendation
Fajie Yuan
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
Xiangnan He
71
553
0
15 Aug 2018
GestureGAN for Hand Gesture-to-Gesture Translation in the Wild
Hao Tang
Wei Wang
Dan Xu
Yan Yan
N. Sebe
GAN
SLR
76
88
0
14 Aug 2018
Small Sample Learning in Big Data Era
Jun Shu
Zongben Xu
Deyu Meng
108
72
0
14 Aug 2018
iNNvestigate neural networks!
Maximilian Alber
Sebastian Lapuschkin
P. Seegerer
Miriam Hägele
Kristof T. Schütt
G. Montavon
Wojciech Samek
K. Müller
Sven Dähne
Pieter-Jan Kindermans
79
349
0
13 Aug 2018
Pervasive Attention: 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction
Maha Elbayad
Laurent Besacier
Jakob Verbeek
HAI
94
82
0
11 Aug 2018
Neural Importance Sampling
Thomas Müller
Brian McWilliams
Fabrice Rousselle
Markus Gross
Jan Novák
100
365
0
11 Aug 2018
This Time with Feeling: Learning Expressive Musical Performance
Sageev Oore
Ian Simon
Sander Dieleman
Douglas Eck
Karen Simonyan
155
217
0
10 Aug 2018
Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences
Cheng-chieh Yeh
Po-Chun Hsu
Ju-Chieh Chou
Hung-yi Lee
Lin-Shan Lee
59
23
0
09 Aug 2018
Improved Deep Spectral Convolution Network For Hyperspectral Unmixing With Multinomial Mixture Kernel and Endmember Uncertainty
Savas Ozkan
G. Akar
UQCV
109
14
0
03 Aug 2018
Likelihood-free inference with an improved cross-entropy estimator
M. Stoye
Johann Brehmer
Gilles Louppe
J. Pavez
Kyle Cranmer
FedML
UQCV
BDL
163
48
0
02 Aug 2018
Investigating accuracy of pitch-accent annotations in neural network-based speech synthesis and denoising effects
Hieu-Thi Luong
Xin Wang
Junichi Yamagishi
Nobuyuki Nishizawa
51
16
0
02 Aug 2018
Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder
Yi Zhao
Shinji Takaki
Hieu-Thi Luong
Junichi Yamagishi
Daisuke Saito
Nobuaki Minematsu
71
64
0
31 Jul 2018
Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems
Hieu-Thi Luong
Junichi Yamagishi
89
7
0
31 Jul 2018
Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis
G. Henter
Jaime Lorenzo-Trueba
Xin Wang
Junichi Yamagishi
DRL
SSL
88
61
0
30 Jul 2018
Speaker Recognition from Raw Waveform with SincNet
Mirco Ravanelli
Yoshua Bengio
203
724
0
29 Jul 2018
Analysing Shortcomings of Statistical Parametric Speech Synthesis
G. Henter
Simon King
Thomas Merritt
G. Degottex
11
3
0
28 Jul 2018
End-to-end Deep Learning from Raw Sensor Data: Atrial Fibrillation Detection using Wearables
Igor Gotlibovych
Stuart Crawford
Dileep Goyal
Jiaqi Liu
Yaniv Kerem
D. Benaron
Defne Yilmaz
G. Marcus
Yihan Li
73
44
0
27 Jul 2018
Generating 3D faces using Convolutional Mesh Autoencoders
Anurag Ranjan
Timo Bolkart
Soubhik Sanyal
Michael J. Black
CVBM
3DH
106
575
0
26 Jul 2018
Noise Contrastive Priors for Functional Uncertainty
Danijar Hafner
Dustin Tran
Timothy Lillicrap
A. Irpan
James Davidson
AAML
BDL
UQCV
150
74
0
24 Jul 2018
GRAINS: Generative Recursive Autoencoders for INdoor Scenes
Manyi Li
A. Patil
Kai Xu
S. Chaudhuri
Owais Khan
Ariel Shamir
Changhe Tu
Baoquan Chen
Daniel Cohen-Or
Hao Zhang
3DV
VGen
88
143
0
24 Jul 2018
Recent Advances in Deep Learning: An Overview
Matiur Rahman Minar
Jibon Naher
VLM
104
117
0
21 Jul 2018
Deep Learning
Nicholas G. Polson
Vadim Sokolov
AI4CE
BDL
75
1
0
20 Jul 2018
ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech
Wei Ping
Kainan Peng
Jitong Chen
102
347
0
19 Jul 2018