Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.03499
Cited By
v1
v2 (latest)
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"WaveNet: A Generative Model for Raw Audio"
50 / 3,082 papers shown
Title
A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences
Pranay Manocha
Adam Finkelstein
Richard Y. Zhang
Nicholas J. Bryan
G. J. Mysore
Zeyu Jin
89
69
0
13 Jan 2020
Advbox: a toolbox to generate adversarial examples that fool neural networks
Dou Goodman
Xin Hao
Yang Wang
Yuesheng Wu
Junfeng Xiong
Huan Zhang
AAML
138
55
0
13 Jan 2020
Deep Learning based Pedestrian Inertial Navigation: Methods, Dataset and On-Device Inference
Changhao Chen
Peijun Zhao
Chris Xiaoxuan Lu
Wei Wang
Andrew Markham
A. Trigoni
71
114
0
13 Jan 2020
Temporally Folded Convolutional Neural Networks for Sequence Forecasting
Matthias Weissenbacher
AI4TS
41
0
0
10 Jan 2020
Mel-spectrogram augmentation for sequence to sequence voice conversion
Yeongtae Hwang
Hyemin Cho
Hongsun Yang
Dong-Ok Won
Insoo Oh
Seong-Whan Lee
65
15
0
06 Jan 2020
Temporal Tensor Transformation Network for Multivariate Time Series Prediction
Yuya Jeremy Ong
Mu Qiao
D. Jadav
AI4TS
20
4
0
04 Jan 2020
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
96
82
0
02 Jan 2020
Prediction of wall-bounded turbulence from wall quantities using convolutional neural networks
L. Guastoni
M. P. Encinar
P. Schlatter
Hossein Azizpour
R. Vinuesa
MDE
37
33
0
30 Dec 2019
nnAudio: An on-the-fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolution Neural Networks
K. Cheuk
Hans Anderson
Kat R. Agres
Dorien Herremans
248
5
0
27 Dec 2019
Synthesising Expressiveness in Peking Opera via Duration Informed Attention Network
Yusong Wu
Shengchen Li
Chengzhu Yu
Heng Lu
Chao Weng
Liqiang Zhang
Dong Yu
59
5
0
27 Dec 2019
Score and Lyrics-Free Singing Voice Generation
Jen-Yu Liu
Yu-Hua Chen
Yin-Cheng Yeh
Yi-Hsuan Yang
70
22
0
26 Dec 2019
Deep Learning-based Vehicle Behaviour Prediction For Autonomous Driving Applications: A Review
Sajjad Mozaffari
Omar Y. Al-Jarrah
M. Dianati
P. Jennings
A. Mouzakitis
93
135
0
25 Dec 2019
Multi-level Convolutional Autoencoder Networks for Parametric Prediction of Spatio-temporal Dynamics
Jiayang Xu
Karthik Duraisamy
AI4CE
95
143
0
23 Dec 2019
Probing the phonetic and phonological knowledge of tones in Mandarin TTS models
Jian Zhu
62
8
0
23 Dec 2019
Learning Singing From Speech
Liqiang Zhang
Chengzhu Yu
Heng Lu
Chao Weng
Yusong Wu
Xiang Xie
Zijin Li
Dong Yu
53
8
0
20 Dec 2019
HiLLoC: Lossless Image Compression with Hierarchical Latent Variable Models
James Townsend
Thomas Bird
Julius Kunze
David Barber
BDL
VLM
151
56
0
20 Dec 2019
Vertex Feature Encoding and Hierarchical Temporal Modeling in a Spatial-Temporal Graph Convolutional Network for Action Recognition
Konstantinos Papadopoulos
Enjie Ghorbel
Djamila Aouada
Björn E. Ottersten
GNN
162
42
0
20 Dec 2019
Smart Home Appliances: Chat with Your Fridge
Denis A. Gudovskiy
Gyuri Han
Takuya Yamaguchi
Sotaro Tsukizawa
LRM
26
4
0
19 Dec 2019
Learning a Spatio-Temporal Embedding for Video Instance Segmentation
Anthony Hu
Alex Kendall
R. Cipolla
123
19
0
19 Dec 2019
Water Supply Prediction Based on Initialized Attention Residual Network
Yu Long
Jingcheng Wang
Jingyi Wang
28
1
0
17 Dec 2019
A Transferable Adaptive Domain Adversarial Neural Network for Virtual Reality Augmented EMG-Based Gesture Recognition
Ulysse Côté-Allard
Gabriel Gagnon-Turcotte
A. Phinyomark
K. Glette
Erik J. Scheme
François Laviolette
Benoit Gosselin
41
7
0
16 Dec 2019
Efficient Convolutional Neural Networks for Diacritic Restoration
Sawsan Alqahtani
Ajay K. Mishra
Mona T. Diab
46
24
0
14 Dec 2019
Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining
Wen-Chin Huang
Tomoki Hayashi
Yi-Chiao Wu
Hirokazu Kameoka
Tomoki Toda
80
99
0
14 Dec 2019
SPIN: A High Speed, High Resolution Vision Dataset for Tracking and Action Recognition in Ping Pong
S. Schwarcz
Peng Xu
Davide D‘Ambrosio
Juhana Kangaspunta
A. Angelova
Huong Phan
Navdeep Jaitly
65
7
0
13 Dec 2019
Human Motion Anticipation with Symbolic Label
Julian Tanke
A. Weber
Juergen Gall
76
6
0
12 Dec 2019
Singing Synthesis: with a little help from my attention
Orazio Angelini
Alexis Moinet
K. Yanagisawa
Thomas Drugman
61
17
0
12 Dec 2019
Learning to Model Aspects of Hearing Perception Using Neural Loss Functions
Prateek Verma
J. Berger
AAML
31
3
0
11 Dec 2019
Incrementally Improving Graph WaveNet Performance on Traffic Prediction
Sam Shleifer
Clara H. McCreery
Vamsi Chitters
GNN
AI4TS
74
21
0
11 Dec 2019
Neural Voice Puppetry: Audio-driven Facial Reenactment
Justus Thies
Mohamed A. Elgharib
A. Tewari
Christian Theobalt
Matthias Nießner
VGen
3DH
88
375
0
11 Dec 2019
Encoding Musical Style with Transformer Autoencoders
Kristy Choi
Curtis Hawthorne
Ian Simon
Monica Dinculescu
Jesse Engel
95
90
0
10 Dec 2019
Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients
Brenden K. Petersen
Mikel Landajuela
T. Nathan Mundhenk
Claudio Santiago
Soo K. Kim
Joanne T. Kim
68
320
0
10 Dec 2019
MITAS: A Compressed Time-Domain Audio Separation Network with Parameter Sharing
Chao-I Tuan
Yuan-Kuei Wu
Hung-yi Lee
Yu Tsao
30
2
0
09 Dec 2019
Connecting Vision and Language with Localized Narratives
Jordi Pont-Tuset
J. Uijlings
Soravit Changpinyo
Radu Soricut
V. Ferrari
ObjD
143
252
0
06 Dec 2019
Normalizing Flows for Probabilistic Modeling and Inference
George Papamakarios
Eric T. Nalisnick
Danilo Jimenez Rezende
S. Mohamed
Balaji Lakshminarayanan
TPM
AI4CE
219
1,725
0
05 Dec 2019
Towards Robust Neural Vocoding for Speech Generation: A Survey
Po-Chun Hsu
Chun-hsuan Wang
Andy T. Liu
Hung-yi Lee
OOD
78
25
0
05 Dec 2019
PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network
Chen Deng
Chengzhu Yu
Heng Lu
Chao Weng
Dong Yu
SSL
71
40
0
04 Dec 2019
WaveFlow: A Compact Flow-based Model for Raw Audio
Ming-Yu Liu
Kainan Peng
Kexin Zhao
Z. Song
104
117
0
03 Dec 2019
High-quality Speech Synthesis Using Super-resolution Mel-Spectrogram
Leyuan Sheng
Dong-Yan Huang
Evgeny Nikolaevich Pavlovskiy
82
15
0
03 Dec 2019
Bayesian-Deep-Learning Estimation of Earthquake Location from Single-Station Observations
S. Mousavi
G. Beroza
BDL
26
104
0
03 Dec 2019
Long Distance Relationships without Time Travel: Boosting the Performance of a Sparse Predictive Autoencoder in Sequence Modeling
J. Gordon
D. Rawlinson
Subutai Ahmad
61
5
0
02 Dec 2019
TeaNet: universal neural network interatomic potential inspired by iterative electronic relaxations
So Takamoto
S. Izumi
Ju Li
GNN
65
80
0
02 Dec 2019
Flow Contrastive Estimation of Energy-Based Models
Ruiqi Gao
Erik Nijkamp
Diederik P. Kingma
Zhen Xu
Andrew M. Dai
Ying Nian Wu
GAN
101
115
0
02 Dec 2019
STConvS2S: Spatiotemporal Convolutional Sequence to Sequence Network for Weather Forecasting
Rafaela C. Nascimento
Y. M. Souto
Eduardo S. Ogasawara
Fábio Porto
Eduardo Bezerra
AI4TS
98
89
0
30 Nov 2019
Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech
Vatsal Aggarwal
Marius Cotescu
N. Prateek
Jaime Lorenzo-Trueba
Roberto Barra-Chicote
98
31
0
28 Nov 2019
Designing the Next Generation of Intelligent Personal Robotic Assistants for the Physically Impaired
Basit Ayantunde
Jane Odum
Fadlullah Olawumi
Joshua Olalekan
23
1
0
28 Nov 2019
AR-Net: A simple Auto-Regressive Neural Network for time-series
Oskar Triebe
N. Laptev
Ram Rajagopal
AI4TS
AI4CE
102
59
0
27 Nov 2019
ConCare: Personalized Clinical Feature Embedding via Capturing the Healthcare Context
Liantao Ma
Chaohe Zhang
Yasha Wang
Wenjie Ruan
Jiantao Wang
Wen Tang
Xinyu Ma
Xin Gao
Junyi Gao
66
159
0
27 Nov 2019
AdaCare: Explainable Clinical Health Status Representation Learning via Scale-Adaptive Feature Extraction and Recalibration
Liantao Ma
Junyi Gao
Yasha Wang
Chaohe Zhang
Jiangtao Wang
Wenjie Ruan
Wen Tang
Xin Gao
Xinyu Ma
OOD
AI4TS
79
121
0
27 Nov 2019
Jejueo Datasets for Machine Translation and Speech Synthesis
Kyubyong Park
Yo Joong Choe
Jiyeon Ham
26
5
0
27 Nov 2019
SchrödingeRNN: Generative Modeling of Raw Audio as a Continuously Observed Quantum State
Beñat Mencia Uranga
A. Lamacraft
96
3
0
26 Nov 2019
Previous
1
2
3
...
43
44
45
...
60
61
62
Next