1609.03499
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Papers citing
"WaveNet: A Generative Model for Raw Audio"
50 / 3,082 papers shown
Title
Recurrent Neural Networks for Time Series Forecasting: Current Status and Future Directions
Hansika Hewamalage
Christoph Bergmeir
Kasun Bandara
AI4TS
124
910
0
02 Sep 2019
Reusing Convolutional Activations from Frame to Frame to Speed up Training and Inference
Arno Khachatourian
28
0
0
02 Sep 2019
Taskmaster-1: Toward a Realistic and Diverse Dialog Dataset
Bill Byrne
Karthikeyan K
Chinnadhurai Sankar
Arvind Neelakantan
Daniel Duckworth
Semih Yavuz
Ben Goodrich
Amit Dubey
A. Cedilnik
Kyu-Young Kim
67
219
0
01 Sep 2019
READ: Recursive Autoencoders for Document Layout Generation
A. Patil
Omri Ben-Eliezer
Or Perel
Hadar Averbuch-Elor
3DV
SyDa
78
68
0
01 Sep 2019
Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments
Yusuke Yasuda
Xin Wang
Junichi Yamagishi
58
8
0
30 Aug 2019
Maximizing Mutual Information for Tacotron
Peng Liu
Xixin Wu
Shiyin Kang
Guangzhi Li
Jane Polak Scowcroft
Dong Yu
86
16
0
30 Aug 2019
Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning
Linchao Zhu
Sercan O. Arik
Yezhou Yang
Tomas Pfister
76
5
0
29 Aug 2019
Convolutional Recurrent Neural Network Based Progressive Learning for Monaural Speech Enhancement
Andong Li
Minmin Yuan
C. Zheng
Xiaodong Li
53
8
0
28 Aug 2019
Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis
Xin Wang
Junichi Yamagishi
66
32
0
27 Aug 2019
Multi-Task Gaussian Processes and Dilated Convolutional Networks for Reconstruction of Reproductive Hormonal Dynamics
Iñigo Urteaga
Tristan Bertin
Theresa M. Hardy
D. Albers
Noémie Elhadad
AI4TS
AI4CE
20
5
0
27 Aug 2019
Overview of Tasks and Investigation of Subjective Evaluation Methods in Environmental Sound Synthesis and Conversion
Yuki Okamoto
Keisuke Imoto
Tatsuya Komatsu
Shinnosuke Takamichi
Takumi Yagyu
Ryosuke Yamanishi
Y. Yamashita
49
5
0
27 Aug 2019
PixelVAE++: Improved PixelVAE with Discrete Prior
Hossein Sadeghi
Evgeny Andriyash
W. Vinci
L. Buffoni
Mohammad H. Amin
BDL
DRL
53
33
0
26 Aug 2019
Spiking Neural Predictive Coding for Continual Learning from Data Streams
Alexander Ororbia
89
28
0
23 Aug 2019
RNNs Evolving on an Equilibrium Manifold: A Panacea for Vanishing and Exploding Gradients?
Anil Kag
Ziming Zhang
Venkatesh Saligrama
44
8
0
22 Aug 2019
Trajectory Space Factorization for Deep Video-Based 3D Human Pose Estimation
Jiahao Lin
Gim Hee Lee
62
75
0
22 Aug 2019
Quantile Convolutional Neural Networks for Value at Risk Forecasting
Gábor Petneházi
AI4TS
19
2
0
21 Aug 2019
Developing Creative AI to Generate Sculptural Objects
Songwei Ge
Austin Dill
Eunsu Kang
Chun-Liang Li
Lingyao Zhang
Manzil Zaheer
Barnabás Póczós
3DPC
40
8
0
20 Aug 2019
TabNet: Attentive Interpretable Tabular Learning
Sercan O. Arik
Tomas Pfister
LMTD
230
1,386
0
20 Aug 2019
Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck
Shuang Ma
Daniel J. McDuff
Yale Song
89
25
0
19 Aug 2019
Salient Speech Representations Based on Cloned Networks
W. Kleijn
Felicia S. C. Lim
Michael Chinen
Jan Skoglund
27
3
0
19 Aug 2019
A Dual-Staged Context Aggregation Method Towards Efficient End-To-End Speech Enhancement
Kai Zhen
Mi Suk Lee
Minje Kim
44
3
0
18 Aug 2019
JVS corpus: free Japanese multi-speaker voice corpus
Shinnosuke Takamichi
Kentaro Mitsui
Yuki Saito
Tomoki Koriyama
Naoko Tanji
Hiroshi Saruwatari
69
72
0
17 Aug 2019
Survey on Deep Neural Networks in Speech and Vision Systems
M. Alam
Manar D. Samad
Lasitha Vidyaratne
Alexander M. Glandon
Khan M. Iftekharuddin
3DV
VLM
AI4TS
100
212
0
16 Aug 2019
Unconstrained Monotonic Neural Networks
Antoine Wehenkel
Gilles Louppe
TPM
269
150
0
14 Aug 2019
Predicting 3D Human Dynamics from Video
Jason Y. Zhang
Panna Felsen
Angjoo Kanazawa
Jitendra Malik
3DH
47
110
0
13 Aug 2019
Mix & Match: training convnets with mixed image sizes for improved accuracy, speed and scale resiliency
Elad Hoffer
Berry Weinstein
Itay Hubara
Tal Ben-Nun
Torsten Hoefler
Daniel Soudry
113
20
0
12 Aug 2019
Universal Adversarial Audio Perturbations
Sajjad Abdoli
L. G. Hafemann
Jérôme Rony
Ismail Ben Ayed
P. Cardinal
Alessandro Lameiras Koerich
AAML
95
52
0
08 Aug 2019
Continuous Graph Flow
Zhiwei Deng
Megha Nawhal
Lili Meng
Greg Mori
56
3
0
07 Aug 2019
Adversarially Trained End-to-end Korean Singing Voice Synthesis System
Juheon Lee
Hyeong-Seok Choi
Chang-Bin Jeon
Junghyun Koo
Kyogu Lee
95
78
0
06 Aug 2019
Likelihood Contribution based Multi-scale Architecture for Generative Flows
Hari Prasanna Das
Pieter Abbeel
C. Spanos
DRL
AI4CE
59
5
0
05 Aug 2019
Acoustic Sounds for Wellbeing: A Novel Dataset and Baseline Results
Alice Baird
Bjoern Schuller
10
0
0
05 Aug 2019
V2S attack: building DNN-based voice conversion from automatic speaker verification
Taiki Nakamura
Yuki Saito
Shinnosuke Takamichi
Yusuke Ijima
Hiroshi Saruwatari
46
7
0
05 Aug 2019
Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation
Hao Tang
Dan Xu
Gaowen Liu
Wei Wang
N. Sebe
Yan Yan
GAN
115
82
0
02 Aug 2019
Sound source detection, localization and classification using consecutive ensemble of CRNN models
Slawomir Kapka
M. Lewandowski
122
66
0
02 Aug 2019
Marine Mammal Species Classification using Convolutional Neural Networks and a Novel Acoustic Representation
Mark R. P. Thomas
B. Martin
Katie A. Kowarski
B. Gaudet
Stan Matwin
54
48
0
30 Jul 2019
Multi-Frame Cross-Entropy Training for Convolutional Neural Networks in Speech Recognition
Tom Sercu
Neil Rohit Mallinar
17
0
0
29 Jul 2019
StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Nobukatsu Hojo
100
144
0
29 Jul 2019
Making Sense of Vision and Touch: Learning Multimodal Representations for Contact-Rich Tasks
Michelle A. Lee
Yuke Zhu
Peter Zachares
Matthew Tan
K. Srinivasan
Silvio Savarese
Fei-Fei Li
Animesh Garg
Jeannette Bohg
SSL
85
213
0
28 Jul 2019
Deep Generative Quantile-Copula Models for Probabilistic Forecasting
Ruofeng Wen
Kari Torkkola
AI4TS
68
32
0
24 Jul 2019
MadMiner: Machine learning-based inference for particle physics
Johann Brehmer
F. Kling
Irina Espejo
Kyle Cranmer
81
115
0
24 Jul 2019
Non-Parallel Voice Conversion with Cyclic Variational Autoencoder
Patrick Lumban Tobing
Yi-Chiao Wu
Tomoki Hayashi
Kazuhiro Kobayashi
Tomoki Toda
74
68
0
24 Jul 2019
Temporally Consistent Horizon Lines
Florian Kluger
H. Ackermann
M. Yang
Bodo Rosenhahn
AI4TS
188
16
0
23 Jul 2019
Deep Learning for Time Series Forecasting: The Electric Load Case
Alberto Gasparin
S. Lukovic
Cesare Alippi
AI4TS
80
231
0
22 Jul 2019
Statistical Voice Conversion with Quasi-Periodic WaveNet Vocoder
Yi-Chiao Wu
Patrick Lumban Tobing
Tomoki Hayashi
Kazuhiro Kobayashi
Tomoki Toda
140
2
0
21 Jul 2019
DNN-based Speaker Embedding Using Subjective Inter-speaker Similarity for Multi-speaker Modeling in Speech Synthesis
Yuki Saito
Shinnosuke Takamichi
Hiroshi Saruwatari
40
10
0
19 Jul 2019
Forward-Backward Decoding for Regularizing End-to-End TTS
Yibin Zheng
Xi Wang
Lei He
Shifeng Pan
Frank Soong
Zhengqi Wen
J. Tao
51
13
0
18 Jul 2019
Towards Understanding Generalization in Gradient-Based Meta-Learning
Simon Guiroy
Vikas Verma
C. Pal
73
21
0
16 Jul 2019
Quant GANs: Deep Generation of Financial Time Series
Magnus Wiese
R. Knobloch
R. Korn
Peter Kretschmer
GAN
AI4TS
AIFin
105
281
0
15 Jul 2019
Hierarchical Sequence to Sequence Voice Conversion with Limited Data
P. Narayanan
Punarjay Chakravarty
F. Charette
G. Puskorius
53
3
0
15 Jul 2019
The Bach Doodle: Approachable music composition with machine learning at scale
Cheng-Zhi Anna Huang
Curtis Hawthorne
Adam Roberts
Monica Dinculescu
James Wexler
Leon Hong
Jacob Howcroft
166
27
0
14 Jul 2019