1609.03499
WaveNet: A Generative Model for Raw Audio
12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
Papers citing "WaveNet: A Generative Model for Raw Audio"
50 / 3,082 papers shown
Title
Neural Percussive Synthesis Parameterised by High-Level Timbral Features
António Ramires
Pritish Chandna
Xavier Favory
Emilia Gómez
Xavier Serra
81
23
0
25 Nov 2019
Natural Image Manipulation for Autoregressive Models Using Fisher Scores
Wilson Yan
Jonathan Ho
Pieter Abbeel
28
0
0
25 Nov 2019
Adversarial Learning of Privacy-Preserving and Task-Oriented Representations
Taihong Xiao
Yi-Hsuan Tsai
Kihyuk Sohn
Manmohan Chandraker
Ming-Hsuan Yang
76
75
0
22 Nov 2019
Go From the General to the Particular: Multi-Domain Translation with Domain Transformation Networks
Yong Wang
Longyue Wang
Shuming Shi
Victor O.K. Li
Zhaopeng Tu
62
25
0
22 Nov 2019
Adversarial Robustness of Flow-Based Generative Models
Phillip E. Pope
Yogesh Balaji
Soheil Feizi
AAML
48
20
0
20 Nov 2019
Deep-Learning Estimation of Band Gap with the Reading-Periodic-Table Method and Periodic Convolution Layer
Tomohiko Konno
71
1
0
16 Nov 2019
Granular Motor State Monitoring of Free Living Parkinson's Disease Patients via Deep Learning
K. Yuksel
Jann Goschenhofer
H. V. Varma
U. Fietzek
Franz MJ Pfister
OOD
30
0
0
15 Nov 2019
Deep Long Audio Inpainting
Ya-Liang Chang
Kuan-Ying Lee
Po-Yu Wu
Hung-yi Lee
Winston H. Hsu
68
33
0
15 Nov 2019
Seq-U-Net: A One-Dimensional Causal U-Net for Efficient Sequence Modelling
Ruizhe Zhao
Brian K. Vogel
Tanvir Ahmed
Wayne Luk
61
37
0
14 Nov 2019
Compressive Transformers for Long-Range Sequence Modelling
Jack W. Rae
Anna Potapenko
Siddhant M. Jayakumar
Timothy Lillicrap
RALM
VLM
KELM
105
656
0
13 Nov 2019
Rate-Regularization and Generalization in VAEs
Alican Bozkurt
Babak Esmaeili
Jean-Baptiste Tristan
Dana H. Brooks
Jennifer G. Dy
Jan-Willem van de Meent
DRL
92
8
0
11 Nov 2019
GMAN: A Graph Multi-Attention Network for Traffic Prediction
Chuanpan Zheng
Xiaoliang Fan
Cheng-Yu Wang
Jianzhong Qi
AI4TS
AI4CE
154
1,406
0
11 Nov 2019
Generative Autoregressive Networks for 3D Dancing Move Synthesis from Music
Hyemin Ahn
Jaehun Kim
Kihyun Kim
Songhwai Oh
GAN
81
44
0
11 Nov 2019
Feedback Recurrent AutoEncoder
Yang Yang
Guillaume Sautière
J. Jon Ryu
Taco S. Cohen
106
21
0
11 Nov 2019
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAI
AI4TS
122
338
0
10 Nov 2019
Transformation of low-quality device-recorded speech to high-quality speech using improved SEGAN model
Seyyed Saeed Sarfjoo
Xin Wang
G. Henter
Jaime Lorenzo-Trueba
Shinji Takaki
Junichi Yamagishi
45
8
0
10 Nov 2019
Characterizing dynamically varying acoustic scenes from egocentric audio recordings in workplace setting
Arindam Jati
Amrutha Nadarajan
Karel Mundnich
Shrikanth Narayanan
34
2
0
10 Nov 2019
XceptionTime: A Novel Deep Architecture based on Depthwise Separable Convolutions for Hand Gesture Classification
E. Rahimian
Soheil Zabihi
S. F. Atashzar
A. Asif
Arash Mohammadi
71
45
0
09 Nov 2019
On the Relationship between Self-Attention and Convolutional Layers
Jean-Baptiste Cordonnier
Andreas Loukas
Martin Jaggi
179
535
0
08 Nov 2019
Teacher-Student Training for Robust Tacotron-based TTS
Rui Liu
Berrak Sisman
Jingdong Li
F. Bao
Guanglai Gao
Haizhou Li
109
38
0
07 Nov 2019
Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework
Mingbo Ma
Baigong Zheng
Kaibo Liu
Renjie Zheng
Hairong Liu
Kainan Peng
Kenneth Church
Liang Huang
66
31
0
07 Nov 2019
Deep Hedging: Learning to Simulate Equity Option Markets
Magnus Wiese
Lianjun Bai
Ben Wood
Hans Buehler
GAN
90
69
0
05 Nov 2019
Emotional speech synthesis with rich and granularized control
Seyun Um
Sangshin Oh
Kyungguen Byun
Inseon Jang
C. Ahn
Hong-Goo Kang
85
90
0
05 Nov 2019
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
Xin Wang
Junichi Yamagishi
Massimiliano Todisco
Héctor Delgado
A. Nautsch
...
J. Bonastre
Avashna Govender
S. Ronanki
Jing-Xuan Zhang
Zhenhua Ling
99
12
0
05 Nov 2019
The frontier of simulation-based inference
Kyle Cranmer
Johann Brehmer
Gilles Louppe
AI4CE
277
859
0
04 Nov 2019
Deep-Gap: A deep learning framework for forecasting crowdsourcing supply-demand gap based on imaging time series and residual learning
Ahmed Ben Said
A. Erradi
AI4TS
40
7
0
02 Nov 2019
Deep convolutional neural networks for multi-scale time-series classification and application to disruption prediction in fusion devices
R. Churchill
the DIII-D team
AI4CE
36
10
0
31 Oct 2019
Neural Density Estimation and Likelihood-free Inference
George Papamakarios
BDL
DRL
100
47
0
29 Oct 2019
Disentangling Timbre and Singing Style with Multi-singer Singing Synthesis System
Juheon Lee
Hyeong-Seok Choi
Junghyun Koo
Kyogu Lee
45
18
0
29 Oct 2019
Spoofing Speaker Verification Systems with Deep Multi-speaker Text-to-speech Synthesis
Mingrui Yuan
Z. Duan
23
1
0
29 Oct 2019
Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment
Yusuke Yasuda
Xin Wang
Junichi Yamagishi
28
2
0
28 Oct 2019
Transferring neural speech waveform synthesizers to musical instrument sounds generation
Yi Zhao
Xin Wang
Lauri Juvela
Junichi Yamagishi
92
17
0
27 Oct 2019
Implicit Posterior Variational Inference for Deep Gaussian Processes
Haibin Yu
Yizhou Chen
Zhongxiang Dai
K. H. Low
Patrick Jaillet
88
43
0
26 Oct 2019
Multi-Reference Neural TTS Stylization with Adversarial Cycle Consistency
M. Whitehill
Shuang Ma
Daniel J. McDuff
Yale Song
111
35
0
25 Oct 2019
Study of Deep Generative Models for Inorganic Chemical Compositions
Yoshihide Sawada
Koji Morikawa
Mikiya Fujii
GAN
69
13
0
25 Oct 2019
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
Ryuichi Yamamoto
Eunwoo Song
Jae-Min Kim
195
821
0
25 Oct 2019
Hierarchical Representation Learning in Graph Neural Networks with Node Decimation Pooling
F. Bianchi
Daniele Grattarola
L. Livi
Cesare Alippi
199
49
0
24 Oct 2019
Towards Fine-Grained Prosody Control for Voice Conversion
Zheng Lian
Zhengqi Wen
70
19
0
24 Oct 2019
Vision-Infused Deep Audio Inpainting
Hang Zhou
Ziwei Liu
Lingfeng Guo
Ping Luo
Dahua Lin
142
88
0
24 Oct 2019
Fast and High-Quality Singing Voice Synthesis System based on Convolutional Neural Networks
Kazuhiro Nakamura
Shinji Takaki
Kei Hashimoto
Keiichiro Oura
Yoshihiko Nankaku
K. Tokuda
84
19
0
24 Oct 2019
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit
Tomoki Hayashi
Ryuichi Yamamoto
Katsuki Inoue
Takenori Yoshimura
Shinji Watanabe
Tomoki Toda
K. Takeda
Yu Zhang
Xu Tan
VLM
93
205
0
24 Oct 2019
Expression Analysis Based on Face Regions in Read-world Conditions
Zheng Lian
Ya Li
J. Tao
Jian Huang
Mingyue Niu
CVBM
47
58
0
23 Oct 2019
Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales
Sanjay Thakur
H. V. Hoof
Gunshi Gupta
David Meger
BDL
42
2
0
23 Oct 2019
Detecting Out-of-Distribution Inputs in Deep Neural Networks Using an Early-Layer Output
Vahdat Abdelzad
Krzysztof Czarnecki
Rick Salay
Taylor Denouden
Sachin Vernekar
Buu Phan
OODD
64
47
0
23 Oct 2019
Complex Transformer: A Framework for Modeling Complex-Valued Sequence
Muqiao Yang
Martin Q. Ma
Dongyu Li
Yao-Hung Hubert Tsai
Ruslan Salakhutdinov
ViT
53
38
0
22 Oct 2019
GANspection
Hammad A. Ayyubi
GAN
18
0
0
21 Oct 2019
You May Not Need Order in Time Series Forecasting
Yunkai Zhang
Qiao Jiang
Shurui Li
Xiaoyong Jin
Xueying Ma
Xifeng Yan
AI4TS
28
3
0
21 Oct 2019
XL-Editor: Post-editing Sentences with XLNet
Yong-Siang Shih
Wei-Cheng Chang
Yiming Yang
KELM
79
11
0
19 Oct 2019
Label-efficient audio classification through multitask learning and self-supervision
Tyler Lee
Ting Gong
Suchismita Padhy
Andrew Rouditchenko
A. Ndirango
SSL
VLM
62
7
0
19 Oct 2019
Decoupling feature propagation from the design of graph auto-encoders
P. Scherer
Helena Andrés-Terré
Pietro Lio
M. Jamnik
BDL
16
1
0
18 Oct 2019