WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
PixelSNAIL: An Improved Autoregressive Generative Model
Xi Chen
Nikhil Mishra
Mostafa Rohaninejad
Pieter Abbeel
DRL DiffM BDL GAN
80
276
0
28 Dec 2017
CNN Is All You Need
Qiming Chen
R. Wu
AIMat
17
17
0
27 Dec 2017
Towards Structured Analysis of Broadcast Badminton Videos
Anurag Ghosh
Suriya Singh
C. V. Jawahar
48
46
0
23 Dec 2017
On Using Backpropagation for Speech Texture Generation and Voice Conversion
J. Chorowski
Ron J. Weiss
Rif A. Saurous
Samy Bengio
43
19
0
22 Dec 2017
Adversarial Examples: Attacks and Defenses for Deep Learning
Xiaoyong Yuan
Pan He
Qile Zhu
Xiaolin Li
SILM AAML
156
1,629
0
19 Dec 2017
Dynamic Weight Alignment for Temporal Convolutional Neural Networks
Brian Kenji Iwana
S. Uchida
AI4TS
59
8
0
18 Dec 2017
Generating and designing DNA with deep generative models
N. Killoran
Leo J. Lee
Andrew Delong
David Duvenaud
B. Frey
AI4CE
63
147
0
17 Dec 2017
Deep Learning for Distant Speech Recognition
Mirco Ravanelli
79
16
0
17 Dec 2017
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Jonathan Shen
Ruoming Pang
Ron J. Weiss
M. Schuster
Navdeep Jaitly
...
Yuxuan Wang
RJ Skerry-Ryan
Rif A. Saurous
Yannis Agiomyrgiannakis
Yonghui Wu
96
2,707
0
16 Dec 2017
A Hierarchical Recurrent Neural Network for Symbolic Melody Generation
Jian Wu
Changran Hu
Yulong Wang
Xiaolin Hu
Jun Zhu
75
84
0
14 Dec 2017
DLR : Toward a deep learned rhythmic representation for music content analysis
Yeon-ju Jeong
Keunwoo Choi
Hosan Jeong
114
4
0
14 Dec 2017
Motion Switching with Sensory and Instruction Signals by designing Dynamical Systems using Deep Neural Network
Kanata Suzuki
Hiroki Mori
T. Ogata
52
20
0
14 Dec 2017
Over the Air Deep Learning Based Radio Signal Classification
Tim O'Shea
Tamoghna Roy
T. Clancy
78
1,096
0
13 Dec 2017
Music Generation by Deep Learning - Challenges and Directions
Jean-Pierre Briot
F. Pachet
MGen
99
131
0
09 Dec 2017
Learning to Fuse Music Genres with Generative Adversarial Dual Learning
Zhiqian Chen
Chih-Wei Wu
Yen-Cheng Lu
Alexander Lerch
Chang-Tien Lu
GAN
55
8
0
05 Dec 2017
Visual to Sound: Generating Natural Sound for Videos in the Wild
Yipin Zhou
Zhaowen Wang
Chen Fang
Trung Bui
Tamara L. Berg
VGen
98
209
0
04 Dec 2017
Learning Fast and Slow: PROPEDEUTICA for Real-time Malware Detection
Ruimin Sun
Xiaoyong Yuan
Pan He
Qile Zhu
Aokun Chen
André Grégio
Daniela Oliveira
Xiaolin Li
AAML
42
11
0
04 Dec 2017
Spatial PixelCNN: Generating Images from Patches
Nader Akoury
Anh Totti Nguyen
62
4
0
03 Dec 2017
DeepWear: Adaptive Local Offloading for On-Wearable Deep Learning
Mengwei Xu
Feng Qian
Mengze Zhu
Feifan Huang
Saumay Pushp
Xuanzhe Liu
71
23
0
01 Dec 2017
Utilizing Domain Knowledge in End-to-End Audio Processing
T. M. S. Tax
J. Antich
Hendrik Purwins
Lars Maaløe
19
8
0
01 Dec 2017
Wavenet based low rate speech coding
W. Kleijn
Felicia S. C. Lim
Alejandro Luebs
Jan Skoglund
Florian Stimberg
Quan Wang
Thomas C. Walters
61
143
0
01 Dec 2017
Time Domain Neural Audio Style Transfer
P. Mital
60
12
0
29 Nov 2017
A Multi-Horizon Quantile Recurrent Forecaster
Ruofeng Wen
Kari Torkkola
Balakrishnan Narayanaswamy
Dhruv Madeka
BDL AI4TS
65
433
0
29 Nov 2017
TensorFlow Distributions
Joshua V. Dillon
I. Langmore
Dustin Tran
E. Brevdo
Srinivas Vasudevan
David A. Moore
Brian Patton
Alexander A. Alemi
Matt Hoffman
Rif A. Saurous
GP
123
352
0
28 Nov 2017
Parallel WaveNet: Fast High-Fidelity Speech Synthesis
Aaron van den Oord
Yazhe Li
Igor Babuschkin
Karen Simonyan
Oriol Vinyals
...
Alex Graves
Helen King
T. Walters
Dan Belov
Demis Hassabis
233
859
0
28 Nov 2017
Quantifying the Effects of Enforcing Disentanglement on Variational Autoencoders
Momchil Peychev
Petar Velickovic
Pietro Lio
CoGe DRL
86
0
0
24 Nov 2017
Invariance of Weight Distributions in Rectified MLPs
Russell Tsuchida
Farbod Roosta-Khorasani
M. Gallagher
MLT
120
36
0
24 Nov 2017
Non-local Neural Networks
Xiaolong Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
366
8,948
0
21 Nov 2017
JamBot: Music Theory Aware Chord Based Generation of Polyphonic Music with LSTMs
Gino Brunner
Yuyi Wang
Roger Wattenhofer
Jonas Wiesendanger
MGen
66
49
0
21 Nov 2017
Speech Dereverberation with Context-aware Recurrent Neural Networks
J. F. Santos
T. Falk
47
36
0
16 Nov 2017
How Generative Adversarial Networks and Their Variants Work: An Overview
Yongjun Hong
Uiwon Hwang
Jaeyoon Yoo
Sungroh Yoon
GAN
131
159
0
16 Nov 2017
Emotional End-to-End Neural Speech Synthesizer
Younggun Lee
Azam Rabiee
Soo-Young Lee
93
106
0
15 Nov 2017
Non-Autoregressive Neural Machine Translation
Jiatao Gu
James Bradbury
Caiming Xiong
Victor O.K. Li
R. Socher
107
798
0
07 Nov 2017
Convolutional Normalizing Flows
Guoqing Zheng
Yiming Yang
J. Carbonell
BDL
72
11
0
07 Nov 2017
Wider and Deeper, Cheaper and Faster: Tensorized LSTMs for Sequence Learning
Zhen He
Shaobing Gao
Liang Xiao
Daxue Liu
Hangen He
David Barber
AIMat
92
65
0
05 Nov 2017
Learning Filterbanks from Raw Speech for Phone Recognition
Neil Zeghidour
Nicolas Usunier
Iasonas Kokkinos
Thomas Schatz
Gabriel Synnaeve
Emmanuel Dupoux
76
120
0
03 Nov 2017
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL SSL OCL
259
5,093
0
02 Nov 2017
Hi, how can I help you?: Automating enterprise IT support help desks
Senthil Mani
Neelamadhav Gantayat
Rahul Aralikatte
Monika Gupta
Sampath Dechu
A. Sankaran
Shreya Khare
Barry Mitchell
H. Subramanian
Hema Venkatarangan
47
13
0
02 Nov 2017
TasNet: time-domain audio separation network for real-time, single-channel speech separation
Yi Luo
N. Mesgarani
111
633
0
01 Nov 2017
Uncovering Latent Style Factors for Expressive Speech Synthesis
Yuxuan Wang
RJ Skerry-Ryan
Y. Xiao
Daisy Stanton
Joel Shor
Eric Battenberg
R. Clark
Rif A. Saurous
63
53
0
01 Nov 2017
Melody Generation for Pop Music via Word Representation of Musical Properties
Andrew Shin
Léopold Crestel
Hiroharu Kato
Kuniaki Saito
Katsunori Ohnishi
Masataka Yamaguchi
Masahiro Nakawaki
Yoshitaka Ushiku
Tatsuya Harada
MGen
72
12
0
31 Oct 2017
Audio style transfer
Eric Grinstein
Ngoc Q. K. Duong
A. Ozerov
P. Pérez
71
68
0
31 Oct 2017
Onsets and Frames: Dual-Objective Piano Transcription
Curtis Hawthorne
Erich Elsen
Jialin Song
Adam Roberts
Ian Simon
Colin Raffel
Jesse Engel
Sageev Oore
Douglas Eck
204
281
0
30 Oct 2017
JSUT corpus: free large-scale Japanese speech corpus for end-to-end speech synthesis
Ryosuke Sonobe
Shinnosuke Takamichi
Hiroshi Saruwatari
3DV
105
140
0
28 Oct 2017
Multi-level Residual Networks from Dynamical Systems View
B. Chang
Lili Meng
E. Haber
Frederick Tung
David Begert
96
172
0
27 Oct 2017
Progressive Growing of GANs for Improved Quality, Stability, and Variation
Tero Karras
Timo Aila
S. Laine
J. Lehtinen
GAN
233
7,395
0
27 Oct 2017
Malware Detection by Eating a Whole EXE
Edward Raff
Jon Barker
Jared Sylvester
Robert Brandon
Bryan Catanzaro
Charles K. Nicholas
89
549
0
25 Oct 2017
Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention
Hideyuki Tachibana
Katsuya Uenoyama
Shunsuke Aihara
76
267
0
24 Oct 2017
Listening to the World Improves Speech Command Recognition
B. McMahan
D. Rao
102
38
0
23 Oct 2017
Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning
Wei Ping
Kainan Peng
Andrew Gibiansky
Sercan O. Arik
Ajay Kannan
Sharan Narang
Jonathan Raiman
John Miller
102
309
0
20 Oct 2017