ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1609.03499
  4. Cited By
WaveNet: A Generative Model for Raw Audio
v1v2 (latest)

WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM
ArXiv (abs)PDFHTML

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
Title
Dilated Convolution with Dilated GRU for Music Source Separation
Dilated Convolution with Dilated GRU for Music Source Separation
Jen-Yu Liu
Yi-Hsuan Yang
72
41
0
04 Jun 2019
MelNet: A Generative Model for Audio in the Frequency Domain
MelNet: A Generative Model for Audio in the Frequency Domain
Sean Vasquez
M. Lewis
DiffM
85
132
0
04 Jun 2019
Blow: a single-scale hyperconditioned flow for non-parallel raw-audio
  voice conversion
Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion
Joan Serrà
Santiago Pascual
Carlos Segura
CVBM
76
85
0
03 Jun 2019
Problem-Agnostic Speech Embeddings for Multi-Speaker Text-to-Speech with
  SampleRNN
Problem-Agnostic Speech Embeddings for Multi-Speaker Text-to-Speech with SampleRNN
David Álvarez
Santiago Pascual
Antonio Bonafonte
73
12
0
03 Jun 2019
Robust Sequence-to-Sequence Acoustic Modeling with Stepwise Monotonic
  Attention for Neural TTS
Robust Sequence-to-Sequence Acoustic Modeling with Stepwise Monotonic Attention for Neural TTS
Mutian He
Yan Deng
Lei He
97
81
0
03 Jun 2019
Generating Diverse High-Fidelity Images with VQ-VAE-2
Generating Diverse High-Fidelity Images with VQ-VAE-2
Ali Razavi
Aaron van den Oord
Oriol Vinyals
DRLBDL
209
1,832
0
02 Jun 2019
Graph WaveNet for Deep Spatial-Temporal Graph Modeling
Graph WaveNet for Deep Spatial-Temporal Graph Modeling
Zonghan Wu
Shirui Pan
Guodong Long
Jing Jiang
Chengqi Zhang
GNNAI4TS
108
2,208
0
31 May 2019
Self-Referencing Embedded Strings (SELFIES): A 100% robust molecular
  string representation
Self-Referencing Embedded Strings (SELFIES): A 100% robust molecular string representation
Mario Krenn
Florian Hase
AkshatKumar Nigam
Pascal Friederich
Alán Aspuru-Guzik
116
71
0
31 May 2019
Learning Sparse Networks Using Targeted Dropout
Learning Sparse Networks Using Targeted Dropout
Aidan Gomez
Ivan Zhang
Siddhartha Rao Kamalakara
Divyam Madaan
Kevin Swersky
Y. Gal
Geoffrey E. Hinton
112
98
0
31 May 2019
Unsupervised Model Selection for Variational Disentangled Representation
  Learning
Unsupervised Model Selection for Variational Disentangled Representation Learning
Sunny Duan
Loic Matthey
Andre Saraiva
Nicholas Watters
Christopher P. Burgess
Alexander Lerchner
I. Higgins
OODDRL
96
80
0
29 May 2019
Global Guarantees for Blind Demodulation with Generative Priors
Global Guarantees for Blind Demodulation with Generative Priors
Paul Hand
Babhru Joshi
129
33
0
29 May 2019
Rethinking Full Connectivity in Recurrent Neural Networks
Rethinking Full Connectivity in Recurrent Neural Networks
Matthijs Van Keirsbilck
A. Keller
Xiaodong Yang
LRM
41
14
0
29 May 2019
Complex-valued neural networks for machine learning on non-stationary
  physical data
Complex-valued neural networks for machine learning on non-stationary physical data
Jesper Sören Dramsch
M. Lüthje
Anders Christensen
80
36
0
29 May 2019
SignalTrain: Profiling Audio Compressors with Deep Neural Networks
SignalTrain: Profiling Audio Compressors with Deep Neural Networks
Scott H. Hawley
Benjamin Colburn
S. I. Mimilakis
42
12
0
28 May 2019
Learning distant cause and effect using only local and immediate credit
  assignment
Learning distant cause and effect using only local and immediate credit assignment
D. Rawlinson
Abdelrahman Ahmed
Gideon Kowadlo
45
3
0
28 May 2019
Validation of Approximate Likelihood and Emulator Models for
  Computationally Intensive Simulations
Validation of Approximate Likelihood and Emulator Models for Computationally Intensive Simulations
Niccolò Dalmasso
Ann B. Lee
Rafael Izbicki
T. Pospisil
Ilmun Kim
Chieh-An Lin
78
8
0
27 May 2019
VQVAE Unsupervised Unit Discovery and Multi-scale Code2Spec Inverter for
  Zerospeech Challenge 2019
VQVAE Unsupervised Unit Discovery and Multi-scale Code2Spec Inverter for Zerospeech Challenge 2019
Andros Tjandra
Berrak Sisman
Mingyang Zhang
S. Sakti
Haizhou Li
Satoshi Nakamura
85
72
0
27 May 2019
Learning by stochastic serializations
Learning by stochastic serializations
Pablo Strasser
S. Armand
Stéphane Marchand-Maillet
Alexandros Kalousis
31
0
0
27 May 2019
ViterbiNet: A Deep Learning Based Viterbi Algorithm for Symbol Detection
ViterbiNet: A Deep Learning Based Viterbi Algorithm for Symbol Detection
Nir Shlezinger
Nariman Farsad
Yonina C. Eldar
Andrea J. Goldsmith
91
145
0
26 May 2019
Learning to Reason in Large Theories without Imitation
Learning to Reason in Large Theories without Imitation
Kshitij Bansal
Christian Szegedy
M. Rabe
Sarah M. Loos
Viktor Toman
NAILRM
107
42
0
25 May 2019
Fast computation of loudness using a deep neural network
Fast computation of loudness using a deep neural network
Josef Schlittenlacher
Richard Turner
B. Moore
14
2
0
24 May 2019
Zero-shot Knowledge Transfer via Adversarial Belief Matching
Zero-shot Knowledge Transfer via Adversarial Belief Matching
P. Micaelli
Amos Storkey
82
230
0
23 May 2019
Quantifying Long Range Dependence in Language and User Behavior to
  improve RNNs
Quantifying Long Range Dependence in Language and User Behavior to improve RNNs
Francois Belletti
Minmin Chen
Ed H. Chi
AI4TS
45
23
0
23 May 2019
Compression with Flows via Local Bits-Back Coding
Compression with Flows via Local Bits-Back Coding
Jonathan Ho
Evan Lohn
Pieter Abbeel
103
54
0
21 May 2019
Effective parameter estimation methods for an ExcitNet model in
  generative text-to-speech systems
Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems
Ohsung Kwon
Eunwoo Song
Jae-Min Kim
Hong-Goo Kang
54
4
0
21 May 2019
Non-Autoregressive Neural Text-to-Speech
Non-Autoregressive Neural Text-to-Speech
Kainan Peng
Ming-Yu Liu
Z. Song
Kexin Zhao
101
40
0
21 May 2019
CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven
  Dynamic Hierarchical Conditional Variational Network
CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network
V. Wan
Chun-an Chan
Tom Kenter
Jakub Vít
R. Clark
71
75
0
17 May 2019
Bit-Swap: Recursive Bits-Back Coding for Lossless Compression with
  Hierarchical Latent Variables
Bit-Swap: Recursive Bits-Back Coding for Lossless Compression with Hierarchical Latent Variables
F. Kingma
Pieter Abbeel
Jonathan Ho
106
98
0
16 May 2019
MoGlow: Probabilistic and controllable motion synthesis using
  normalising flows
MoGlow: Probabilistic and controllable motion synthesis using normalising flows
G. Henter
Simon Alexanderson
Jonas Beskow
94
98
0
16 May 2019
AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Kaizhi Qian
Yang Zhang
Shiyu Chang
Xuesong Yang
M. Hasegawa-Johnson
170
471
0
14 May 2019
Almost Unsupervised Text to Speech and Automatic Speech Recognition
Almost Unsupervised Text to Speech and Automatic Speech Recognition
Yi Ren
Xu Tan
Tao Qin
Sheng Zhao
Zhou Zhao
Tie-Yan Liu
95
102
0
13 May 2019
Improving Opus Low Bit Rate Quality with Neural Speech Synthesis
Improving Opus Low Bit Rate Quality with Neural Speech Synthesis
Jan Skoglund
J. Valin
88
38
0
12 May 2019
Machine learning in acoustics: theory and applications
Machine learning in acoustics: theory and applications
Michael J. Bianco
Peter Gerstoft
James Traer
Emma Ozanich
M. Roch
Sharon Gannot
Charles-Alban Deledalle
AI4CE
89
391
0
11 May 2019
FastDraw: Addressing the Long Tail of Lane Detection by Adapting a
  Sequential Prediction Network
FastDraw: Addressing the Long Tail of Lane Detection by Adapting a Sequential Prediction Network
Jonah Philion
137
159
0
10 May 2019
Deep Unsupervised Cardinality Estimation
Deep Unsupervised Cardinality Estimation
Zongheng Yang
Eric Liang
Amog Kamsetty
Chenggang Wu
Yan Duan
Peter Chen
Pieter Abbeel
J. M. Hellerstein
S. Krishnan
Ion Stoica
96
208
0
10 May 2019
Semi-supervised and Population Based Training for Voice Commands
  Recognition
Semi-supervised and Population Based Training for Voice Commands Recognition
Oguz H. Elibol
Gokce Keskin
Anil Thomas
26
2
0
10 May 2019
AI in the media and creative industries
AI in the media and creative industries
Giuseppe Amato
Malte Behrmann
Frédéric Bimbot
Baptiste Caramiaux
Fabrizio Falchi
...
Andrew Perkis
R. Redondo
Enrico Turrin
T. Viéville
Emmanuel Vincent
66
43
0
10 May 2019
Universal Adversarial Perturbations for Speech Recognition Systems
Universal Adversarial Perturbations for Speech Recognition Systems
Paarth Neekhara
Shehzeen Samarah Hussain
Prakhar Pandey
Shlomo Dubnov
Julian McAuley
F. Koushanfar
AAML
82
118
0
09 May 2019
Think Globally, Act Locally: A Deep Neural Network Approach to
  High-Dimensional Time Series Forecasting
Think Globally, Act Locally: A Deep Neural Network Approach to High-Dimensional Time Series Forecasting
Rajat Sen
Hsiang-Fu Yu
Inderjit Dhillon
AI4TS
140
366
0
09 May 2019
Generative Model with Dynamic Linear Flow
Generative Model with Dynamic Linear Flow
Huadong Liao
Jiawei He
Kun-xian Shu
DRL
49
5
0
08 May 2019
Capture, Learning, and Synthesis of 3D Speaking Styles
Capture, Learning, and Synthesis of 3D Speaking Styles
Daniel Cudeiro
Timo Bolkart
Cassidy Laidlaw
Anurag Ranjan
Michael J. Black
CVBM3DH
108
344
0
08 May 2019
Generalized Dilation Neural Networks
Generalized Dilation Neural Networks
Gavneet Singh Chadha
Jan Niclas Reimann
Andreas Schwung
MLTAI4TSMedIm
21
0
0
08 May 2019
Neural Architecture Refinement: A Practical Way for Avoiding Overfitting
  in NAS
Neural Architecture Refinement: A Practical Way for Avoiding Overfitting in NAS
Yangzhou Jiang
Cong Zhao
Zeyang Dou
Lei Pang
59
5
0
07 May 2019
Multivariate Time Series Classification using Dilated Convolutional
  Neural Network
Multivariate Time Series Classification using Dilated Convolutional Neural Network
Omolbanin Yazdanbakhsh
S. Dick
57
33
0
05 May 2019
Temporal Graph Convolutional Networks for Automatic Seizure Detection
Temporal Graph Convolutional Networks for Automatic Seizure Detection
Ian Covert
B. Krishnan
I. Najm
Jiening Zhan
Matthew Shore
J. Hixson
M. Po
60
71
0
03 May 2019
High quality, lightweight and adaptable TTS using LPCNet
High quality, lightweight and adaptable TTS using LPCNet
Zvi Kons
Slava Shechtman
A. Sorin
Carmel Rabinovitz
R. Hoory
69
54
0
02 May 2019
Deep Learning for Audio Signal Processing
Deep Learning for Audio Signal Processing
Hendrik Purwins
Yue Liu
Tuomas Virtanen
Jan Schlüter
Shuo-yiin Chang
Tara N. Sainath
VLM
119
598
0
30 Apr 2019
PYRO-NN: Python Reconstruction Operators in Neural Networks
PYRO-NN: Python Reconstruction Operators in Neural Networks
Christopher Syben
Markus Michen
Bernhard Stimpel
Stephan Seitz
Stefan B. Ploner
Andreas Maier
AI4CE
51
62
0
30 Apr 2019
Curriculum Learning in Deep Neural Networks for Financial Forecasting
Curriculum Learning in Deep Neural Networks for Financial Forecasting
Allison Koenecke
Amita Gajewar
AI4TS
37
16
0
29 Apr 2019
Neural source-filter waveform models for statistical parametric speech
  synthesis
Neural source-filter waveform models for statistical parametric speech synthesis
Xin Wang
Shinji Takaki
Junichi Yamagishi
97
118
0
27 Apr 2019
Previous
123...484950...606162
Next