ResearchTrend.AI
WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
Semi-Recurrent CNN-based VAE-GAN for Sequential Data Generation
Mohammad Akbari
Jie Liang
GAN
75
20
0
01 Jun 2018
Backpropagation for Implicit Spectral Densities
Aditya A. Ramesh
Yann LeCun
55
10
0
01 Jun 2018
Inverting Supervised Representations with Autoregressive Neural Density Models
C. Nash
Nate Kushman
Christopher K. I. Williams
DRL
64
25
0
01 Jun 2018
Mining gold from implicit models to improve likelihood-free inference
Johann Brehmer
Gilles Louppe
J. Pavez
Kyle Cranmer
AI4CE, TPM
188
181
0
30 May 2018
Theory and Experiments on Vector Quantized Autoencoders
Aurko Roy
Ashish Vaswani
Arvind Neelakantan
Niki Parmar
91
88
0
28 May 2018
Lipschitz regularity of deep neural networks: analysis and efficient estimation
Kevin Scaman
Aladin Virmaux
158
533
0
28 May 2018
Real-valued parametric conditioning of an RNN for interactive sound synthesis
L. Wyse
38
9
0
28 May 2018
Stable Recurrent Models
John Miller
Moritz Hardt
83
119
0
25 May 2018
ASR-based Features for Emotion Recognition: A Transfer Learning Approach
Noé Tits
Kevin El Haddad
Thierry Dutoit
65
28
0
23 May 2018
CNN+CNN: Convolutional Decoders for Image Captioning
Qingzhong Wang
Antoni B. Chan
VLM
73
86
0
23 May 2018
Generative timbre spaces: regularizing variational auto-encoders with perceptual metrics
P. Esling
Axel Chemla-Romeu-Santos
Adrien Bitton
52
32
0
22 May 2018
Meta-learning with differentiable closed-form solvers
Luca Bertinetto
João F. Henriques
Philip Torr
Andrea Vedaldi
ODL
123
932
0
21 May 2018
A Universal Music Translation Network
Noam Mor
Lior Wolf
Adam Polyak
Yaniv Taigman
89
110
0
21 May 2018
An Evaluation of Trajectory Prediction Approaches and Notes on the TrajNet Benchmark
S. Becker
Ronny Hug
Wolfgang Hubner
Michael Arens
108
71
0
19 May 2018
The global optimum of shallow neural network is attained by ridgelet transform
Sho Sonoda
Isao Ishikawa
Masahiro Ikeda
Kei Hagihara
Y. Sawano
Takuo Matsubara
Noboru Murata
35
1
0
19 May 2018
Number Sequence Prediction Problems for Evaluating Computational Powers of Neural Networks
Hyoungwook Nam
Segwang Kim
Kyomin Jung
AIMat
66
15
0
19 May 2018
Sequential Neural Likelihood: Fast Likelihood-free Inference with Autoregressive Flows
George Papamakarios
D. Sterratt
Iain Murray
BDL
552
370
0
18 May 2018
QuaterNet: A Quaternion-based Recurrent Model for Human Motion
Dario Pavllo
David Grangier
Michael Auli
3DH
76
263
0
16 May 2018
Towards a universal neural network encoder for time series
Joan Serrà
Santiago Pascual
Alexandros Karatzoglou
AI4TS
84
123
0
10 May 2018
Intracranial Error Detection via Deep Learning
M. Völker
Jiří Hammer
R. Schirrmeister
Joos Behncke
L. Fiederer
A. Schulze-Bonhage
Petr Marusič
Wolfram Burgard
T. Ball
65
10
0
04 May 2018
Randomly weighted CNNs for (music) audio classification
Jordi Pons
Xavier Serra
79
86
0
01 May 2018
Collapsed speech segment detection and suppression for WaveNet vocoder
Yi-Chiao Wu
Kazuhiro Kobayashi
Tomoki Hayashi
Patrick Lumban Tobing
Tomoki Toda
65
25
0
30 Apr 2018
Automatic Documentation of ICD Codes with Far-Field Speech Recognition
Albert Haque
Corinna Fukushima
21
0
0
30 Apr 2018
Deep Speech Denoising with Vector Space Projections
Jeff Hetherly
Paul Gamble
M. Barrios
Cory Stephenson
Karl S. Ni
32
0
0
27 Apr 2018
Detection of Glottal Closure Instants from Raw Speech using Convolutional Neural Networks
Mohit Goyal
Varun Srivastava
Prathosh A. P.
40
2
0
26 Apr 2018
JUNIPR: a Framework for Unsupervised Machine Learning in Particle Physics
Anders Andreassen
Ilya Feige
Christopher Frye
M. Schwartz
MU
100
137
0
25 Apr 2018
Speaker-independent raw waveform model for glottal excitation
Lauri Juvela
Vassilis Tsiaras
Bajibabu Bollepalli
Manu Airaksinen
Junichi Yamagishi
P. Alku
54
39
0
25 Apr 2018
A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment
Tomi Kinnunen
Jaime Lorenzo-Trueba
Junichi Yamagishi
Tomoki Toda
Daisuke Saito
F. Villavicencio
Zhenhua Ling
51
28
0
23 Apr 2018
Deep Layered Learning in MIR
Anders Elowsson
46
4
0
18 Apr 2018
The unreasonable effectiveness of the forget gate
J. Westhuizen
Joan Lasenby
77
89
0
13 Apr 2018
Blood Vessel Geometry Synthesis using Generative Adversarial Networks
J. Wolterink
T. Leiner
Ivana Isgum
GAN, MedIm
43
25
0
12 Apr 2018
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods
Jaime Lorenzo-Trueba
Junichi Yamagishi
Tomoki Toda
Daisuke Saito
F. Villavicencio
Tomi Kinnunen
Zhenhua Ling
69
321
0
12 Apr 2018
Understanding disentangling in $β$-VAE
Christopher P. Burgess
I. Higgins
Arka Pal
Loic Matthey
Nicholas Watters
Guillaume Desjardins
Alexander Lerchner
CoGe, DRL
73
832
0
10 Apr 2018
A comparison of recent waveform generation and acoustic modeling methods for neural-network-based speech synthesis
Xin Wang
Jaime Lorenzo-Trueba
Shinji Takaki
Lauri Juvela
Junichi Yamagishi
70
67
0
07 Apr 2018
Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder
K. Akuzawa
Yusuke Iwasawa
Y. Matsuo
87
139
0
06 Apr 2018
Structured Disentangled Representations
Babak Esmaeili
Hao Wu
Sarthak Jain
Alican Bozkurt
N. Siddharth
Brooks Paige
Dana H. Brooks
Jennifer Dy
Jan-Willem van de Meent
OOD, CML, BDL, DRL
88
169
0
06 Apr 2018
Fine-grained Video Attractiveness Prediction Using Multimodal Deep Learning on a Large Real-world Dataset
Xinpeng Chen
Jingyuan Chen
Lin Ma
Jian Yao
Wen Liu
Jiebo Luo
Tong Zhang
AI4TS, VGen
37
20
0
04 Apr 2018
Music Genre Classification using Machine Learning Techniques
Hareesh Bahuleyan
VLM
34
104
0
03 Apr 2018
Neural Autoregressive Flows
Chin-Wei Huang
David M. Krueger
Alexandre Lacoste
Aaron Courville
DRL, AI4CE
161
447
0
03 Apr 2018
Conditional End-to-End Audio Transforms
Albert Haque
Michelle Guo
Prateek Verma
114
41
0
30 Mar 2018
Parallel Grid Pooling for Data Augmentation
Akito Takeki
Daiki Ikami
Go Irie
Kiyoharu Aizawa
56
7
0
30 Mar 2018
Detecting Alzheimer's Disease Using Gated Convolutional Neural Network from Audio Data
Tifani Warnita
Nakamasa Inoue
Koichi Shinoda
36
40
0
30 Mar 2018
Machine Speech Chain with One-shot Speaker Adaptation
Andros Tjandra
S. Sakti
Satoshi Nakamura
71
56
0
28 Mar 2018
World Models
David R Ha
Jürgen Schmidhuber
SyDa
202
1,102
0
27 Mar 2018
Complex-Valued Restricted Boltzmann Machine for Direct Speech Parameterization from Complex Spectra
Toru Nakashika
Shinji Takaki
Junichi Yamagishi
13
1
0
27 Mar 2018
MOrdReD: Memory-based Ordinal Regression Deep Neural Networks for Time Series Forecasting
Bernardo Pérez Orozco
G. Abbati
Stephen J. Roberts
OOD, AI4TS
38
14
0
26 Mar 2018
HAMLET: Interpretable Human And Machine co-LEarning Technique
Olivier Deiss
Siddharth Biswal
Jing Jin
Haoqi Sun
M. P. M. Brandon Westover
Jimeng Sun
60
11
0
26 Mar 2018
Calibrated Prediction Intervals for Neural Network Regressors
Gil Keren
N. Cummins
Björn Schuller
UQCV
87
31
0
26 Mar 2018
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Yuxuan Wang
Daisy Stanton
Yu Zhang
RJ Skerry-Ryan
Eric Battenberg
Joel Shor
Y. Xiao
Fei Ren
Ye Jia
Rif A. Saurous
68
827
0
23 Mar 2018
Generalization Challenges for Neural Architectures in Audio Source Separation
Shariq Mobin
Brian Cheung
Bruno A. Olshausen
DRL
46
2
0
23 Mar 2018