ResearchTrend.AI
WaveNet: A Generative Model for Raw Audio

12 September 2016
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
    DiffM
arXiv: 1609.03499

Papers citing "WaveNet: A Generative Model for Raw Audio"

50 / 3,082 papers shown
Generating High Fidelity Images with Subscale Pixel Networks and Multidimensional Upscaling
Jacob Menick
Nal Kalchbrenner
106
151
0
04 Dec 2018
Timeception for Complex Action Recognition
Noureldien Hussein
E. Gavves
A. Smeulders
147
215
0
04 Dec 2018
Pedestrian Detection with Autoregressive Network Phases
Garrick Brazil
Xiaoming Liu
88
72
0
02 Dec 2018
Cross-Modulation Networks for Few-Shot Learning
Hugo Prol
Vincent Dumoulin
Luis Herranz
71
15
0
01 Dec 2018
Effects of Loss Functions And Target Representations on Adversarial Robustness
Sean Saito
S. Roy
AAML
72
7
0
01 Dec 2018
SwishNet: A Fast Convolutional Neural Network for Speech, Music and Noise Classification and Segmentation
Md Shamim Hussain
M. A. Haque
42
48
0
01 Dec 2018
LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis
Min-Jae Hwang
Frank Soong
Fenglong Xie
Xi Wang
Hyeonjoo Kang
Hong-Goo Kang
53
21
0
29 Nov 2018
3D human pose estimation in video with temporal convolutions and semi-supervised training
Dario Pavllo
Christoph Feichtenhofer
David Grangier
Michael Auli
3DH
81
1,015
0
28 Nov 2018
Play as You Like: Timbre-enhanced Multi-modal Music Style Transfer
Chien-Yu Lu
Min-Xin Xue
Chia-Che Chang
Che-Rung Lee
Li Su
89
34
0
28 Nov 2018
UFANS: U-shaped Fully-Parallel Acoustic Neural Structure For Statistical Parametric Speech Synthesis With 20X Faster
Dabiao Ma
Zhiba Su
Yuhao Lu
Wenxuan Wang
Zhen Li
34
3
0
28 Nov 2018
Improved Speech Enhancement with the Wave-U-Net
Can Eren Sezener
Tillman Weyde
65
165
0
27 Nov 2018
Class-Distinct and Class-Mutual Image Generation with GANs
Takuhiro Kaneko
Yoshitaka Ushiku
Tatsuya Harada
100
9
0
27 Nov 2018
Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion
Wen-Chin Huang
Yi-Chiao Wu
Hsin-Te Hwang
Patrick Lumban Tobing
Tomoki Hayashi
Kazuhiro Kobayashi
Tomoki Toda
Yu Tsao
H. Wang
61
20
0
27 Nov 2018
Planning in Dynamic Environments with Conditional Autoregressive Models
Johanna Hansen
Kyle Kastner
Aaron Courville
Gregory Dudek
53
1
0
25 Nov 2018
An overview of deep learning in medical imaging focusing on MRI
A. Lundervold
A. Lundervold
OOD
112
1,654
0
25 Nov 2018
Interpretable Convolutional Filters with SincNet
Mirco Ravanelli
Yoshua Bengio
93
107
0
23 Nov 2018
TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer
Sicong Huang
Qiyang Li
Cem Anil
Xuchan Bao
Sageev Oore
Roger C. Grosse
92
98
0
22 Nov 2018
Sequential Neural Methods for Likelihood-free Inference
Conor Durkan
George Papamakarios
Iain Murray
BDL
187
25
0
21 Nov 2018
Measuring Depression Symptom Severity from Spoken Language and 3D Facial Expressions
Albert Haque
Michelle Guo
Adam S. Miner
Li Fei-Fei
46
112
0
21 Nov 2018
The Effect of Explicit Structure Encoding of Deep Neural Networks for Symbolic Music Generation
Kai Chen
Weilin Zhang
Shlomo Dubnov
Gus Xia
Wei Li
MGen
41
5
0
20 Nov 2018
Black-Box Autoregressive Density Estimation for State-Space Models
Tom Ryder
Andrew Golightly
A. Mcgough
D. Prangle
BDL
46
6
0
20 Nov 2018
Multi-scale aggregation of phase information for reducing computational cost of CNN based DOA estimation
Soumitro Chakrabarty
Emanuel Habets
45
6
0
20 Nov 2018
Improving Sequence-to-Sequence Acoustic Modeling by Adding Text-Supervision
Jing-Xuan Zhang
Zhenhua Ling
Yuan Jiang
Li-Juan Liu
Chen Liang
Lirong Dai
80
30
0
20 Nov 2018
Learning Robust Heterogeneous Signal Features from Parallel Neural Network for Audio Sentiment Analysis
Feiyang Chen
Ziqian Luo
59
19
0
20 Nov 2018
Coupled Recurrent Models for Polyphonic Music Composition
John Thickstun
Zaïd Harchaoui
Dean Phillips Foster
Sham Kakade
42
12
0
20 Nov 2018
Efficient keyword spotting using dilated convolutions and gating
A. Coucke
M. Chlieh
Thibault Gisselbrecht
David Leroy
Mathieu Poumeyrol
Thibaut Lavril
101
100
0
19 Nov 2018
Harmonic Recomposition using Conditional Autoregressive Modeling
Kyle Kastner
Rithesh Kumar
Tim Cooijmans
Aaron Courville
52
0
0
18 Nov 2018
Representation Mixing for TTS Synthesis
Kyle Kastner
J. F. Santos
Yoshua Bengio
Aaron Courville
55
43
0
17 Nov 2018
High Quality Prediction of Protein Q8 Secondary Structure by Diverse Neural Network Architectures
Iddo Drori
Isht Dwivedi
Pranav Shrestha
Jeffrey Wan
Yueqi Wang
...
Kaveri A. Thakoor
Chinmay Joshi
Sonam Goenka
C. Keasar
I. Pe’er
74
27
0
17 Nov 2018
Generating Black Metal and Math Rock: Beyond Bach, Beethoven, and Beatles
Zack Zukowski
CJ Carr
50
18
0
16 Nov 2018
Generating Albums with SampleRNN to Imitate Metal, Rock, and Punk Bands
CJ Carr
Zack Zukowski
MGen
35
20
0
16 Nov 2018
Learning to Predict the Cosmological Structure Formation
Siyu He
Yin Li
Yu Feng
S. Ho
Siamak Ravanbakhsh
Wei Chen
Barnabás Póczós
92
172
0
15 Nov 2018
Effect of data reduction on sequence-to-sequence neural TTS
Javier Latorre
Jakub Lachowicz
Jaime Lorenzo-Trueba
Thomas Merritt
Thomas Drugman
S. Ronanki
Klimkov Viacheslav
92
59
0
15 Nov 2018
Comprehensive evaluation of statistical speech waveform synthesis
Thomas Merritt
Bartosz Putrycz
Adam Nadolski
Tianjun Ye
Daniel Korzekwa
...
Alexis Moinet
A. Breen
Rafal Kuklinski
N. Strom
Roberto Barra-Chicote
51
18
0
15 Nov 2018
Towards achieving robust universal neural vocoding
Jaime Lorenzo-Trueba
Thomas Drugman
Javier Latorre
Thomas Merritt
Bartosz Putrycz
Roberto Barra-Chicote
Alexis Moinet
Vatsal Aggarwal
DRL
137
19
0
15 Nov 2018
Melodic Phrase Segmentation By Deep Neural Networks
Y. Guan
Jinyu Zhao
Yiqin Qiu
Zheng Zhang
Gus Xia
37
11
0
14 Nov 2018
Neural Wavetable: a playable wavetable synthesizer using neural networks
Lamtharn Hantrakul
Li-Chia Yang
41
3
0
13 Nov 2018
Hallucinating Point Cloud into 3D Sculptural Object
Chun-Liang Li
Eunsu Kang
Songwei Ge
Lingyao Zhang
Austin Dill
Manzil Zaheer
Barnabás Póczós
3DPC
52
2
0
13 Nov 2018
Agent Embeddings: A Latent Representation for Pole-Balancing Networks
Oscar Chang
Robert Kwiatkowski
Siyuan Chen
Hod Lipson
147
6
0
12 Nov 2018
PerformanceNet: Score-to-Audio Music Generation with Multi-Band Convolutional Residual Network
Bryan Wang
Yi-Hsuan Yang
71
38
0
11 Nov 2018
ExcitNet vocoder: A neural excitation model for parametric speech synthesis systems
Eunwoo Song
Kyungguen Byun
Hong-Goo Kang
75
29
0
09 Nov 2018
Mode matching in GANs through latent space learning and inversion
Chao Weng
AP Prathosh
Dong Yu
Varun Srivastava
S. Chaudhury
GAN
62
2
0
08 Nov 2018
Speaker-adaptive neural vocoders for parametric speech synthesis systems
Eunwoo Song
Xiang Yu
Erik Cambria
Jagath Rajapakse
49
3
0
08 Nov 2018
Learning Disentangled Representations for Timber and Pitch in Music Audio
Yun-Ning Hung
Yian Chen
Yi-Hsuan Yang
85
16
0
08 Nov 2018
Blockwise Parallel Decoding for Deep Autoregressive Models
Mitchell Stern
Noam M. Shazeer
Ashley J. Llorens
86
238
0
07 Nov 2018
High-quality speech coding with SampleRNN
Adam Conkey
Per Hedelin
Cong Zhou
Tucker Hermans
Lars Villemoes
71
59
0
07 Nov 2018
Reconstructing Speech Stimuli From Human Auditory Cortex Activity Using a WaveNet Approach
Ran Wang
Yao Wang
A. Flinker
29
7
0
06 Nov 2018
FloWaveNet : A Generative Flow for Raw Audio
Sungwon Kim
Sang-gil Lee
Jongyoon Song
Jaehyeon Kim
Sungroh Yoon
118
169
0
06 Nov 2018
ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion
Hirokazu Kameoka
Kou Tanaka
Damian Kwaśny
Takuhiro Kaneko
Nobukatsu Hojo
92
64
0
05 Nov 2018
Nonparallel Emotional Speech Conversion
Jian Gao
Deep Chakraborty
H. Tembine
Olaitan Olaleye
87
69
0
03 Nov 2018