Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1612.07837
Cited By
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
22 December 2016
Soroush Mehri
Kundan Kumar
Ishaan Gulrajani
Rithesh Kumar
Shubham Jain
Jose M. R. Sotelo
Aaron Courville
Yoshua Bengio
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SampleRNN: An Unconditional End-to-End Neural Audio Generation Model"
50 / 274 papers shown
Title
Improving Opus Low Bit Rate Quality with Neural Speech Synthesis
Jan Skoglund
J. Valin
37
38
0
12 May 2019
Deep Learning for Audio Signal Processing
Hendrik Purwins
Bo-wen Li
Tuomas Virtanen
Jan Schlüter
Shuo-yiin Chang
Tara N. Sainath
VLM
24
586
0
30 Apr 2019
Neural source-filter waveform models for statistical parametric speech synthesis
Xin Wang
Shinji Takaki
Junichi Yamagishi
31
117
0
27 Apr 2019
The Zero Resource Speech Challenge 2019: TTS without T
Ewan Dunbar
Robin Algayres
Julien Karadayi
Mathieu Bernard
Juan Benjumea
...
Lucas Ondel
A. Black
Laurent Besacier
S. Sakti
Emmanuel Dupoux
17
116
0
25 Apr 2019
Generating Long Sequences with Sparse Transformers
R. Child
Scott Gray
Alec Radford
Ilya Sutskever
16
1,848
0
23 Apr 2019
Singing voice synthesis based on convolutional neural networks
Kazuhiro Nakamura
Kei Hashimoto
Keiichiro Oura
Yoshihiko Nankaku
K. Tokuda
18
33
0
15 Apr 2019
RNN-based speech synthesis using a continuous sinusoidal model
M. S. Al-Radhi
T. Csapó
Géza Németh
12
4
0
12 Apr 2019
End-to-end Binaural Sound Localisation from the Raw Waveform
Paolo Vecchiotti
Ning Ma
S. Squartini
Guy J. Brown
11
59
0
03 Apr 2019
Training a Neural Speech Waveform Model using Spectral Losses of Short-Time Fourier Transform and Continuous Wavelet Transform
Shinji Takaki
Hirokazu Kameoka
Junichi Yamagishi
6
2
0
29 Mar 2019
A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet
J. Valin
Jan Skoglund
24
78
0
28 Mar 2019
Bandwidth Extension on Raw Audio via Generative Adversarial Networks
S. Kim
V. Sathe
GAN
8
26
0
21 Mar 2019
GANSynth: Adversarial Neural Audio Synthesis
Jesse Engel
Kumar Krishna Agrawal
Shuo Chen
Ishaan Gulrajani
Chris Donahue
Adam Roberts
46
385
0
23 Feb 2019
Capacity allocation through neural network layers
Jonathan Donier
14
3
0
22 Feb 2019
Capacity allocation analysis of neural networks: A tool for principled architecture design
Jonathan Donier
22
4
0
12 Feb 2019
Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning
Frederik Benzing
M. Gauy
Asier Mujika
A. Martinsson
Angelika Steger
23
22
0
11 Feb 2019
Adversarial Generation of Time-Frequency Features with application in audio synthesis
Andrés Marafioti
Nicki Holighaus
Nathanael Perraudin
P. Majdak
17
68
0
11 Feb 2019
Classical Music Generation in Distinct Dastgahs with AlimNet ACGAN
Saber Malekzadeh
Maryam Samami
Shahla Rezazadeh Azar
Maryam Rayegan
GAN
MGen
22
3
0
15 Jan 2019
Introduction to Voice Presentation Attack Detection and Recent Advances
Md. Sahidullah
Héctor Delgado
Massimiliano Todisco
Tomi Kinnunen
Nicholas W. D. Evans
Junichi Yamagishi
Kong-Aik Lee
AAML
8
75
0
04 Jan 2019
Interpretable Convolutional Filters with SincNet
Mirco Ravanelli
Yoshua Bengio
21
104
0
23 Nov 2018
TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer
Sicong Huang
Qiyang Li
Cem Anil
Xuchan Bao
Sageev Oore
Roger C. Grosse
19
97
0
22 Nov 2018
Representation Mixing for TTS Synthesis
Kyle Kastner
J. F. Santos
Yoshua Bengio
Aaron Courville
11
43
0
17 Nov 2018
Generating Black Metal and Math Rock: Beyond Bach, Beethoven, and Beatles
Zack Zukowski
CJ Carr
11
18
0
16 Nov 2018
Generating Albums with SampleRNN to Imitate Metal, Rock, and Punk Bands
CJ Carr
Zack Zukowski
MGen
8
20
0
16 Nov 2018
Comprehensive evaluation of statistical speech waveform synthesis
Thomas Merritt
Bartosz Putrycz
Adam Nadolski
Tianjun Ye
Daniel Korzekwa
...
Alexis Moinet
A. Breen
Rafal Kuklinski
N. Strom
Roberto Barra-Chicote
14
17
0
15 Nov 2018
Speaker-adaptive neural vocoders for parametric speech synthesis systems
Eunwoo Song
Xiang Yu
Erik Cambria
Jagath Rajapakse
6
3
0
08 Nov 2018
High-quality speech coding with SampleRNN
Adam Conkey
Per Hedelin
Cong Zhou
Tucker Hermans
Lars Villemoes
11
59
0
07 Nov 2018
Modeling Melodic Feature Dependency with Modularized Variational Auto-Encoder
Yu-An Wang
Yu-Kai Huang
Tzu-Chuan Lin
Shang-Yu Su
Yun-Nung (Vivian) Chen
14
3
0
31 Oct 2018
Audio inpainting of music by means of neural networks
Andrés Marafioti
Nicki Holighaus
P. Majdak
Nathanael Perraudin
16
18
0
29 Oct 2018
LPCNet: Improving Neural Speech Synthesis Through Linear Prediction
J. Valin
Jan Skoglund
6
448
0
28 Oct 2018
SING: Symbol-to-Instrument Neural Generator
Alexandre Défossez
Neil Zeghidour
Nicolas Usunier
Léon Bottou
Francis R. Bach
13
59
0
23 Oct 2018
Modulated Variational auto-Encoders for many-to-many musical timbre transfer
Adrien Bitton
P. Esling
Axel Chemla-Romeu-Santos
20
25
0
29 Sep 2018
MIDI-VAE: Modeling Dynamics and Instrumentation of Music with Applications to Style Transfer
Gino Brunner
Andres Konrad
Yuyi Wang
Roger Wattenhofer
33
133
0
20 Sep 2018
Neural Speech Synthesis with Transformer Network
Naihan Li
Shujie Liu
Yanqing Liu
Sheng Zhao
Ming-Yu Liu
M. Zhou
16
102
0
19 Sep 2018
Voice Conversion with Conditional SampleRNN
Cong Zhou
Michael Horgan
Vivek Kumar
Cristina Vasco
Dan Darcy
15
20
0
24 Aug 2018
Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis
G. Henter
Jaime Lorenzo-Trueba
Xin Wang
Junichi Yamagishi
DRL
SSL
13
61
0
30 Jul 2018
Speaker Recognition from Raw Waveform with SincNet
Mirco Ravanelli
Yoshua Bengio
44
698
0
29 Jul 2018
ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech
Ming-Yu Liu
Kainan Peng
Jitong Chen
12
342
0
19 Jul 2018
The challenge of realistic music generation: modelling raw audio at scale
Sander Dieleman
Aaron van den Oord
Karen Simonyan
21
184
0
26 Jun 2018
Sounderfeit: Cloning a Physical Model using a Conditional Adversarial Autoencoder
Stephen Sinclair
GAN
13
1
0
25 Jun 2018
Voice Imitating Text-to-Speech Neural Networks
Younggun Lee
Taesu Kim
Soo-Young Lee
24
11
0
04 Jun 2018
Real-valued parametric conditioning of an RNN for interactive sound synthesis
L. Wyse
12
9
0
28 May 2018
Generative timbre spaces: regularizing variational auto-encoders with perceptual metrics
P. Esling
Axel Chemla-Romeu-Santos
Adrien Bitton
12
32
0
22 May 2018
Collapsed speech segment detection and suppression for WaveNet vocoder
Yi-Chiao Wu
Kazuhiro Kobayashi
Tomoki Hayashi
Patrick Lumban Tobing
T. Toda
7
25
0
30 Apr 2018
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods
Jaime Lorenzo-Trueba
Junichi Yamagishi
T. Toda
Daisuke Saito
F. Villavicencio
Tomi Kinnunen
Zhenhua Ling
14
318
0
12 Apr 2018
Efficient Neural Audio Synthesis
Nal Kalchbrenner
Erich Elsen
Karen Simonyan
Seb Noury
Norman Casagrande
Edward Lockhart
Florian Stimberg
Aaron van den Oord
Sander Dieleman
Koray Kavukcuoglu
21
863
0
23 Feb 2018
Neural Voice Cloning with a Few Samples
Sercan Ö. Arik
Jitong Chen
Kainan Peng
Ming-Yu Liu
Yanqi Zhou
17
381
0
14 Feb 2018
Adversarial Audio Synthesis
Chris Donahue
Julian McAuley
M. Puckette
GAN
33
602
0
12 Feb 2018
Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension
Zhenhua Ling
Yang Ai
Yu Gu
Lirong Dai
16
61
0
24 Jan 2018
Attacking Speaker Recognition With Deep Generative Models
Wilson Cai
Anish Doshi
Rafael Valle
GAN
10
22
0
08 Jan 2018
Towards a Deep Improviser: a prototype deep learning post-tonal free music generator
R. Dean
J. Forth
24
10
0
21 Dec 2017
Previous
1
2
3
4
5
6
Next