SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

22 December 2016

Aaron Courville

Papers citing "SampleRNN: An Unconditional End-to-End Neural Audio Generation Model"

50 / 274 papers shown

Improving Opus Low Bit Rate Quality with Neural Speech Synthesis Jan Skoglund J. Valin 37 38 0 12 May 2019
Deep Learning for Audio Signal Processing Hendrik Purwins Bo Li Tuomas Virtanen Jan Schlüter Shuo-yiin Chang Tara N. Sainath VLM 24 586 0 30 Apr 2019
Neural source-filter waveform models for statistical parametric speech synthesis Xin Wang Shinji Takaki Junichi Yamagishi 31 117 0 27 Apr 2019
The Zero Resource Speech Challenge 2019: TTS without T Ewan Dunbar Robin Algayres Julien Karadayi Mathieu Bernard Juan Benjumea ... Lucas Ondel A. Black Laurent Besacier S. Sakti Emmanuel Dupoux 17 116 0 25 Apr 2019
Generating Long Sequences with Sparse Transformers R. Child Scott Gray Alec Radford Ilya Sutskever 16 1,848 0 23 Apr 2019
Singing voice synthesis based on convolutional neural networks Kazuhiro Nakamura Kei Hashimoto Keiichiro Oura Yoshihiko Nankaku K. Tokuda 18 33 0 15 Apr 2019
RNN-based speech synthesis using a continuous sinusoidal model M. S. Al-Radhi T. Csapó Géza Németh 12 4 0 12 Apr 2019
End-to-end Binaural Sound Localisation from the Raw Waveform Paolo Vecchiotti Ning Ma S. Squartini Guy J. Brown 11 59 0 03 Apr 2019
Training a Neural Speech Waveform Model using Spectral Losses of Short-Time Fourier Transform and Continuous Wavelet Transform Shinji Takaki Hirokazu Kameoka Junichi Yamagishi 6 2 0 29 Mar 2019
A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet J. Valin Jan Skoglund 24 78 0 28 Mar 2019
Bandwidth Extension on Raw Audio via Generative Adversarial Networks S. Kim V. Sathe GAN 8 26 0 21 Mar 2019
GANSynth: Adversarial Neural Audio Synthesis Jesse Engel Kumar Krishna Agrawal Shuo Chen Ishaan Gulrajani Chris Donahue Adam Roberts 46 385 0 23 Feb 2019
Capacity allocation through neural network layers Jonathan Donier 14 3 0 22 Feb 2019
Capacity allocation analysis of neural networks: A tool for principled architecture design Jonathan Donier 22 4 0 12 Feb 2019
Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning Frederik Benzing M. Gauy Asier Mujika A. Martinsson Angelika Steger 23 22 0 11 Feb 2019
Adversarial Generation of Time-Frequency Features with application in audio synthesis Andrés Marafioti Nicki Holighaus Nathanael Perraudin P. Majdak 17 68 0 11 Feb 2019
Classical Music Generation in Distinct Dastgahs with AlimNet ACGAN Saber Malekzadeh Maryam Samami Shahla Rezazadeh Azar Maryam Rayegan GAN MGen 22 3 0 15 Jan 2019
Introduction to Voice Presentation Attack Detection and Recent Advances Md. Sahidullah Héctor Delgado Massimiliano Todisco Tomi Kinnunen Nicholas W. D. Evans Junichi Yamagishi Kong-Aik Lee AAML 8 75 0 04 Jan 2019
Interpretable Convolutional Filters with SincNet Mirco Ravanelli Yoshua Bengio 21 104 0 23 Nov 2018
TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer Sicong Huang Qiyang Li Cem Anil Xuchan Bao Sageev Oore Roger C. Grosse 19 97 0 22 Nov 2018
Representation Mixing for TTS Synthesis Kyle Kastner J. F. Santos Yoshua Bengio Aaron Courville 11 43 0 17 Nov 2018
Generating Black Metal and Math Rock: Beyond Bach, Beethoven, and Beatles Zack Zukowski CJ Carr 11 18 0 16 Nov 2018
Generating Albums with SampleRNN to Imitate Metal, Rock, and Punk Bands CJ Carr Zack Zukowski MGen 8 20 0 16 Nov 2018
Comprehensive evaluation of statistical speech waveform synthesis Thomas Merritt Bartosz Putrycz Adam Nadolski Tianjun Ye Daniel Korzekwa ... Alexis Moinet A. Breen Rafal Kuklinski N. Strom Roberto Barra-Chicote 14 17 0 15 Nov 2018
Speaker-adaptive neural vocoders for parametric speech synthesis systems Eunwoo Song Jin-Seob Kim Kyungguen Byun Hong-Goo Kang 6 3 0 08 Nov 2018
High-quality speech coding with SampleRNN Janusz Klejsa Per Hedelin Cong Zhou Roy Fejgin Lars Villemoes 11 59 0 07 Nov 2018
Modeling Melodic Feature Dependency with Modularized Variational Auto-Encoder Yu-An Wang Yu-Kai Huang Tzu-Chuan Lin Shang-Yu Su Yun-Nung (Vivian) Chen 14 3 0 31 Oct 2018
Audio inpainting of music by means of neural networks Andrés Marafioti Nicki Holighaus P. Majdak Nathanael Perraudin 16 18 0 29 Oct 2018
LPCNet: Improving Neural Speech Synthesis Through Linear Prediction J. Valin Jan Skoglund 6 448 0 28 Oct 2018
SING: Symbol-to-Instrument Neural Generator Alexandre Défossez Neil Zeghidour Nicolas Usunier Léon Bottou Francis R. Bach 13 59 0 23 Oct 2018
Modulated Variational auto-Encoders for many-to-many musical timbre transfer Adrien Bitton P. Esling Axel Chemla-Romeu-Santos 20 25 0 29 Sep 2018
MIDI-VAE: Modeling Dynamics and Instrumentation of Music with Applications to Style Transfer Gino Brunner Andres Konrad Yuyi Wang Roger Wattenhofer 33 133 0 20 Sep 2018
Neural Speech Synthesis with Transformer Network Naihan Li Shujie Liu Yanqing Liu Sheng Zhao Ming Liu M. Zhou 16 102 0 19 Sep 2018
Voice Conversion with Conditional SampleRNN Cong Zhou Michael Horgan Vivek Kumar Cristina Vasco Dan Darcy 15 20 0 24 Aug 2018
Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis G. Henter Jaime Lorenzo-Trueba Xin Wang Junichi Yamagishi DRL SSL 13 61 0 30 Jul 2018
Speaker Recognition from Raw Waveform with SincNet Mirco Ravanelli Yoshua Bengio 44 698 0 29 Jul 2018
ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech Wei Ping Kainan Peng Jitong Chen 12 342 0 19 Jul 2018
The challenge of realistic music generation: modelling raw audio at scale Sander Dieleman Aaron van den Oord Karen Simonyan 21 184 0 26 Jun 2018
Sounderfeit: Cloning a Physical Model using a Conditional Adversarial Autoencoder Stephen Sinclair GAN 13 1 0 25 Jun 2018
Voice Imitating Text-to-Speech Neural Networks Younggun Lee Taesu Kim Soo-Young Lee 24 11 0 04 Jun 2018
Real-valued parametric conditioning of an RNN for interactive sound synthesis L. Wyse 12 9 0 28 May 2018
Generative timbre spaces: regularizing variational auto-encoders with perceptual metrics P. Esling Axel Chemla-Romeu-Santos Adrien Bitton 12 32 0 22 May 2018
Collapsed speech segment detection and suppression for WaveNet vocoder Yi-Chiao Wu Kazuhiro Kobayashi Tomoki Hayashi Patrick Lumban Tobing T. Toda 7 25 0 30 Apr 2018
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods Jaime Lorenzo-Trueba Junichi Yamagishi T. Toda Daisuke Saito F. Villavicencio Tomi Kinnunen Zhenhua Ling 14 318 0 12 Apr 2018
Efficient Neural Audio Synthesis Nal Kalchbrenner Erich Elsen Karen Simonyan Seb Noury Norman Casagrande Edward Lockhart Florian Stimberg Aaron van den Oord Sander Dieleman Koray Kavukcuoglu 21 863 0 23 Feb 2018
Neural Voice Cloning with a Few Samples Sercan Ö. Arik Jitong Chen Kainan Peng Wei Ping Yanqi Zhou 17 381 0 14 Feb 2018
Adversarial Audio Synthesis Chris Donahue Julian McAuley M. Puckette GAN 33 602 0 12 Feb 2018
Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension Zhenhua Ling Yang Ai Yu Gu Lirong Dai 16 61 0 24 Jan 2018
Attacking Speaker Recognition With Deep Generative Models Wilson Cai Anish Doshi Rafael Valle GAN 10 22 0 08 Jan 2018
Towards a Deep Improviser: a prototype deep learning post-tonal free music generator R. Dean J. Forth 24 10 0 21 Dec 2017