SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

22 December 2016

Aaron Courville

Papers citing "SampleRNN: An Unconditional End-to-End Neural Audio Generation Model"

SoundStream: An End-to-End Neural Audio Codec Neil Zeghidour Alejandro Luebs Ahmed Omran Jan Skoglund Marco Tagliasacchi AI4TS 43 731 0 07 Jul 2021
Energy Consumption of Deep Generative Audio Models Constance Douwes P. Esling Jean-Pierre Briot MedIm 17 13 0 06 Jul 2021
A Survey on Neural Speech Synthesis Xu Tan Tao Qin Frank Soong Tie-Yan Liu AI4TS 18 352 0 29 Jun 2021
Distilling the Knowledge from Conditional Normalizing Flows Dmitry Baranchuk Vladimir Aliev Artem Babenko BDL 36 2 0 24 Jun 2021
Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis Jian Cong Shan Yang Lei Xie Dan Su DRL 18 29 0 21 Jun 2021
CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis Simon Rouard Gaëtan Hadjeres DiffM 19 42 0 14 Jun 2021
Catch-A-Waveform: Learning to Generate Audio from a Single Short Example Gal Greshler Tamar Rott Shaham T. Michaeli 18 25 0 11 Jun 2021
LoopNet: Musical Loop Synthesis Conditioned On Intuitive Musical Parameters Pritish Chandna António Ramires Xavier Serra Emilia Gómez 24 4 0 21 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics V. Jayaram John Thickstun DiffM 20 23 0 17 May 2021
ItôTTS and ItôWave: Linear Stochastic Differential Equation Is All You Need For Audio Generation Shoule Wu Ziqiang Shi DiffM 21 11 0 17 May 2021
Review of end-to-end speech synthesis technology based on deep learning Zhaoxi Mu Xinyu Yang Yizhuo Dong AuLLM ALM 26 24 0 20 Apr 2021
Unified Source-Filter GAN: Unified Source-filter Network Based On Factorization of Quasi-Periodic Parallel WaveGAN Reo Yoneyama Yi-Chiao Wu T. Toda 14 12 0 10 Apr 2021
The AS-NU System for the M2VoC Challenge Cheng-Hung Hu Yi-Chiao Wu Wen-Chin Huang Yu-Huai Peng Yu-Wen Chen Pin-Jui Ku T. Toda Yu Tsao Hsin-Min Wang 11 1 0 07 Apr 2021
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN Giorgio Barnabò Giovanni Trappolini L. Lastilla Cesare Campagnano Angela Fan Fabio Petroni Fabrizio Silvestri 14 4 0 01 Apr 2021
Improve GAN-based Neural Vocoder using Pointwise Relativistic LeastSquare GAN Cong Wang Yu Chen Bin Wang Yi Shi 32 1 0 26 Mar 2021
Latent Space Explorations of Singing Voice Synthesis using DDSP J. Alonso Cumhur Erkut 41 12 0 12 Mar 2021
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models Sam Bond-Taylor Adam Leach Yang Long Chris G. Willcocks VLM TPM 41 481 0 08 Mar 2021
Handling Background Noise in Neural Speech Generation Tom Denton Alejandro Luebs Felicia S. C. Lim Andrew Storus Hengchin Yeh W. Kleijn Jan Skoglund 8 2 0 23 Feb 2021
Hierarchical Recurrent Neural Networks for Conditional Melody Generation with Long-term Structure Zixun Guo D. Makris Dorien Herremans 16 24 0 19 Feb 2021
AudioVisual Speech Synthesis: A brief literature review Efthymios Georgiou Athanasios Katsamanis 21 0 0 18 Feb 2021
Environment Transfer for Distributed Systems Chunheng Jiang Jae-wook Ahn N. Desai 28 1 0 06 Jan 2021
Group Communication with Context Codec for Lightweight Source Separation Yi Luo Cong Han N. Mesgarani 26 20 0 14 Dec 2020
I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at Pitch Joseph P. Turian Max Henry 24 29 0 08 Dec 2020
SRECG: ECG Signal Super-resolution Framework for Portable/Wearable Devices in Cardiac Arrhythmias Classification Tsai-Min Chen Yuan-Hong Tsai Huan-Hsin Tseng Kai-Chun Liu Jhih-Yu Chen Chih-Han Huang Guo-Yuan Li Chun-Yen Shen Yu Tsao 41 22 0 07 Dec 2020
Multi-Instrumentalist Net: Unsupervised Generation of Music from Body Movements Kun Su Xiulong Liu Eli Shlizerman 24 28 0 07 Dec 2020
Text-to-speech for the hearing impaired Josef Schlittenlacher T. Baer 9 0 0 03 Dec 2020
MTCRNN: A multi-scale RNN for directed audio texture synthesis M. Huzaifah L. Wyse 14 2 0 25 Nov 2020
End-To-End Dilated Variational Autoencoder with Bottleneck Discriminative Loss for Sound Morphing -- A Preliminary Study Matteo Lionello Hendrik Purwins 28 0 0 19 Nov 2020
Vertical-Horizontal Structured Attention for Generating Music with Chords Yizhou Zhao Liang Qiu Wensi Ai Feng Shi Song-Chun Zhu MGen 27 2 0 18 Nov 2020
Towards transformation-resilient provenance detection of digital media Jamie Hayes Krishnamurthy Dvijotham Yutian Chen Sander Dieleman Pushmeet Kohli Norman Casagrande 18 3 0 14 Nov 2020
A Comprehensive Survey on Deep Music Generation: Multi-level Representations, Algorithms, Evaluations, and Future Directions Shulei Ji Jing Luo Xinyu Yang MGen 13 125 0 13 Nov 2020
Sound Synthesis, Propagation, and Rendering: A Survey Shiguang Liu Tianyi Zhou 27 26 0 11 Nov 2020
StyleMelGAN: An Efficient High-Fidelity Adversarial Vocoder with Temporal Adaptive Normalization Ahmed Mustafa N. Pia Guillaume Fuchs 19 71 0 03 Nov 2020
Listening to Sounds of Silence for Speech Denoising Ruilin Xu Rundi Wu Y. Ishiwaka Carl Vondrick Changxi Zheng 25 32 0 22 Oct 2020
NU-GAN: High resolution neural upsampling with GAN Rithesh Kumar Kundan Kumar Vicki Anand Yoshua Bengio Aaron Courville 24 25 0 22 Oct 2020
AI Song Contest: Human-AI Co-Creation in Songwriting Cheng-Zhi Anna Huang Hendrik Vincent Koops Ed Newton-Rex Monica Dinculescu Carrie J. Cai 13 89 0 12 Oct 2020
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders Wen-Chin Huang Patrick Lumban Tobing Yi-Chiao Wu Kazuhiro Kobayashi T. Toda 19 8 0 09 Oct 2020
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics Hirokazu Kameoka Takuhiro Kaneko Kou Tanaka Nobukatsu Hojo Shogo Seki DiffM 23 21 0 06 Oct 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis Zhifeng Kong Ming-Yu Liu Jiaji Huang Kexin Zhao Bryan Catanzaro DiffM BDL 34 1,392 0 21 Sep 2020
WaveGrad: Estimating Gradients for Waveform Generation Nanxin Chen Yu Zhang Heiga Zen Ron J. Weiss Mohammad Norouzi William Chan DiffM BDL 14 771 0 02 Sep 2020
Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks Hirokazu Kameoka Takuhiro Kaneko Kou Tanaka Nobukatsu Hojo 13 20 0 27 Aug 2020
Bunched LPCNet: Vocoder for Low-cost Neural Text-To-Speech Systems Ravichander Vipperla Sangjun Park Kihyun Choo Samin S. Ishtiaq Kyoungbo Min S. Bhattacharya Abhinav Mehrotra Alberto Gil C. P. Ramos Nicholas D. Lane 18 26 0 11 Aug 2020
Speaker Conditional WaveRNN: Towards Universal Neural Vocoder for Unseen Speaker and Recording Conditions D. Paul Yannis Pantazis Y. Stylianou DRL 8 29 0 09 Aug 2020
Unsupervised Cross-Domain Singing Voice Conversion Adam Polyak Lior Wolf Yossi Adi Yaniv Taigman 12 44 0 06 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning Jing-Xuan Zhang Zhenhua Ling Lirong Dai 15 6 0 05 Aug 2020
Diet deep generative audio models with structured lottery P. Esling Ninon Devis Adrien Bitton Antoine Caillon Axel Chemla-Romeu-Santos Constance Douwes 6 6 0 31 Jul 2020
Generating Visually Aligned Sound from Videos Peihao Chen Yang Zhang Mingkui Tan Hongdong Xiao Deng Huang Chuang Gan VGen 18 95 0 14 Jul 2020
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network Yi-Chiao Wu Tomoki Hayashi Patrick Lumban Tobing Kazuhiro Kobayashi T. Toda 27 18 0 11 Jul 2020
Face-to-Music Translation Using a Distance-Preserving Generative Adversarial Network with an Auxiliary Discriminator Chelhwon Kim Andrew Allan Port Mitesh Patel CVBM 19 1 0 24 Jun 2020
Audeo: Audio Generation for a Silent Performance Video Kun Su Xiulong Liu Eli Shlizerman VGen 26 67 0 23 Jun 2020