ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.07837
  4. Cited By
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

22 December 2016
Soroush Mehri
Kundan Kumar
Ishaan Gulrajani
Rithesh Kumar
Shubham Jain
Jose M. R. Sotelo
Aaron Courville
Yoshua Bengio
ArXivPDFHTML

Papers citing "SampleRNN: An Unconditional End-to-End Neural Audio Generation Model"

50 / 274 papers shown
Title
SoundStream: An End-to-End Neural Audio Codec
SoundStream: An End-to-End Neural Audio Codec
Neil Zeghidour
Alejandro Luebs
Ahmed Omran
Jan Skoglund
Marco Tagliasacchi
AI4TS
43
731
0
07 Jul 2021
Energy Consumption of Deep Generative Audio Models
Energy Consumption of Deep Generative Audio Models
Constance Douwes
P. Esling
Jean-Pierre Briot
MedIm
17
13
0
06 Jul 2021
A Survey on Neural Speech Synthesis
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
18
352
0
29 Jun 2021
Distilling the Knowledge from Conditional Normalizing Flows
Distilling the Knowledge from Conditional Normalizing Flows
Dmitry Baranchuk
Vladimir Aliev
Artem Babenko
BDL
36
2
0
24 Jun 2021
Glow-WaveGAN: Learning Speech Representations from GAN-based Variational
  Auto-Encoder For High Fidelity Flow-based Speech Synthesis
Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis
Jian Cong
Shan Yang
Lei Xie
Dan Su
DRL
18
29
0
21 Jun 2021
CRASH: Raw Audio Score-based Generative Modeling for Controllable
  High-resolution Drum Sound Synthesis
CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis
Simon Rouard
Gaëtan Hadjeres
DiffM
19
42
0
14 Jun 2021
Catch-A-Waveform: Learning to Generate Audio from a Single Short Example
Catch-A-Waveform: Learning to Generate Audio from a Single Short Example
Gal Greshler
Tamar Rott Shaham
T. Michaeli
18
25
0
11 Jun 2021
LoopNet: Musical Loop Synthesis Conditioned On Intuitive Musical
  Parameters
LoopNet: Musical Loop Synthesis Conditioned On Intuitive Musical Parameters
Pritish Chandna
António Ramires
Xavier Serra
Emilia Gómez
24
4
0
21 May 2021
Parallel and Flexible Sampling from Autoregressive Models via Langevin
  Dynamics
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics
V. Jayaram
John Thickstun
DiffM
20
23
0
17 May 2021
ItôTTS and ItôWave: Linear Stochastic Differential Equation Is All
  You Need For Audio Generation
ItôTTS and ItôWave: Linear Stochastic Differential Equation Is All You Need For Audio Generation
Shoule Wu
Ziqiang Shi
DiffM
21
11
0
17 May 2021
Review of end-to-end speech synthesis technology based on deep learning
Review of end-to-end speech synthesis technology based on deep learning
Zhaoxi Mu
Xinyu Yang
Yizhuo Dong
AuLLM
ALM
26
24
0
20 Apr 2021
Unified Source-Filter GAN: Unified Source-filter Network Based On
  Factorization of Quasi-Periodic Parallel WaveGAN
Unified Source-Filter GAN: Unified Source-filter Network Based On Factorization of Quasi-Periodic Parallel WaveGAN
Reo Yoneyama
Yi-Chiao Wu
T. Toda
14
12
0
10 Apr 2021
The AS-NU System for the M2VoC Challenge
The AS-NU System for the M2VoC Challenge
Cheng-Hung Hu
Yi-Chiao Wu
Wen-Chin Huang
Yu-Huai Peng
Yu-Wen Chen
Pin-Jui Ku
T. Toda
Yu Tsao
Hsin-Min Wang
11
1
0
07 Apr 2021
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
Giorgio Barnabò
Giovanni Trappolini
L. Lastilla
Cesare Campagnano
Angela Fan
Fabio Petroni
Fabrizio Silvestri
14
4
0
01 Apr 2021
Improve GAN-based Neural Vocoder using Pointwise Relativistic
  LeastSquare GAN
Improve GAN-based Neural Vocoder using Pointwise Relativistic LeastSquare GAN
Cong Wang
Yu Chen
Bin Wang
Yi Shi
32
1
0
26 Mar 2021
Latent Space Explorations of Singing Voice Synthesis using DDSP
Latent Space Explorations of Singing Voice Synthesis using DDSP
J. Alonso
Cumhur Erkut
41
12
0
12 Mar 2021
Deep Generative Modelling: A Comparative Review of VAEs, GANs,
  Normalizing Flows, Energy-Based and Autoregressive Models
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models
Sam Bond-Taylor
Adam Leach
Yang Long
Chris G. Willcocks
VLM
TPM
41
481
0
08 Mar 2021
Handling Background Noise in Neural Speech Generation
Handling Background Noise in Neural Speech Generation
Tom Denton
Alejandro Luebs
Felicia S. C. Lim
Andrew Storus
Hengchin Yeh
W. Kleijn
Jan Skoglund
8
2
0
23 Feb 2021
Hierarchical Recurrent Neural Networks for Conditional Melody Generation
  with Long-term Structure
Hierarchical Recurrent Neural Networks for Conditional Melody Generation with Long-term Structure
Zixun Guo
D. Makris
Dorien Herremans
16
24
0
19 Feb 2021
AudioVisual Speech Synthesis: A brief literature review
AudioVisual Speech Synthesis: A brief literature review
Efthymios Georgiou
Athanasios Katsamanis
21
0
0
18 Feb 2021
Environment Transfer for Distributed Systems
Environment Transfer for Distributed Systems
Chunheng Jiang
Jae-wook Ahn
N. Desai
28
1
0
06 Jan 2021
Group Communication with Context Codec for Lightweight Source Separation
Group Communication with Context Codec for Lightweight Source Separation
Yi Luo
Cong Han
N. Mesgarani
26
20
0
14 Dec 2020
I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at
  Pitch
I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at Pitch
Joseph P. Turian
Max Henry
24
29
0
08 Dec 2020
SRECG: ECG Signal Super-resolution Framework for Portable/Wearable
  Devices in Cardiac Arrhythmias Classification
SRECG: ECG Signal Super-resolution Framework for Portable/Wearable Devices in Cardiac Arrhythmias Classification
Tsai-Min Chen
Yuan-Hong Tsai
Huan-Hsin Tseng
Kai-Chun Liu
Jhih-Yu Chen
Chih-Han Huang
Guo-Yuan Li
Chun-Yen Shen
Yu Tsao
41
22
0
07 Dec 2020
Multi-Instrumentalist Net: Unsupervised Generation of Music from Body
  Movements
Multi-Instrumentalist Net: Unsupervised Generation of Music from Body Movements
Kun Su
Xiulong Liu
Eli Shlizerman
24
28
0
07 Dec 2020
Text-to-speech for the hearing impaired
Text-to-speech for the hearing impaired
Josef Schlittenlacher
T. Baer
9
0
0
03 Dec 2020
MTCRNN: A multi-scale RNN for directed audio texture synthesis
MTCRNN: A multi-scale RNN for directed audio texture synthesis
M. Huzaifah
L. Wyse
14
2
0
25 Nov 2020
End-To-End Dilated Variational Autoencoder with Bottleneck
  Discriminative Loss for Sound Morphing -- A Preliminary Study
End-To-End Dilated Variational Autoencoder with Bottleneck Discriminative Loss for Sound Morphing -- A Preliminary Study
Matteo Lionello
Hendrik Purwins
28
0
0
19 Nov 2020
Vertical-Horizontal Structured Attention for Generating Music with
  Chords
Vertical-Horizontal Structured Attention for Generating Music with Chords
Yizhou Zhao
Liang Qiu
Wensi Ai
Feng Shi
Song-Chun Zhu
MGen
27
2
0
18 Nov 2020
Towards transformation-resilient provenance detection of digital media
Towards transformation-resilient provenance detection of digital media
Jamie Hayes
Krishnamurthy Dvijotham
Dvijotham
Yutian Chen
Sander Dieleman
Pushmeet Kohli
Norman Casagrande
18
3
0
14 Nov 2020
A Comprehensive Survey on Deep Music Generation: Multi-level
  Representations, Algorithms, Evaluations, and Future Directions
A Comprehensive Survey on Deep Music Generation: Multi-level Representations, Algorithms, Evaluations, and Future Directions
Shulei Ji
Jing Luo
Xinyu Yang
MGen
13
125
0
13 Nov 2020
Sound Synthesis, Propagation, and Rendering: A Survey
Sound Synthesis, Propagation, and Rendering: A Survey
Shiguang Liu
Tianyi Zhou
27
26
0
11 Nov 2020
StyleMelGAN: An Efficient High-Fidelity Adversarial Vocoder with
  Temporal Adaptive Normalization
StyleMelGAN: An Efficient High-Fidelity Adversarial Vocoder with Temporal Adaptive Normalization
Ahmed Mustafa
N. Pia
Guillaume Fuchs
19
71
0
03 Nov 2020
Listening to Sounds of Silence for Speech Denoising
Listening to Sounds of Silence for Speech Denoising
Ruilin Xu
Rundi Wu
Y. Ishiwaka
Carl Vondrick
Changxi Zheng
25
32
0
22 Oct 2020
NU-GAN: High resolution neural upsampling with GAN
NU-GAN: High resolution neural upsampling with GAN
Rithesh Kumar
Kundan Kumar
Vicki Anand
Yoshua Bengio
Aaron Courville
24
25
0
22 Oct 2020
AI Song Contest: Human-AI Co-Creation in Songwriting
AI Song Contest: Human-AI Co-Creation in Songwriting
Cheng-Zhi Anna Huang
Hendrik Vincent Koops
Ed Newton-Rex
Monica Dinculescu
Carrie J. Cai
13
89
0
12 Oct 2020
The NU Voice Conversion System for the Voice Conversion Challenge 2020:
  On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural
  Vocoders
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders
Wen-Chin Huang
Patrick Lumban Tobing
Yi-Chiao Wu
Kazuhiro Kobayashi
T. Toda
19
8
0
09 Oct 2020
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed
  Langevin Dynamics
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
Shogo Seki
DiffM
23
21
0
06 Oct 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
34
1,392
0
21 Sep 2020
WaveGrad: Estimating Gradients for Waveform Generation
WaveGrad: Estimating Gradients for Waveform Generation
Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
William Chan
DiffM
BDL
14
771
0
02 Sep 2020
Nonparallel Voice Conversion with Augmented Classifier Star Generative
  Adversarial Networks
Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
13
20
0
27 Aug 2020
Bunched LPCNet : Vocoder for Low-cost Neural Text-To-Speech Systems
Bunched LPCNet : Vocoder for Low-cost Neural Text-To-Speech Systems
Ravichander Vipperla
Sangjun Park
Kihyun Choo
Samin S. Ishtiaq
Kyoungbo Min
S. Bhattacharya
Abhinav Mehrotra
Alberto Gil C. P. Ramos
Nicholas D. Lane
18
26
0
11 Aug 2020
Speaker Conditional WaveRNN: Towards Universal Neural Vocoder for Unseen
  Speaker and Recording Conditions
Speaker Conditional WaveRNN: Towards Universal Neural Vocoder for Unseen Speaker and Recording Conditions
D. Paul
Yannis Pantazis
Y. Stylianou
DRL
8
29
0
09 Aug 2020
Unsupervised Cross-Domain Singing Voice Conversion
Unsupervised Cross-Domain Singing Voice Conversion
Adam Polyak
Lior Wolf
Yossi Adi
Yaniv Taigman
12
44
0
06 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with
  Adversarial Learning
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
15
6
0
05 Aug 2020
Diet deep generative audio models with structured lottery
Diet deep generative audio models with structured lottery
P. Esling
Ninon Devis
Adrien Bitton
Antoine Caillon
Axel Chemla-Romeu-Santos
Constance Douwes
6
6
0
31 Jul 2020
Generating Visually Aligned Sound from Videos
Generating Visually Aligned Sound from Videos
Peihao Chen
Yang Zhang
Mingkui Tan
Hongdong Xiao
Deng Huang
Chuang Gan
VGen
18
95
0
14 Jul 2020
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model
  with Pitch-dependent Dilated Convolution Neural Network
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network
Yi-Chiao Wu
Tomoki Hayashi
Patrick Lumban Tobing
Kazuhiro Kobayashi
T. Toda
27
18
0
11 Jul 2020
Face-to-Music Translation Using a Distance-Preserving Generative
  Adversarial Network with an Auxiliary Discriminator
Face-to-Music Translation Using a Distance-Preserving Generative Adversarial Network with an Auxiliary Discriminator
Chelhwon Kim
Andrew Allan Port
Mitesh Patel
CVBM
19
1
0
24 Jun 2020
Audeo: Audio Generation for a Silent Performance Video
Audeo: Audio Generation for a Silent Performance Video
Kun Su
Xiulong Liu
Eli Shlizerman
VGen
26
67
0
23 Jun 2020
Previous
123456
Next