ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.07837
  4. Cited By
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

22 December 2016
Soroush Mehri
Kundan Kumar
Ishaan Gulrajani
Rithesh Kumar
Shubham Jain
Jose M. R. Sotelo
Aaron Courville
Yoshua Bengio
ArXivPDFHTML

Papers citing "SampleRNN: An Unconditional End-to-End Neural Audio Generation Model"

24 / 274 papers shown
Title
A Hierarchical Recurrent Neural Network for Symbolic Melody Generation
A Hierarchical Recurrent Neural Network for Symbolic Melody Generation
Jian Wu
Changran Hu
Yulong Wang
Xiaolin Hu
Jun Zhu
18
81
0
14 Dec 2017
Visual to Sound: Generating Natural Sound for Videos in the Wild
Visual to Sound: Generating Natural Sound for Videos in the Wild
Yipin Zhou
Zhaowen Wang
Chen Fang
Trung Bui
Tamara L. Berg
VGen
18
206
0
04 Dec 2017
JamBot: Music Theory Aware Chord Based Generation of Polyphonic Music
  with LSTMs
JamBot: Music Theory Aware Chord Based Generation of Polyphonic Music with LSTMs
Gino Brunner
Yuyi Wang
Roger Wattenhofer
Jonas Wiesendanger
MGen
16
49
0
21 Nov 2017
Variational Bi-LSTMs
Variational Bi-LSTMs
Samira Shabanian
Devansh Arpit
Adam Trischler
Yoshua Bengio
DRL
30
24
0
15 Nov 2017
Sparse Attentive Backtracking: Long-Range Credit Assignment in Recurrent
  Networks
Sparse Attentive Backtracking: Long-Range Credit Assignment in Recurrent Networks
Nan Rosemary Ke
Anirudh Goyal
O. Bilaniuk
Jonathan Binas
Laurent Charlin
C. Pal
Yoshua Bengio
25
15
0
07 Nov 2017
Neural Discrete Representation Learning
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
43
4,822
0
02 Nov 2017
TasNet: time-domain audio separation network for real-time,
  single-channel speech separation
TasNet: time-domain audio separation network for real-time, single-channel speech separation
Yi Luo
N. Mesgarani
19
621
0
01 Nov 2017
Malware Detection by Eating a Whole EXE
Malware Detection by Eating a Whole EXE
Edward Raff
Jon Barker
Jared Sylvester
Robert Brandon
Bryan Catanzaro
Charles K. Nicholas
32
537
0
25 Oct 2017
Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence
  Learning
Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning
Ming-Yu Liu
Kainan Peng
Andrew Gibiansky
Sercan Ö. Arik
Ajay Kannan
Sharan Narang
Jonathan Raiman
John Miller
24
303
0
20 Oct 2017
A Tutorial on Deep Learning for Music Information Retrieval
A Tutorial on Deep Learning for Music Information Retrieval
Keunwoo Choi
Gyorgy Fazekas
Kyunghyun Cho
Mark Sandler
VLM
17
91
0
13 Sep 2017
Deep Learning Techniques for Music Generation -- A Survey
Deep Learning Techniques for Music Generation -- A Survey
Jean-Pierre Briot
Gaëtan Hadjeres
F. Pachet
MGen
37
297
0
05 Sep 2017
Audio Super Resolution using Neural Networks
Audio Super Resolution using Neural Networks
Volodymyr Kuleshov
S. Enam
Stefano Ermon
SupR
26
126
0
02 Aug 2017
VoiceLoop: Voice Fitting and Synthesis via a Phonological Loop
VoiceLoop: Voice Fitting and Synthesis via a Phonological Loop
Yaniv Taigman
Lior Wolf
Adam Polyak
Eliya Nachmani
17
26
0
20 Jul 2017
Do Neural Nets Learn Statistical Laws behind Natural Language?
Do Neural Nets Learn Statistical Laws behind Natural Language?
Shuntaro Takahashi
Kumiko Tanaka-Ishii
33
27
0
16 Jul 2017
Statistical Parametric Speech Synthesis Using Generative Adversarial
  Networks Under A Multi-task Learning Framework
Statistical Parametric Speech Synthesis Using Generative Adversarial Networks Under A Multi-task Learning Framework
Shan Yang
Lei Xie
Xiao Chen
Xiaoyan Lou
Xuan Zhu
Dongyan Huang
Haizhou Li
GAN
33
46
0
06 Jul 2017
A Wavenet for Speech Denoising
A Wavenet for Speech Denoising
Dario Rethage
Jordi Pons
Xavier Serra
19
430
0
22 Jun 2017
Deep Voice 2: Multi-Speaker Neural Text-to-Speech
Deep Voice 2: Multi-Speaker Neural Text-to-Speech
Sercan Ö. Arik
G. Diamos
Andrew Gibiansky
John Miller
Kainan Peng
Ming-Yu Liu
Jonathan Raiman
Yanqi Zhou
19
494
0
24 May 2017
Learning Latent Representations for Speech Generation and Transformation
Learning Latent Representations for Speech Generation and Transformation
Wei-Ning Hsu
Yu Zhang
James R. Glass
DRL
BDL
SSL
18
145
0
13 Apr 2017
A Neural Parametric Singing Synthesizer
A Neural Parametric Singing Synthesizer
Merlijn Blaauw
J. Bonada
19
98
0
12 Apr 2017
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders
Jesse Engel
Cinjon Resnick
Adam Roberts
Sander Dieleman
Douglas Eck
Karen Simonyan
Mohammad Norouzi
23
613
0
05 Apr 2017
MidiNet: A Convolutional Generative Adversarial Network for
  Symbolic-domain Music Generation
MidiNet: A Convolutional Generative Adversarial Network for Symbolic-domain Music Generation
Li-Chia Yang
Szu-Yu Chou
Yi-Hsuan Yang
GAN
MGen
9
460
0
31 Mar 2017
Tacotron: Towards End-to-End Speech Synthesis
Tacotron: Towards End-to-End Speech Synthesis
Yuxuan Wang
RJ Skerry-Ryan
Daisy Stanton
Yonghui Wu
Ron J. Weiss
...
Samy Bengio
Quoc V. Le
Yannis Agiomyrgiannakis
R. Clark
Rif A. Saurous
45
1,804
0
29 Mar 2017
Deep Voice: Real-time Neural Text-to-Speech
Deep Voice: Real-time Neural Text-to-Speech
Sercan Ö. Arik
Mike Chrzanowski
Adam Coates
G. Diamos
Andrew Gibiansky
...
John Miller
Andrew Ng
Jonathan Raiman
Shubho Sengupta
M. Shoeybi
21
612
0
25 Feb 2017
Pixel Recurrent Neural Networks
Pixel Recurrent Neural Networks
Aaron van den Oord
Nal Kalchbrenner
Koray Kavukcuoglu
SSeg
GAN
266
2,550
0
25 Jan 2016
Previous
123456