ResearchTrend.AI
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

22 December 2016
Soroush Mehri
Kundan Kumar
Ishaan Gulrajani
Rithesh Kumar
Shubham Jain
Jose M. R. Sotelo
Aaron Courville
Yoshua Bengio
arXiv: 1612.07837

Papers citing "SampleRNN: An Unconditional End-to-End Neural Audio Generation Model"

50 / 274 papers shown
Title
Improving Opus Low Bit Rate Quality with Neural Speech Synthesis
Jan Skoglund
J. Valin
37
38
0
12 May 2019
Deep Learning for Audio Signal Processing
Hendrik Purwins
Bo-wen Li
Tuomas Virtanen
Jan Schlüter
Shuo-yiin Chang
Tara N. Sainath
VLM
24
586
0
30 Apr 2019
Neural source-filter waveform models for statistical parametric speech synthesis
Xin Wang
Shinji Takaki
Junichi Yamagishi
31
117
0
27 Apr 2019
The Zero Resource Speech Challenge 2019: TTS without T
Ewan Dunbar
Robin Algayres
Julien Karadayi
Mathieu Bernard
Juan Benjumea
...
Lucas Ondel
A. Black
Laurent Besacier
S. Sakti
Emmanuel Dupoux
17
116
0
25 Apr 2019
Generating Long Sequences with Sparse Transformers
R. Child
Scott Gray
Alec Radford
Ilya Sutskever
16
1,848
0
23 Apr 2019
Singing voice synthesis based on convolutional neural networks
Kazuhiro Nakamura
Kei Hashimoto
Keiichiro Oura
Yoshihiko Nankaku
K. Tokuda
18
33
0
15 Apr 2019
RNN-based speech synthesis using a continuous sinusoidal model
M. S. Al-Radhi
T. Csapó
Géza Németh
12
4
0
12 Apr 2019
End-to-end Binaural Sound Localisation from the Raw Waveform
Paolo Vecchiotti
Ning Ma
S. Squartini
Guy J. Brown
11
59
0
03 Apr 2019
Training a Neural Speech Waveform Model using Spectral Losses of Short-Time Fourier Transform and Continuous Wavelet Transform
Shinji Takaki
Hirokazu Kameoka
Junichi Yamagishi
6
2
0
29 Mar 2019
A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet
J. Valin
Jan Skoglund
24
78
0
28 Mar 2019
Bandwidth Extension on Raw Audio via Generative Adversarial Networks
S. Kim
V. Sathe
GAN
8
26
0
21 Mar 2019
GANSynth: Adversarial Neural Audio Synthesis
Jesse Engel
Kumar Krishna Agrawal
Shuo Chen
Ishaan Gulrajani
Chris Donahue
Adam Roberts
46
385
0
23 Feb 2019
Capacity allocation through neural network layers
Jonathan Donier
14
3
0
22 Feb 2019
Capacity allocation analysis of neural networks: A tool for principled architecture design
Jonathan Donier
22
4
0
12 Feb 2019
Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning
Frederik Benzing
M. Gauy
Asier Mujika
A. Martinsson
Angelika Steger
23
22
0
11 Feb 2019
Adversarial Generation of Time-Frequency Features with application in audio synthesis
Andrés Marafioti
Nicki Holighaus
Nathanael Perraudin
P. Majdak
17
68
0
11 Feb 2019
Classical Music Generation in Distinct Dastgahs with AlimNet ACGAN
Saber Malekzadeh
Maryam Samami
Shahla Rezazadeh Azar
Maryam Rayegan
GAN
MGen
22
3
0
15 Jan 2019
Introduction to Voice Presentation Attack Detection and Recent Advances
Md. Sahidullah
Héctor Delgado
Massimiliano Todisco
Tomi Kinnunen
Nicholas W. D. Evans
Junichi Yamagishi
Kong-Aik Lee
AAML
8
75
0
04 Jan 2019
Interpretable Convolutional Filters with SincNet
Mirco Ravanelli
Yoshua Bengio
21
104
0
23 Nov 2018
TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer
Sicong Huang
Qiyang Li
Cem Anil
Xuchan Bao
Sageev Oore
Roger C. Grosse
19
97
0
22 Nov 2018
Representation Mixing for TTS Synthesis
Kyle Kastner
J. F. Santos
Yoshua Bengio
Aaron Courville
11
43
0
17 Nov 2018
Generating Black Metal and Math Rock: Beyond Bach, Beethoven, and Beatles
Zack Zukowski
CJ Carr
11
18
0
16 Nov 2018
Generating Albums with SampleRNN to Imitate Metal, Rock, and Punk Bands
CJ Carr
Zack Zukowski
MGen
8
20
0
16 Nov 2018
Comprehensive evaluation of statistical speech waveform synthesis
Thomas Merritt
Bartosz Putrycz
Adam Nadolski
Tianjun Ye
Daniel Korzekwa
...
Alexis Moinet
A. Breen
Rafal Kuklinski
N. Strom
Roberto Barra-Chicote
14
17
0
15 Nov 2018
Speaker-adaptive neural vocoders for parametric speech synthesis systems
Eunwoo Song
Xiang Yu
Erik Cambria
Jagath Rajapakse
6
3
0
08 Nov 2018
High-quality speech coding with SampleRNN
Adam Conkey
Per Hedelin
Cong Zhou
Tucker Hermans
Lars Villemoes
11
59
0
07 Nov 2018
Modeling Melodic Feature Dependency with Modularized Variational Auto-Encoder
Yu-An Wang
Yu-Kai Huang
Tzu-Chuan Lin
Shang-Yu Su
Yun-Nung (Vivian) Chen
14
3
0
31 Oct 2018
Audio inpainting of music by means of neural networks
Andrés Marafioti
Nicki Holighaus
P. Majdak
Nathanael Perraudin
16
18
0
29 Oct 2018
LPCNet: Improving Neural Speech Synthesis Through Linear Prediction
J. Valin
Jan Skoglund
6
448
0
28 Oct 2018
SING: Symbol-to-Instrument Neural Generator
Alexandre Défossez
Neil Zeghidour
Nicolas Usunier
Léon Bottou
Francis R. Bach
13
59
0
23 Oct 2018
Modulated Variational auto-Encoders for many-to-many musical timbre transfer
Adrien Bitton
P. Esling
Axel Chemla-Romeu-Santos
20
25
0
29 Sep 2018
MIDI-VAE: Modeling Dynamics and Instrumentation of Music with Applications to Style Transfer
Gino Brunner
Andres Konrad
Yuyi Wang
Roger Wattenhofer
33
133
0
20 Sep 2018
Neural Speech Synthesis with Transformer Network
Naihan Li
Shujie Liu
Yanqing Liu
Sheng Zhao
Ming Liu
M. Zhou
16
102
0
19 Sep 2018
Voice Conversion with Conditional SampleRNN
Cong Zhou
Michael Horgan
Vivek Kumar
Cristina Vasco
Dan Darcy
15
20
0
24 Aug 2018
Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis
G. Henter
Jaime Lorenzo-Trueba
Xin Wang
Junichi Yamagishi
DRL
SSL
13
61
0
30 Jul 2018
Speaker Recognition from Raw Waveform with SincNet
Mirco Ravanelli
Yoshua Bengio
44
698
0
29 Jul 2018
ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech
Wei Ping
Kainan Peng
Jitong Chen
12
342
0
19 Jul 2018
The challenge of realistic music generation: modelling raw audio at scale
Sander Dieleman
Aaron van den Oord
Karen Simonyan
21
184
0
26 Jun 2018
Sounderfeit: Cloning a Physical Model using a Conditional Adversarial Autoencoder
Stephen Sinclair
GAN
13
1
0
25 Jun 2018
Voice Imitating Text-to-Speech Neural Networks
Younggun Lee
Taesu Kim
Soo-Young Lee
24
11
0
04 Jun 2018
Real-valued parametric conditioning of an RNN for interactive sound synthesis
L. Wyse
12
9
0
28 May 2018
Generative timbre spaces: regularizing variational auto-encoders with perceptual metrics
P. Esling
Axel Chemla-Romeu-Santos
Adrien Bitton
12
32
0
22 May 2018
Collapsed speech segment detection and suppression for WaveNet vocoder
Yi-Chiao Wu
Kazuhiro Kobayashi
Tomoki Hayashi
Patrick Lumban Tobing
T. Toda
7
25
0
30 Apr 2018
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods
Jaime Lorenzo-Trueba
Junichi Yamagishi
T. Toda
Daisuke Saito
F. Villavicencio
Tomi Kinnunen
Zhenhua Ling
14
318
0
12 Apr 2018
Efficient Neural Audio Synthesis
Nal Kalchbrenner
Erich Elsen
Karen Simonyan
Seb Noury
Norman Casagrande
Edward Lockhart
Florian Stimberg
Aaron van den Oord
Sander Dieleman
Koray Kavukcuoglu
21
863
0
23 Feb 2018
Neural Voice Cloning with a Few Samples
Sercan Ö. Arik
Jitong Chen
Kainan Peng
Wei Ping
Yanqi Zhou
17
381
0
14 Feb 2018
Adversarial Audio Synthesis
Chris Donahue
Julian McAuley
M. Puckette
GAN
33
602
0
12 Feb 2018
Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension
Zhenhua Ling
Yang Ai
Yu Gu
Lirong Dai
16
61
0
24 Jan 2018
Attacking Speaker Recognition With Deep Generative Models
Wilson Cai
Anish Doshi
Rafael Valle
GAN
10
22
0
08 Jan 2018
Towards a Deep Improviser: a prototype deep learning post-tonal free music generator
R. Dean
J. Forth
24
10
0
21 Dec 2017