Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1808.06719
Cited By
Fast Spectrogram Inversion using Multi-head Convolutional Neural Networks
20 August 2018
Sercan O. Arik
Heewoo Jun
G. Diamos
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Fast Spectrogram Inversion using Multi-head Convolutional Neural Networks"
16 / 16 papers shown
Title
Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration
Shigeki Karita
Yuma Koizumi
Heiga Zen
Haruko Ishikawa
Robin Scheibler
M. Bacchiani
VLM
367
1
0
07 May 2025
NeRAF: 3D Scene Infused Neural Radiance and Acoustic Fields
Amandine Brunetto
Sascha Hornauer
Fabien Moutarde
92
2
0
28 May 2024
Neural Voice Cloning with a Few Samples
Sercan O. Arik
Jitong Chen
Kainan Peng
Ming-Yu Liu
Yanqi Zhou
58
387
0
14 Feb 2018
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Jonathan Shen
Ruoming Pang
Ron J. Weiss
M. Schuster
Navdeep Jaitly
...
Yuxuan Wang
RJ Skerry-Ryan
Rif A. Saurous
Yannis Agiomyrgiannakis
Yonghui Wu
79
2,697
0
16 Dec 2017
Parallel WaveNet: Fast High-Fidelity Speech Synthesis
Aaron van den Oord
Yazhe Li
Igor Babuschkin
Karen Simonyan
Oriol Vinyals
...
Alex Graves
Helen King
T. Walters
Dan Belov
Demis Hassabis
210
858
0
28 Nov 2017
Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition
Chris Donahue
Yue Liu
Rohit Prabhavalkar
54
200
0
15 Nov 2017
Audio style transfer
Eric Grinstein
Ngoc Q. K. Duong
A. Ozerov
P. Pérez
44
68
0
31 Oct 2017
Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning
Ming-Yu Liu
Kainan Peng
Andrew Gibiansky
Sercan O. Arik
Ajay Kannan
Sharan Narang
Jonathan Raiman
John Miller
66
307
0
20 Oct 2017
Deep Voice 2: Multi-Speaker Neural Text-to-Speech
Sercan O. Arik
G. Diamos
Andrew Gibiansky
John Miller
Kainan Peng
Ming-Yu Liu
Jonathan Raiman
Yanqi Zhou
72
496
0
24 May 2017
In-Datacenter Performance Analysis of a Tensor Processing Unit
N. Jouppi
C. Young
Nishant Patil
David Patterson
Gaurav Agrawal
...
Vijay Vasudevan
Richard Walter
Walter Wang
Eric Wilcox
Doe Hyun Yoon
235
4,632
0
16 Apr 2017
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders
Jesse Engel
Cinjon Resnick
Adam Roberts
Sander Dieleman
Douglas Eck
Karen Simonyan
Mohammad Norouzi
112
624
0
05 Apr 2017
Deep Voice: Real-time Neural Text-to-Speech
Sercan O. Arik
Mike Chrzanowski
Adam Coates
G. Diamos
Andrew Gibiansky
...
John Miller
Andrew Ng
Jonathan Raiman
Shubho Sengupta
Mohammad Shoeybi
80
616
0
25 Feb 2017
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
406
7,391
0
12 Sep 2016
A Non-iterative Method for (Re)Construction of Phase from STFT Magnitude
Zdeněk Průša
Péter Balázs
P. Søndergaard
40
77
0
01 Sep 2016
A guide to convolution arithmetic for deep learning
Vincent Dumoulin
Francesco Visin
FAtt
3DH
HAI
63
1,541
0
23 Mar 2016
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.8K
150,039
0
22 Dec 2014
1