Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1811.02155
Cited By
FloWaveNet : A Generative Flow for Raw Audio
6 November 2018
Sungwon Kim
Sang-gil Lee
Jongyoon Song
Jaehyeon Kim
Sungroh Yoon
Re-assign community
ArXiv
PDF
HTML
Papers citing
"FloWaveNet : A Generative Flow for Raw Audio"
50 / 108 papers shown
Title
Out-of-Distribution Detection of Melanoma using Normalizing Flows
M. Valiuddin
C.G.A. Viviers
OODD
24
0
0
23 Mar 2021
GAN Vocoder: Multi-Resolution Discriminator Is All You Need
J. You
Dalhyun Kim
Gyuhyeon Nam
Geumbyeol Hwang
Gyeongsu Chae
21
27
0
09 Mar 2021
FloMo: Tractable Motion Prediction with Normalizing Flows
Christoph Schöller
Alois C. Knoll
22
22
0
05 Mar 2021
Generative Speech Coding with Predictive Variance Regularization
W. Kleijn
Andrew Storus
Michael Chinen
Tom Denton
Felicia S. C. Lim
Alejandro Luebs
Jan Skoglund
Hengchin Yeh
29
67
0
18 Feb 2021
AudioVisual Speech Synthesis: A brief literature review
Efthymios Georgiou
Athanasios Katsamanis
21
0
0
18 Feb 2021
PeriodNet: A non-autoregressive waveform generation model with a structure separating periodic and aperiodic components
Yukiya Hono
Shinji Takaki
Kei Hashimoto
Keiichiro Oura
Yoshihiko Nankaku
K. Tokuda
22
16
0
15 Feb 2021
Full-Glow: Fully conditional Glow for more realistic image generation
Moein Sorkhei
G. Henter
Hedvig Kjellström
25
6
0
10 Dec 2020
Text-to-speech for the hearing impaired
Josef Schlittenlacher
T. Baer
14
0
0
03 Dec 2020
MelGlow: Efficient Waveform Generative Network Based on Location-Variable Convolution
Zhen Zeng
Jianzong Wang
Ning Cheng
Jing Xiao
14
8
0
03 Dec 2020
Empirical Evaluation of Deep Learning Model Compression Techniques on the WaveNet Vocoder
Sam Davis
Giuseppe Coccia
Sam Gooch
Julian Mack
14
0
0
20 Nov 2020
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis
Ron J. Weiss
RJ Skerry-Ryan
Eric Battenberg
Soroosh Mariooryad
Diederik P. Kingma
24
98
0
06 Nov 2020
Problems using deep generative models for probabilistic audio source separation
M. Frank
Maximilian Ilse
DiffM
15
4
0
03 Nov 2020
Speech Synthesis and Control Using Differentiable DSP
Giorgio Fabbro
Vladimir Golkov
Thomas Kemp
Daniel Cremers
28
12
0
28 Oct 2020
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
Ryuichi Yamamoto
Eunwoo Song
Min-Jae Hwang
Jae-Min Kim
29
18
0
27 Oct 2020
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders
Wen-Chin Huang
Patrick Lumban Tobing
Yi-Chiao Wu
Kazuhiro Kobayashi
T. Toda
21
8
0
09 Oct 2020
Improving Sequential Latent Variable Models with Autoregressive Flows
Joseph Marino
Lei Chen
Jiawei He
Stephan Mandt
BDL
AI4TS
30
12
0
07 Oct 2020
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
Shogo Seki
DiffM
28
21
0
06 Oct 2020
Haar Wavelet based Block Autoregressive Flows for Trajectories
Apratim Bhattacharyya
C. Straehle
Mario Fritz
Bernt Schiele
AI4TS
26
15
0
21 Sep 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
36
1,397
0
21 Sep 2020
WaveGrad: Estimating Gradients for Waveform Generation
Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
William Chan
DiffM
BDL
16
773
0
02 Sep 2020
Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
18
20
0
27 Aug 2020
Audio Dequantization for High Fidelity Audio Generation in Flow-based Neural Vocoder
Hyun-Wook Yoon
Sang-Hoon Lee
Hyeong-Rae Noh
Seong-Whan Lee
20
11
0
16 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
45
318
0
09 Aug 2020
Unsupervised Cross-Domain Singing Voice Conversion
Adam Polyak
Lior Wolf
Yossi Adi
Yaniv Taigman
20
44
0
06 Aug 2020
A Spectral Energy Distance for Parallel Speech Synthesis
A. Gritsenko
Tim Salimans
Rianne van den Berg
Jasper Snoek
Nal Kalchbrenner
13
70
0
03 Aug 2020
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Jinhyeok Yang
Junmo Lee
Young-Ik Kim
Hoonyoung Cho
Injung Kim
16
72
0
30 Jul 2020
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network
Yi-Chiao Wu
Tomoki Hayashi
Patrick Lumban Tobing
Kazuhiro Kobayashi
T. Toda
27
18
0
11 Jul 2020
IDF++: Analyzing and Improving Integer Discrete Flows for Lossless Compression
Rianne van den Berg
A. Gritsenko
Mostafa Dehghani
C. Sønderby
Tim Salimans
27
60
0
22 Jun 2020
Coupling-based Invertible Neural Networks Are Universal Diffeomorphism Approximators
Takeshi Teshima
Isao Ishikawa
Koichi Tojo
Kenta Oono
Masahiro Ikeda
Masashi Sugiyama
24
110
0
20 Jun 2020
Categorical Normalizing Flows via Continuous Transformations
Phillip Lippe
E. Gavves
BDL
23
43
0
17 Jun 2020
Why Normalizing Flows Fail to Detect Out-of-Distribution Data
Polina Kirichenko
Pavel Izmailov
A. Wilson
OODD
22
271
0
15 Jun 2020
NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity
Sang-gil Lee
Sungwon Kim
Sungroh Yoon
27
17
0
11 Jun 2020
SoftFlow: Probabilistic Framework for Normalizing Flow on Manifolds
Hyeongju Kim
Hyeonseung Lee
Woohyun Kang
Joun Yeop Lee
N. Kim
3DPC
25
114
0
08 Jun 2020
WaveNODE: A Continuous Normalizing Flow for Speech Synthesis
Hyeongju Kim
Hyeongseung Lee
Woohyun Kang
Sung Jun Cheon
Byoung Jin Choi
N. Kim
22
12
0
08 Jun 2020
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Yi Ren
Chenxu Hu
Xu Tan
Tao Qin
Sheng Zhao
Zhou Zhao
Tie-Yan Liu
60
1,362
0
08 Jun 2020
End-to-End Adversarial Text-to-Speech
Jeff Donahue
Sander Dieleman
Mikolaj Binkowski
Erich Elsen
Karen Simonyan
19
185
0
05 Jun 2020
Graphical Normalizing Flows
Antoine Wehenkel
Gilles Louppe
TPM
BDL
12
37
0
03 Jun 2020
Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search
Jaehyeon Kim
Sungwon Kim
Jungil Kong
Sungroh Yoon
54
478
0
22 May 2020
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive Pitch-dependent Dilated Convolution Model for Parametric Speech Generation
Yi-Chiao Wu
Tomoki Hayashi
T. Okamoto
Hisashi Kawai
T. Toda
31
4
0
18 May 2020
Many-to-Many Voice Transformer Network
Hirokazu Kameoka
Wen-Chin Huang
Kou Tanaka
Takuhiro Kaneko
Nobukatsu Hojo
T. Toda
ViT
30
30
0
18 May 2020
Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech
Geng Yang
Shan Yang
Kai-Chun Liu
Peng Fang
Wei Chen
Lei Xie
68
198
0
11 May 2020
C-Flow: Conditional Generative Flow Models for Images and 3D Point Clouds
Albert Pumarola
S. Popov
Francesc Moreno-Noguer
V. Ferrari
3DPC
AI4CE
31
80
0
15 Dec 2019
Normalizing Flows for Probabilistic Modeling and Inference
George Papamakarios
Eric T. Nalisnick
Danilo Jimenez Rezende
S. Mohamed
Balaji Lakshminarayanan
TPM
AI4CE
67
1,635
0
05 Dec 2019
WaveFlow: A Compact Flow-based Model for Raw Audio
Ming-Yu Liu
Kainan Peng
Kexin Zhao
Z. Song
25
116
0
03 Dec 2019
Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework
Mingbo Ma
Baigong Zheng
Kaibo Liu
Renjie Zheng
Hairong Liu
Kainan Peng
Kenneth Church
Liang Huang
22
29
0
07 Nov 2019
On Investigation of Unsupervised Speech Factorization Based on Normalization Flow
Haoran Sun
Yunqi Cai
Lantian Li
Dong Wang
21
1
0
29 Oct 2019
Neural Density Estimation and Likelihood-free Inference
George Papamakarios
BDL
DRL
30
44
0
29 Oct 2019
Neural Language Priors
Joseph Enguehard
Dan Busbridge
V. Zhelezniak
Nils Y. Hammerla
31
3
0
04 Oct 2019
High Fidelity Speech Synthesis with Adversarial Networks
Mikolaj Binkowski
Jeff Donahue
Sander Dieleman
Aidan Clark
Erich Elsen
Norman Casagrande
Luis C. Cobo
Karen Simonyan
243
239
0
25 Sep 2019
Normalizing Flows: An Introduction and Review of Current Methods
I. Kobyzev
S. Prince
Marcus A. Brubaker
TPM
MedIm
19
57
0
25 Aug 2019
Previous
1
2
3
Next