ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.02155
  4. Cited By
FloWaveNet : A Generative Flow for Raw Audio

FloWaveNet : A Generative Flow for Raw Audio

6 November 2018
Sungwon Kim
Sang-gil Lee
Jongyoon Song
Jaehyeon Kim
Sungroh Yoon
ArXivPDFHTML

Papers citing "FloWaveNet : A Generative Flow for Raw Audio"

50 / 108 papers shown
Title
Out-of-Distribution Detection of Melanoma using Normalizing Flows
Out-of-Distribution Detection of Melanoma using Normalizing Flows
M. Valiuddin
C.G.A. Viviers
OODD
24
0
0
23 Mar 2021
GAN Vocoder: Multi-Resolution Discriminator Is All You Need
GAN Vocoder: Multi-Resolution Discriminator Is All You Need
J. You
Dalhyun Kim
Gyuhyeon Nam
Geumbyeol Hwang
Gyeongsu Chae
21
27
0
09 Mar 2021
FloMo: Tractable Motion Prediction with Normalizing Flows
FloMo: Tractable Motion Prediction with Normalizing Flows
Christoph Schöller
Alois C. Knoll
22
22
0
05 Mar 2021
Generative Speech Coding with Predictive Variance Regularization
Generative Speech Coding with Predictive Variance Regularization
W. Kleijn
Andrew Storus
Michael Chinen
Tom Denton
Felicia S. C. Lim
Alejandro Luebs
Jan Skoglund
Hengchin Yeh
29
67
0
18 Feb 2021
AudioVisual Speech Synthesis: A brief literature review
AudioVisual Speech Synthesis: A brief literature review
Efthymios Georgiou
Athanasios Katsamanis
21
0
0
18 Feb 2021
PeriodNet: A non-autoregressive waveform generation model with a
  structure separating periodic and aperiodic components
PeriodNet: A non-autoregressive waveform generation model with a structure separating periodic and aperiodic components
Yukiya Hono
Shinji Takaki
Kei Hashimoto
Keiichiro Oura
Yoshihiko Nankaku
K. Tokuda
22
16
0
15 Feb 2021
Full-Glow: Fully conditional Glow for more realistic image generation
Full-Glow: Fully conditional Glow for more realistic image generation
Moein Sorkhei
G. Henter
Hedvig Kjellström
25
6
0
10 Dec 2020
Text-to-speech for the hearing impaired
Text-to-speech for the hearing impaired
Josef Schlittenlacher
T. Baer
14
0
0
03 Dec 2020
MelGlow: Efficient Waveform Generative Network Based on
  Location-Variable Convolution
MelGlow: Efficient Waveform Generative Network Based on Location-Variable Convolution
Zhen Zeng
Jianzong Wang
Ning Cheng
Jing Xiao
14
8
0
03 Dec 2020
Empirical Evaluation of Deep Learning Model Compression Techniques on
  the WaveNet Vocoder
Empirical Evaluation of Deep Learning Model Compression Techniques on the WaveNet Vocoder
Sam Davis
Giuseppe Coccia
Sam Gooch
Julian Mack
14
0
0
20 Nov 2020
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis
Ron J. Weiss
RJ Skerry-Ryan
Eric Battenberg
Soroosh Mariooryad
Diederik P. Kingma
24
98
0
06 Nov 2020
Problems using deep generative models for probabilistic audio source
  separation
Problems using deep generative models for probabilistic audio source separation
M. Frank
Maximilian Ilse
DiffM
15
4
0
03 Nov 2020
Speech Synthesis and Control Using Differentiable DSP
Speech Synthesis and Control Using Differentiable DSP
Giorgio Fabbro
Vladimir Golkov
Thomas Kemp
Daniel Cremers
28
12
0
28 Oct 2020
Parallel waveform synthesis based on generative adversarial networks
  with voicing-aware conditional discriminators
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
Ryuichi Yamamoto
Eunwoo Song
Min-Jae Hwang
Jae-Min Kim
29
18
0
27 Oct 2020
The NU Voice Conversion System for the Voice Conversion Challenge 2020:
  On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural
  Vocoders
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders
Wen-Chin Huang
Patrick Lumban Tobing
Yi-Chiao Wu
Kazuhiro Kobayashi
T. Toda
21
8
0
09 Oct 2020
Improving Sequential Latent Variable Models with Autoregressive Flows
Improving Sequential Latent Variable Models with Autoregressive Flows
Joseph Marino
Lei Chen
Jiawei He
Stephan Mandt
BDL
AI4TS
30
12
0
07 Oct 2020
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed
  Langevin Dynamics
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
Shogo Seki
DiffM
28
21
0
06 Oct 2020
Haar Wavelet based Block Autoregressive Flows for Trajectories
Haar Wavelet based Block Autoregressive Flows for Trajectories
Apratim Bhattacharyya
C. Straehle
Mario Fritz
Bernt Schiele
AI4TS
26
15
0
21 Sep 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong
Ming-Yu Liu
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
36
1,397
0
21 Sep 2020
WaveGrad: Estimating Gradients for Waveform Generation
WaveGrad: Estimating Gradients for Waveform Generation
Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
William Chan
DiffM
BDL
16
773
0
02 Sep 2020
Nonparallel Voice Conversion with Augmented Classifier Star Generative
  Adversarial Networks
Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
18
20
0
27 Aug 2020
Audio Dequantization for High Fidelity Audio Generation in Flow-based
  Neural Vocoder
Audio Dequantization for High Fidelity Audio Generation in Flow-based Neural Vocoder
Hyun-Wook Yoon
Sang-Hoon Lee
Hyeong-Rae Noh
Seong-Whan Lee
20
11
0
16 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical
  Modeling to Deep Learning
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
45
318
0
09 Aug 2020
Unsupervised Cross-Domain Singing Voice Conversion
Unsupervised Cross-Domain Singing Voice Conversion
Adam Polyak
Lior Wolf
Yossi Adi
Yaniv Taigman
20
44
0
06 Aug 2020
A Spectral Energy Distance for Parallel Speech Synthesis
A Spectral Energy Distance for Parallel Speech Synthesis
A. Gritsenko
Tim Salimans
Rianne van den Berg
Jasper Snoek
Nal Kalchbrenner
13
70
0
03 Aug 2020
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested
  Adversarial Network
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Jinhyeok Yang
Junmo Lee
Young-Ik Kim
Hoonyoung Cho
Injung Kim
16
72
0
30 Jul 2020
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model
  with Pitch-dependent Dilated Convolution Neural Network
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network
Yi-Chiao Wu
Tomoki Hayashi
Patrick Lumban Tobing
Kazuhiro Kobayashi
T. Toda
27
18
0
11 Jul 2020
IDF++: Analyzing and Improving Integer Discrete Flows for Lossless
  Compression
IDF++: Analyzing and Improving Integer Discrete Flows for Lossless Compression
Rianne van den Berg
A. Gritsenko
Mostafa Dehghani
C. Sønderby
Tim Salimans
27
60
0
22 Jun 2020
Coupling-based Invertible Neural Networks Are Universal Diffeomorphism
  Approximators
Coupling-based Invertible Neural Networks Are Universal Diffeomorphism Approximators
Takeshi Teshima
Isao Ishikawa
Koichi Tojo
Kenta Oono
Masahiro Ikeda
Masashi Sugiyama
24
110
0
20 Jun 2020
Categorical Normalizing Flows via Continuous Transformations
Categorical Normalizing Flows via Continuous Transformations
Phillip Lippe
E. Gavves
BDL
23
43
0
17 Jun 2020
Why Normalizing Flows Fail to Detect Out-of-Distribution Data
Why Normalizing Flows Fail to Detect Out-of-Distribution Data
Polina Kirichenko
Pavel Izmailov
A. Wilson
OODD
22
271
0
15 Jun 2020
NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity
NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity
Sang-gil Lee
Sungwon Kim
Sungroh Yoon
27
17
0
11 Jun 2020
SoftFlow: Probabilistic Framework for Normalizing Flow on Manifolds
SoftFlow: Probabilistic Framework for Normalizing Flow on Manifolds
Hyeongju Kim
Hyeonseung Lee
Woohyun Kang
Joun Yeop Lee
N. Kim
3DPC
25
114
0
08 Jun 2020
WaveNODE: A Continuous Normalizing Flow for Speech Synthesis
WaveNODE: A Continuous Normalizing Flow for Speech Synthesis
Hyeongju Kim
Hyeongseung Lee
Woohyun Kang
Sung Jun Cheon
Byoung Jin Choi
N. Kim
22
12
0
08 Jun 2020
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Yi Ren
Chenxu Hu
Xu Tan
Tao Qin
Sheng Zhao
Zhou Zhao
Tie-Yan Liu
60
1,362
0
08 Jun 2020
End-to-End Adversarial Text-to-Speech
End-to-End Adversarial Text-to-Speech
Jeff Donahue
Sander Dieleman
Mikolaj Binkowski
Erich Elsen
Karen Simonyan
19
185
0
05 Jun 2020
Graphical Normalizing Flows
Graphical Normalizing Flows
Antoine Wehenkel
Gilles Louppe
TPM
BDL
12
37
0
03 Jun 2020
Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment
  Search
Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search
Jaehyeon Kim
Sungwon Kim
Jungil Kong
Sungroh Yoon
54
478
0
22 May 2020
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive
  Pitch-dependent Dilated Convolution Model for Parametric Speech Generation
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive Pitch-dependent Dilated Convolution Model for Parametric Speech Generation
Yi-Chiao Wu
Tomoki Hayashi
T. Okamoto
Hisashi Kawai
T. Toda
31
4
0
18 May 2020
Many-to-Many Voice Transformer Network
Many-to-Many Voice Transformer Network
Hirokazu Kameoka
Wen-Chin Huang
Kou Tanaka
Takuhiro Kaneko
Nobukatsu Hojo
T. Toda
ViT
30
30
0
18 May 2020
Multi-band MelGAN: Faster Waveform Generation for High-Quality
  Text-to-Speech
Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech
Geng Yang
Shan Yang
Kai-Chun Liu
Peng Fang
Wei Chen
Lei Xie
68
198
0
11 May 2020
C-Flow: Conditional Generative Flow Models for Images and 3D Point
  Clouds
C-Flow: Conditional Generative Flow Models for Images and 3D Point Clouds
Albert Pumarola
S. Popov
Francesc Moreno-Noguer
V. Ferrari
3DPC
AI4CE
31
80
0
15 Dec 2019
Normalizing Flows for Probabilistic Modeling and Inference
Normalizing Flows for Probabilistic Modeling and Inference
George Papamakarios
Eric T. Nalisnick
Danilo Jimenez Rezende
S. Mohamed
Balaji Lakshminarayanan
TPM
AI4CE
67
1,635
0
05 Dec 2019
WaveFlow: A Compact Flow-based Model for Raw Audio
WaveFlow: A Compact Flow-based Model for Raw Audio
Ming-Yu Liu
Kainan Peng
Kexin Zhao
Z. Song
25
116
0
03 Dec 2019
Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework
Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework
Mingbo Ma
Baigong Zheng
Kaibo Liu
Renjie Zheng
Hairong Liu
Kainan Peng
Kenneth Church
Liang Huang
22
29
0
07 Nov 2019
On Investigation of Unsupervised Speech Factorization Based on
  Normalization Flow
On Investigation of Unsupervised Speech Factorization Based on Normalization Flow
Haoran Sun
Yunqi Cai
Lantian Li
Dong Wang
21
1
0
29 Oct 2019
Neural Density Estimation and Likelihood-free Inference
Neural Density Estimation and Likelihood-free Inference
George Papamakarios
BDL
DRL
30
44
0
29 Oct 2019
Neural Language Priors
Neural Language Priors
Joseph Enguehard
Dan Busbridge
V. Zhelezniak
Nils Y. Hammerla
31
3
0
04 Oct 2019
High Fidelity Speech Synthesis with Adversarial Networks
High Fidelity Speech Synthesis with Adversarial Networks
Mikolaj Binkowski
Jeff Donahue
Sander Dieleman
Aidan Clark
Erich Elsen
Norman Casagrande
Luis C. Cobo
Karen Simonyan
243
239
0
25 Sep 2019
Normalizing Flows: An Introduction and Review of Current Methods
Normalizing Flows: An Introduction and Review of Current Methods
I. Kobyzev
S. Prince
Marcus A. Brubaker
TPM
MedIm
19
57
0
25 Aug 2019
Previous
123
Next