FloWaveNet : A Generative Flow for Raw Audio

6 November 2018

Papers citing "FloWaveNet : A Generative Flow for Raw Audio"

50 / 108 papers shown

Title
Out-of-Distribution Detection of Melanoma using Normalizing Flows M. Valiuddin C.G.A. Viviers OODD 24 0 0 23 Mar 2021
GAN Vocoder: Multi-Resolution Discriminator Is All You Need J. You Dalhyun Kim Gyuhyeon Nam Geumbyeol Hwang Gyeongsu Chae 21 27 0 09 Mar 2021
FloMo: Tractable Motion Prediction with Normalizing Flows Christoph Schöller Alois C. Knoll 22 22 0 05 Mar 2021
Generative Speech Coding with Predictive Variance Regularization W. Kleijn Andrew Storus Michael Chinen Tom Denton Felicia S. C. Lim Alejandro Luebs Jan Skoglund Hengchin Yeh 29 67 0 18 Feb 2021
AudioVisual Speech Synthesis: A brief literature review Efthymios Georgiou Athanasios Katsamanis 21 0 0 18 Feb 2021
PeriodNet: A non-autoregressive waveform generation model with a structure separating periodic and aperiodic components Yukiya Hono Shinji Takaki Kei Hashimoto Keiichiro Oura Yoshihiko Nankaku K. Tokuda 22 16 0 15 Feb 2021
Full-Glow: Fully conditional Glow for more realistic image generation Moein Sorkhei G. Henter Hedvig Kjellström 25 6 0 10 Dec 2020
Text-to-speech for the hearing impaired Josef Schlittenlacher T. Baer 14 0 0 03 Dec 2020
MelGlow: Efficient Waveform Generative Network Based on Location-Variable Convolution Zhen Zeng Jianzong Wang Ning Cheng Jing Xiao 14 8 0 03 Dec 2020
Empirical Evaluation of Deep Learning Model Compression Techniques on the WaveNet Vocoder Sam Davis Giuseppe Coccia Sam Gooch Julian Mack 14 0 0 20 Nov 2020
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis Ron J. Weiss RJ Skerry-Ryan Eric Battenberg Soroosh Mariooryad Diederik P. Kingma 24 98 0 06 Nov 2020
Problems using deep generative models for probabilistic audio source separation M. Frank Maximilian Ilse DiffM 15 4 0 03 Nov 2020
Speech Synthesis and Control Using Differentiable DSP Giorgio Fabbro Vladimir Golkov Thomas Kemp Daniel Cremers 28 12 0 28 Oct 2020
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators Ryuichi Yamamoto Eunwoo Song Min-Jae Hwang Jae-Min Kim 29 18 0 27 Oct 2020
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders Wen-Chin Huang Patrick Lumban Tobing Yi-Chiao Wu Kazuhiro Kobayashi T. Toda 21 8 0 09 Oct 2020
Improving Sequential Latent Variable Models with Autoregressive Flows Joseph Marino Lei Chen Jiawei He Stephan Mandt BDL AI4TS 30 12 0 07 Oct 2020
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics Hirokazu Kameoka Takuhiro Kaneko Kou Tanaka Nobukatsu Hojo Shogo Seki DiffM 28 21 0 06 Oct 2020
Haar Wavelet based Block Autoregressive Flows for Trajectories Apratim Bhattacharyya C. Straehle Mario Fritz Bernt Schiele AI4TS 26 15 0 21 Sep 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis Zhifeng Kong Ming-Yu Liu Jiaji Huang Kexin Zhao Bryan Catanzaro DiffM BDL 36 1,397 0 21 Sep 2020
WaveGrad: Estimating Gradients for Waveform Generation Nanxin Chen Yu Zhang Heiga Zen Ron J. Weiss Mohammad Norouzi William Chan DiffM BDL 16 773 0 02 Sep 2020
Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks Hirokazu Kameoka Takuhiro Kaneko Kou Tanaka Nobukatsu Hojo 18 20 0 27 Aug 2020
Audio Dequantization for High Fidelity Audio Generation in Flow-based Neural Vocoder Hyun-Wook Yoon Sang-Hoon Lee Hyeong-Rae Noh Seong-Whan Lee 20 11 0 16 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning Berrak Sisman Junichi Yamagishi Simon King Haizhou Li BDL 45 318 0 09 Aug 2020
Unsupervised Cross-Domain Singing Voice Conversion Adam Polyak Lior Wolf Yossi Adi Yaniv Taigman 20 44 0 06 Aug 2020
A Spectral Energy Distance for Parallel Speech Synthesis A. Gritsenko Tim Salimans Rianne van den Berg Jasper Snoek Nal Kalchbrenner 13 70 0 03 Aug 2020
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network Jinhyeok Yang Junmo Lee Young-Ik Kim Hoonyoung Cho Injung Kim 16 72 0 30 Jul 2020
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network Yi-Chiao Wu Tomoki Hayashi Patrick Lumban Tobing Kazuhiro Kobayashi T. Toda 27 18 0 11 Jul 2020
IDF++: Analyzing and Improving Integer Discrete Flows for Lossless Compression Rianne van den Berg A. Gritsenko Mostafa Dehghani C. Sønderby Tim Salimans 27 60 0 22 Jun 2020
Coupling-based Invertible Neural Networks Are Universal Diffeomorphism Approximators Takeshi Teshima Isao Ishikawa Koichi Tojo Kenta Oono Masahiro Ikeda Masashi Sugiyama 24 110 0 20 Jun 2020
Categorical Normalizing Flows via Continuous Transformations Phillip Lippe E. Gavves BDL 23 43 0 17 Jun 2020
Why Normalizing Flows Fail to Detect Out-of-Distribution Data Polina Kirichenko Pavel Izmailov A. Wilson OODD 22 271 0 15 Jun 2020
NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity Sang-gil Lee Sungwon Kim Sungroh Yoon 27 17 0 11 Jun 2020
SoftFlow: Probabilistic Framework for Normalizing Flow on Manifolds Hyeongju Kim Hyeonseung Lee Woohyun Kang Joun Yeop Lee N. Kim 3DPC 25 114 0 08 Jun 2020
WaveNODE: A Continuous Normalizing Flow for Speech Synthesis Hyeongju Kim Hyeongseung Lee Woohyun Kang Sung Jun Cheon Byoung Jin Choi N. Kim 22 12 0 08 Jun 2020
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech Yi Ren Chenxu Hu Xu Tan Tao Qin Sheng Zhao Zhou Zhao Tie-Yan Liu 60 1,362 0 08 Jun 2020
End-to-End Adversarial Text-to-Speech Jeff Donahue Sander Dieleman Mikolaj Binkowski Erich Elsen Karen Simonyan 19 185 0 05 Jun 2020
Graphical Normalizing Flows Antoine Wehenkel Gilles Louppe TPM BDL 12 37 0 03 Jun 2020
Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search Jaehyeon Kim Sungwon Kim Jungil Kong Sungroh Yoon 54 478 0 22 May 2020
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive Pitch-dependent Dilated Convolution Model for Parametric Speech Generation Yi-Chiao Wu Tomoki Hayashi T. Okamoto Hisashi Kawai T. Toda 31 4 0 18 May 2020
Many-to-Many Voice Transformer Network Hirokazu Kameoka Wen-Chin Huang Kou Tanaka Takuhiro Kaneko Nobukatsu Hojo T. Toda ViT 30 30 0 18 May 2020
Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech Geng Yang Shan Yang Kai-Chun Liu Peng Fang Wei Chen Lei Xie 68 198 0 11 May 2020
C-Flow: Conditional Generative Flow Models for Images and 3D Point Clouds Albert Pumarola S. Popov Francesc Moreno-Noguer V. Ferrari 3DPC AI4CE 31 80 0 15 Dec 2019
Normalizing Flows for Probabilistic Modeling and Inference George Papamakarios Eric T. Nalisnick Danilo Jimenez Rezende S. Mohamed Balaji Lakshminarayanan TPM AI4CE 67 1,635 0 05 Dec 2019
WaveFlow: A Compact Flow-based Model for Raw Audio Ming-Yu Liu Kainan Peng Kexin Zhao Z. Song 25 116 0 03 Dec 2019
Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework Mingbo Ma Baigong Zheng Kaibo Liu Renjie Zheng Hairong Liu Kainan Peng Kenneth Church Liang Huang 22 29 0 07 Nov 2019
On Investigation of Unsupervised Speech Factorization Based on Normalization Flow Haoran Sun Yunqi Cai Lantian Li Dong Wang 21 1 0 29 Oct 2019
Neural Density Estimation and Likelihood-free Inference George Papamakarios BDL DRL 30 44 0 29 Oct 2019
Neural Language Priors Joseph Enguehard Dan Busbridge V. Zhelezniak Nils Y. Hammerla 31 3 0 04 Oct 2019
High Fidelity Speech Synthesis with Adversarial Networks Mikolaj Binkowski Jeff Donahue Sander Dieleman Aidan Clark Erich Elsen Norman Casagrande Luis C. Cobo Karen Simonyan 243 239 0 25 Sep 2019
Normalizing Flows: An Introduction and Review of Current Methods I. Kobyzev S. Prince Marcus A. Brubaker TPM MedIm 19 57 0 25 Aug 2019