Neural source-filter waveform models for statistical parametric speech synthesis

27 April 2019

Xin Wang

Papers citing "Neural source-filter waveform models for statistical parametric speech synthesis"

36 / 86 papers shown

Title
DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding Sergey Nikonorov Berrak Sisman Mingyang Zhang Haizhou Li 23 2 0 13 Oct 2021
Towards Universal Neural Vocoding with a Multi-band Excited WaveNet Axel Roebel F. Bous 29 2 0 07 Oct 2021
Decoupling Speaker-Independent Emotions for Voice Conversion Via Source-Filter Networks Zhaojie Luo Shoufeng Lin Rui Liu Jun Baba Y. Yoshikawa H. Ishiguro 17 8 0 04 Oct 2021
MSR-NV: Neural Vocoder Using Multiple Sampling Rates Kentaro Mitsui Kei Sawada 20 0 0 28 Sep 2021
Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument sounds Xuan Shi Erica Cooper Junichi Yamagishi 33 7 0 24 Jul 2021
Neural Waveshaping Synthesis B. Hayes C. Saitis Gyorgy Fazekas 36 28 0 11 Jul 2021
A Survey on Neural Speech Synthesis Xu Tan Tao Qin Frank Soong Tie-Yan Liu AI4TS 18 352 0 29 Jun 2021
LoopNet: Musical Loop Synthesis Conditioned On Intuitive Musical Parameters Pritish Chandna António Ramires Xavier Serra Emilia Gómez 24 4 0 21 May 2021
One Billion Audio Sounds from GPU-enabled Modular Synthesis Joseph P. Turian Jordie Shier George Tzanetakis K. McNally Max Henry 21 22 0 27 Apr 2021
Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis Erica Cooper Xin Wang Junichi Yamagishi 31 6 0 25 Apr 2021
Unified Source-Filter GAN: Unified Source-filter Network Based On Factorization of Quasi-Periodic Parallel WaveGAN Reo Yoneyama Yi-Chiao Wu T. Toda 14 12 0 10 Apr 2021
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN Giorgio Barnabò Giovanni Trappolini L. Lastilla Cesare Campagnano Angela Fan Fabio Petroni Fabrizio Silvestri 14 4 0 01 Apr 2021
PeriodNet: A non-autoregressive waveform generation model with a structure separating periodic and aperiodic components Yukiya Hono Shinji Takaki Kei Hashimoto Keiichiro Oura Yoshihiko Nankaku K. Tokuda 14 16 0 15 Feb 2021
A Study of F0 Modification for X-Vector Based Speech Pseudonymization Across Gender Pierre Champion D. Jouvet Anthony Larcher 16 24 0 21 Jan 2021
I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at Pitch Joseph P. Turian Max Henry 24 29 0 08 Dec 2020
Denoising-and-Dereverberation Hierarchical Neural Vocoder for Robust Waveform Generation Yang Ai Haoyu Li Xin Wang Junichi Yamagishi Zhenhua Ling 9 4 0 08 Nov 2020
Speech Synthesis and Control Using Differentiable DSP Giorgio Fabbro Vladimir Golkov Thomas Kemp Daniel Cremers 20 12 0 28 Oct 2020
WaveGrad: Estimating Gradients for Waveform Generation Nanxin Chen Yu Zhang Heiga Zen Ron J. Weiss Mohammad Norouzi William Chan DiffM BDL 14 772 0 02 Sep 2020
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion Yi Zhao Wen-Chin Huang Xiaohai Tian Junichi Yamagishi Rohan Kumar Das Tomi Kinnunen Zhenhua Ling T. Toda 27 206 0 28 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning Berrak Sisman Junichi Yamagishi Simon King Haizhou Li BDL 41 318 0 09 Aug 2020
HooliGAN: Robust, High Quality Neural Vocoding Ollie McCarthy Zo Ahmed 8 14 0 06 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning Jing-Xuan Zhang Zhenhua Ling Lirong Dai 15 6 0 05 Aug 2020
Neural Granular Sound Synthesis Adrien Bitton P. Esling Tatsuya Harada 16 7 0 04 Aug 2020
Diet deep generative audio models with structured lottery P. Esling Ninon Devis Adrien Bitton Antoine Caillon Axel Chemla-Romeu-Santos Constance Douwes 11 6 0 31 Jul 2020
Vector-Quantized Timbre Representation Adrien Bitton P. Esling Tatsuya Harada 20 12 0 13 Jul 2020
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network Yi-Chiao Wu Tomoki Hayashi Patrick Lumban Tobing Kazuhiro Kobayashi T. Toda 27 18 0 11 Jul 2020
NAUTILUS: a Versatile Voice Cloning System Hieu-Thi Luong Junichi Yamagishi 28 51 0 22 May 2020
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive Pitch-dependent Dilated Convolution Model for Parametric Speech Generation Yi-Chiao Wu Tomoki Hayashi T. Okamoto Hisashi Kawai T. Toda 29 4 0 18 May 2020
Reverberation Modeling for Source-Filter-based Neural Vocoder Yang Ai Xin Wang Junichi Yamagishi Zhenhua Ling 20 3 0 15 May 2020
Exploring TTS without T Using Biologically/Psychologically Motivated Neural Network Modules (ZeroSpeech 2020) Takashi Morita H. Koda 10 8 0 11 May 2020
Knowledge-and-Data-Driven Amplitude Spectrum Prediction for Hierarchical Neural Vocoders Yang Ai Zhenhua Ling 11 8 0 16 Apr 2020
Semi-supervised learning of glottal pulse positions in a neural analysis-synthesis framework F. Bous Luc Ardaillon Axel Roebel 6 1 0 02 Mar 2020
DDSP: Differentiable Digital Signal Processing Jesse Engel Lamtharn Hantrakul Chenjie Gu Adam Roberts DiffM 96 373 0 14 Jan 2020
Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis Xin Wang Junichi Yamagishi 14 31 0 27 Aug 2019
A Neural Vocoder with Hierarchical Generation of Amplitude and Phase Spectra for Statistical Parametric Speech Synthesis Yang Ai Zhenhua Ling 21 29 0 23 Jun 2019
Sequence-to-Sequence Neural Net Models for Grapheme-to-Phoneme Conversion Kaisheng Yao Geoffrey Zweig 48 163 0 31 May 2015