ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.12088
  4. Cited By
Neural source-filter waveform models for statistical parametric speech
  synthesis

Neural source-filter waveform models for statistical parametric speech synthesis

27 April 2019
Xin Wang
Shinji Takaki
Junichi Yamagishi
ArXivPDFHTML

Papers citing "Neural source-filter waveform models for statistical parametric speech synthesis"

36 / 86 papers shown
Title
DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding
DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding
Sergey Nikonorov
Berrak Sisman
Mingyang Zhang
Haizhou Li
23
2
0
13 Oct 2021
Towards Universal Neural Vocoding with a Multi-band Excited WaveNet
Towards Universal Neural Vocoding with a Multi-band Excited WaveNet
Axel Roebel
F. Bous
29
2
0
07 Oct 2021
Decoupling Speaker-Independent Emotions for Voice Conversion Via
  Source-Filter Networks
Decoupling Speaker-Independent Emotions for Voice Conversion Via Source-Filter Networks
Zhaojie Luo
Shoufeng Lin
Rui Liu
Jun Baba
Y. Yoshikawa
H. Ishiguro
17
8
0
04 Oct 2021
MSR-NV: Neural Vocoder Using Multiple Sampling Rates
MSR-NV: Neural Vocoder Using Multiple Sampling Rates
Kentaro Mitsui
Kei Sawada
20
0
0
28 Sep 2021
Use of speaker recognition approaches for learning and evaluating
  embedding representations of musical instrument sounds
Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument sounds
Xuan Shi
Erica Cooper
Junichi Yamagishi
33
7
0
24 Jul 2021
Neural Waveshaping Synthesis
Neural Waveshaping Synthesis
B. Hayes
C. Saitis
Gyorgy Fazekas
36
28
0
11 Jul 2021
A Survey on Neural Speech Synthesis
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
18
352
0
29 Jun 2021
LoopNet: Musical Loop Synthesis Conditioned On Intuitive Musical
  Parameters
LoopNet: Musical Loop Synthesis Conditioned On Intuitive Musical Parameters
Pritish Chandna
António Ramires
Xavier Serra
Emilia Gómez
24
4
0
21 May 2021
One Billion Audio Sounds from GPU-enabled Modular Synthesis
One Billion Audio Sounds from GPU-enabled Modular Synthesis
Joseph P. Turian
Jordie Shier
George Tzanetakis
K. McNally
Max Henry
21
22
0
27 Apr 2021
Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis
Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis
Erica Cooper
Xin Wang
Junichi Yamagishi
31
6
0
25 Apr 2021
Unified Source-Filter GAN: Unified Source-filter Network Based On
  Factorization of Quasi-Periodic Parallel WaveGAN
Unified Source-Filter GAN: Unified Source-filter Network Based On Factorization of Quasi-Periodic Parallel WaveGAN
Reo Yoneyama
Yi-Chiao Wu
T. Toda
14
12
0
10 Apr 2021
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
CycleDRUMS: Automatic Drum Arrangement For Bass Lines Using CycleGAN
Giorgio Barnabò
Giovanni Trappolini
L. Lastilla
Cesare Campagnano
Angela Fan
Fabio Petroni
Fabrizio Silvestri
14
4
0
01 Apr 2021
PeriodNet: A non-autoregressive waveform generation model with a
  structure separating periodic and aperiodic components
PeriodNet: A non-autoregressive waveform generation model with a structure separating periodic and aperiodic components
Yukiya Hono
Shinji Takaki
Kei Hashimoto
Keiichiro Oura
Yoshihiko Nankaku
K. Tokuda
14
16
0
15 Feb 2021
A Study of F0 Modification for X-Vector Based Speech Pseudonymization
  Across Gender
A Study of F0 Modification for X-Vector Based Speech Pseudonymization Across Gender
Pierre Champion
D. Jouvet
Anthony Larcher
16
24
0
21 Jan 2021
I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at
  Pitch
I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at Pitch
Joseph P. Turian
Max Henry
24
29
0
08 Dec 2020
Denoising-and-Dereverberation Hierarchical Neural Vocoder for Robust
  Waveform Generation
Denoising-and-Dereverberation Hierarchical Neural Vocoder for Robust Waveform Generation
Yang Ai
Haoyu Li
Xin Wang
Junichi Yamagishi
Zhenhua Ling
9
4
0
08 Nov 2020
Speech Synthesis and Control Using Differentiable DSP
Speech Synthesis and Control Using Differentiable DSP
Giorgio Fabbro
Vladimir Golkov
Thomas Kemp
Daniel Cremers
20
12
0
28 Oct 2020
WaveGrad: Estimating Gradients for Waveform Generation
WaveGrad: Estimating Gradients for Waveform Generation
Nanxin Chen
Yu Zhang
Heiga Zen
Ron J. Weiss
Mohammad Norouzi
William Chan
DiffM
BDL
14
772
0
02 Sep 2020
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and
  cross-lingual voice conversion
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion
Yi Zhao
Wen-Chin Huang
Xiaohai Tian
Junichi Yamagishi
Rohan Kumar Das
Tomi Kinnunen
Zhenhua Ling
T. Toda
27
206
0
28 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical
  Modeling to Deep Learning
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
41
318
0
09 Aug 2020
HooliGAN: Robust, High Quality Neural Vocoding
HooliGAN: Robust, High Quality Neural Vocoding
Ollie McCarthy
Zo Ahmed
8
14
0
06 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with
  Adversarial Learning
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
15
6
0
05 Aug 2020
Neural Granular Sound Synthesis
Neural Granular Sound Synthesis
Adrien Bitton
P. Esling
Tatsuya Harada
16
7
0
04 Aug 2020
Diet deep generative audio models with structured lottery
Diet deep generative audio models with structured lottery
P. Esling
Ninon Devis
Adrien Bitton
Antoine Caillon
Axel Chemla-Romeu-Santos
Constance Douwes
11
6
0
31 Jul 2020
Vector-Quantized Timbre Representation
Vector-Quantized Timbre Representation
Adrien Bitton
P. Esling
Tatsuya Harada
20
12
0
13 Jul 2020
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model
  with Pitch-dependent Dilated Convolution Neural Network
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network
Yi-Chiao Wu
Tomoki Hayashi
Patrick Lumban Tobing
Kazuhiro Kobayashi
T. Toda
27
18
0
11 Jul 2020
NAUTILUS: a Versatile Voice Cloning System
NAUTILUS: a Versatile Voice Cloning System
Hieu-Thi Luong
Junichi Yamagishi
28
51
0
22 May 2020
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive
  Pitch-dependent Dilated Convolution Model for Parametric Speech Generation
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive Pitch-dependent Dilated Convolution Model for Parametric Speech Generation
Yi-Chiao Wu
Tomoki Hayashi
T. Okamoto
Hisashi Kawai
T. Toda
29
4
0
18 May 2020
Reverberation Modeling for Source-Filter-based Neural Vocoder
Reverberation Modeling for Source-Filter-based Neural Vocoder
Yang Ai
Xin Wang
Junichi Yamagishi
Zhenhua Ling
20
3
0
15 May 2020
Exploring TTS without T Using Biologically/Psychologically Motivated
  Neural Network Modules (ZeroSpeech 2020)
Exploring TTS without T Using Biologically/Psychologically Motivated Neural Network Modules (ZeroSpeech 2020)
Takashi Morita
H. Koda
10
8
0
11 May 2020
Knowledge-and-Data-Driven Amplitude Spectrum Prediction for Hierarchical
  Neural Vocoders
Knowledge-and-Data-Driven Amplitude Spectrum Prediction for Hierarchical Neural Vocoders
Yang Ai
Zhenhua Ling
11
8
0
16 Apr 2020
Semi-supervised learning of glottal pulse positions in a neural
  analysis-synthesis framework
Semi-supervised learning of glottal pulse positions in a neural analysis-synthesis framework
F. Bous
Luc Ardaillon
Axel Roebel
6
1
0
02 Mar 2020
DDSP: Differentiable Digital Signal Processing
DDSP: Differentiable Digital Signal Processing
Jesse Engel
Lamtharn Hantrakul
Chenjie Gu
Adam Roberts
DiffM
96
373
0
14 Jan 2020
Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice
  Frequency for Text-to-Speech Synthesis
Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis
Xin Wang
Junichi Yamagishi
14
31
0
27 Aug 2019
A Neural Vocoder with Hierarchical Generation of Amplitude and Phase
  Spectra for Statistical Parametric Speech Synthesis
A Neural Vocoder with Hierarchical Generation of Amplitude and Phase Spectra for Statistical Parametric Speech Synthesis
Yang Ai
Zhenhua Ling
21
29
0
23 Jun 2019
Sequence-to-Sequence Neural Net Models for Grapheme-to-Phoneme
  Conversion
Sequence-to-Sequence Neural Net Models for Grapheme-to-Phoneme Conversion
Kaisheng Yao
Geoffrey Zweig
48
163
0
31 May 2015
Previous
12