ResearchTrend.AI
Pretraining Techniques for Sequence-to-Sequence Voice Conversion

7 August 2020
Wen-Chin Huang
Tomoki Hayashi
Yi-Chiao Wu
Hirokazu Kameoka
T. Toda
arXiv: 2008.03088

Papers citing "Pretraining Techniques for Sequence-to-Sequence Voice Conversion"

19 / 19 papers shown
VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space
Armani Rodriguez
S. Kokalj-Filipovic
70
0
0
22 Nov 2024
A Pilot Study of Applying Sequence-to-Sequence Voice Conversion to Evaluate the Intelligibility of L2 Speech Using a Native Speaker's Shadowings
Haopeng Geng
Daisuke Saito
N. Minematsu
18
1
0
03 Oct 2024
Simulating Native Speaker Shadowing for Nonnative Speech Assessment with Latent Speech Representations
Haopeng Geng
Daisuke Saito
Nobuaki Minematsu
25
0
0
18 Sep 2024
Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders
Lester Phillip Violeta
Wen-Chin Huang
D. Ma
Ryuichi Yamamoto
Kazuhiro Kobayashi
T. Toda
19
3
0
18 Sep 2023
Parallel and Limited Data Voice Conversion Using Stochastic Variational Deep Kernel Learning
Mohamadreza Jafaryani
H. Sheikhzadeh
V. Pourahmadi
14
4
0
08 Sep 2023
Evaluating Methods for Ground-Truth-Free Foreign Accent Conversion
Wen-Chin Huang
T. Toda
CVBM
21
5
0
05 Sep 2023
The Singing Voice Conversion Challenge 2023
Wen-Chin Huang
Lester Phillip Violeta
Songxiang Liu
Jiatong Shi
T. Toda
16
46
0
26 Jun 2023
Emotion Intensity and its Control for Emotional Voice Conversion
Kun Zhou
Berrak Sisman
R. Rana
Björn W. Schuller
Haizhou Li
52
54
0
10 Jan 2022
Towards Identity Preserving Normal to Dysarthric Voice Conversion
Wen-Chin Huang
B. Halpern
Lester Phillip Violeta
O. Scharenborg
T. Toda
29
21
0
15 Oct 2021
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
Junyi Ao
Rui Wang
Long Zhou
Chengyi Wang
Shuo Ren
...
Yu Zhang
Zhihua Wei
Yao Qian
Jinyu Li
Furu Wei
118
193
0
14 Oct 2021
Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion
Yi-Syuan Liou
Wen-Chin Huang
Ming-Chi Yen
S. Tsai
Yu-Huai Peng
T. Toda
Yu Tsao
Hsin-Min Wang
14
1
0
08 Sep 2021
On Prosody Modeling for ASR+TTS based Voice Conversion
Wen-Chin Huang
Tomoki Hayashi
Xinjian Li
Shinji Watanabe
T. Toda
25
8
0
20 Jul 2021
Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance
Hieu-Thi Luong
Junichi Yamagishi
17
0
0
25 Jun 2021
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion
Wen-Chin Huang
Kazuhiro Kobayashi
Yu-Huai Peng
Ching-Feng Liu
Yu Tsao
Hsin-Min Wang
T. Toda
18
10
0
02 Jun 2021
MASS: Multi-task Anthropomorphic Speech Synthesis Framework
Jinyin Chen
Linhui Ye
Zhaoyan Ming
6
6
0
10 May 2021
Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations
Wen-Chin Huang
Yi-Chiao Wu
Tomoki Hayashi
T. Toda
BDL
44
37
0
23 Oct 2020
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders
Wen-Chin Huang
Patrick Lumban Tobing
Yi-Chiao Wu
Kazuhiro Kobayashi
T. Toda
17
8
0
09 Oct 2020
Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling
Songxiang Liu
Yuewen Cao
Disong Wang
Xixin Wu
Xunying Liu
Helen Meng
BDL
21
88
0
06 Sep 2020
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,926
0
17 Aug 2015