Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2102.12841
Cited By
MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames
25 February 2021
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Nobukatsu Hojo
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames"
18 / 18 papers shown
Title
Generative Adversarial Network based Voice Conversion: Techniques, Challenges, and Recent Advancements
Sandipan Dhar
N. D. Jana
Swagatam Das
48
0
0
27 Apr 2025
Collective Learning Mechanism based Optimal Transport Generative Adversarial Network for Non-parallel Voice Conversion
Sandipan Dhar
Md. Tousin Akhter
N. D. Jana
Swagatam Das
35
1
0
18 Apr 2025
Discrete Unit based Masking for Improving Disentanglement in Voice Conversion
Philip H. Lee
Ismail Rasim Ulgen
Berrak Sisman
30
0
0
17 Sep 2024
FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Yuto Kondo
DiffM
40
0
0
03 Sep 2024
AutoCycle-VC: Towards Bottleneck-Independent Zero-Shot Cross-Lingual Voice Conversion
Haeyun Choi
Jio Gim
Yuho Lee
Youngin Kim
Young-Joo Suh
BDL
13
1
0
10 Oct 2023
Towards General-Purpose Text-Instruction-Guided Voice Conversion
Chun-Yi Kuan
Chen An Li
Tsung-Yuan Hsu
T. Lin
Ho-Lam Chung
Kai-Wei Chang
Shuo-yiin Chang
Hung-yi Lee
18
5
0
25 Sep 2023
SnakeGAN: A Universal Vocoder Leveraging DDSP Prior Knowledge and Periodic Inductive Bias
Sipan Li
Songxiang Liu
Lu Zhang
Xiang Li
Yanyao Bian
Chao Weng
Zhiyong Wu
Helen Meng
36
2
0
14 Sep 2023
Voice Conversion with Denoising Diffusion Probabilistic GAN Models
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
DiffM
16
5
0
28 Aug 2023
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
Dominik Wagner
Ilja Baumann
Tobias Bocklet
28
1
0
10 Jun 2023
Study of GANs for Noisy Speech Simulation from Clean Speech
L. Maben
Zixun Guo
Chen Chen
Utkarsh Chudiwal
Chng Eng Siong
14
0
0
21 May 2023
Multi-level Temporal-channel Speaker Retrieval for Zero-shot Voice Conversion
Zhichao Wang
Liumeng Xue
Qiuqiang Kong
Linfu Xie
Yuan-Jui Chen
Qiao Tian
Yuping Wang
BDL
17
3
0
12 May 2023
Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Shogo Seki
26
9
0
24 Mar 2023
MetaSpeech: Speech Effects Switch Along with Environment for Metaverse
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
19
1
0
25 Oct 2022
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Xiaodong Cui
G. Saon
Tohru Nagano
Masayuki Suzuki
Takashi Fukuda
Brian Kingsbury
Gakuto Kurata
31
7
0
29 Mar 2022
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform
Takuhiro Kaneko
Kou Tanaka
Hirokazu Kameoka
Shogo Seki
17
60
0
04 Mar 2022
The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition
L. Prananta
B. Halpern
Siyuan Feng
O. Scharenborg
16
16
0
13 Jan 2022
Music Sentiment Transfer
Miles Sigel
Michael X. Zhou
Jiebo Luo
13
1
0
12 Oct 2021
High Fidelity Speech Synthesis with Adversarial Networks
Mikolaj Binkowski
Jeff Donahue
Sander Dieleman
Aidan Clark
Erich Elsen
Norman Casagrande
Luis C. Cobo
Karen Simonyan
226
239
0
25 Sep 2019
1