ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.12841
  4. Cited By
MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in
  Frames

MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames

25 February 2021
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Nobukatsu Hojo
ArXivPDFHTML

Papers citing "MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames"

18 / 18 papers shown
Title
Generative Adversarial Network based Voice Conversion: Techniques, Challenges, and Recent Advancements
Generative Adversarial Network based Voice Conversion: Techniques, Challenges, and Recent Advancements
Sandipan Dhar
N. D. Jana
Swagatam Das
48
0
0
27 Apr 2025
Collective Learning Mechanism based Optimal Transport Generative Adversarial Network for Non-parallel Voice Conversion
Collective Learning Mechanism based Optimal Transport Generative Adversarial Network for Non-parallel Voice Conversion
Sandipan Dhar
Md. Tousin Akhter
N. D. Jana
Swagatam Das
35
1
0
18 Apr 2025
Discrete Unit based Masking for Improving Disentanglement in Voice
  Conversion
Discrete Unit based Masking for Improving Disentanglement in Voice Conversion
Philip H. Lee
Ismail Rasim Ulgen
Berrak Sisman
30
0
0
17 Sep 2024
FastVoiceGrad: One-step Diffusion-Based Voice Conversion with
  Adversarial Conditional Diffusion Distillation
FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Yuto Kondo
DiffM
40
0
0
03 Sep 2024
AutoCycle-VC: Towards Bottleneck-Independent Zero-Shot Cross-Lingual
  Voice Conversion
AutoCycle-VC: Towards Bottleneck-Independent Zero-Shot Cross-Lingual Voice Conversion
Haeyun Choi
Jio Gim
Yuho Lee
Youngin Kim
Young-Joo Suh
BDL
13
1
0
10 Oct 2023
Towards General-Purpose Text-Instruction-Guided Voice Conversion
Towards General-Purpose Text-Instruction-Guided Voice Conversion
Chun-Yi Kuan
Chen An Li
Tsung-Yuan Hsu
T. Lin
Ho-Lam Chung
Kai-Wei Chang
Shuo-yiin Chang
Hung-yi Lee
18
5
0
25 Sep 2023
SnakeGAN: A Universal Vocoder Leveraging DDSP Prior Knowledge and
  Periodic Inductive Bias
SnakeGAN: A Universal Vocoder Leveraging DDSP Prior Knowledge and Periodic Inductive Bias
Sipan Li
Songxiang Liu
Lu Zhang
Xiang Li
Yanyao Bian
Chao Weng
Zhiyong Wu
Helen Meng
36
2
0
14 Sep 2023
Voice Conversion with Denoising Diffusion Probabilistic GAN Models
Voice Conversion with Denoising Diffusion Probabilistic GAN Models
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
DiffM
16
5
0
28 Aug 2023
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked
  Cycle-Consistent Generative Adversarial Networks
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
Dominik Wagner
Ilja Baumann
Tobias Bocklet
28
1
0
10 Jun 2023
Study of GANs for Noisy Speech Simulation from Clean Speech
Study of GANs for Noisy Speech Simulation from Clean Speech
L. Maben
Zixun Guo
Chen Chen
Utkarsh Chudiwal
Chng Eng Siong
14
0
0
21 May 2023
Multi-level Temporal-channel Speaker Retrieval for Zero-shot Voice
  Conversion
Multi-level Temporal-channel Speaker Retrieval for Zero-shot Voice Conversion
Zhichao Wang
Liumeng Xue
Qiuqiang Kong
Linfu Xie
Yuan-Jui Chen
Qiao Tian
Yuping Wang
BDL
17
3
0
12 May 2023
Wave-U-Net Discriminator: Fast and Lightweight Discriminator for
  Generative Adversarial Network-Based Speech Synthesis
Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Shogo Seki
26
9
0
24 Mar 2023
MetaSpeech: Speech Effects Switch Along with Environment for Metaverse
MetaSpeech: Speech Effects Switch Along with Environment for Metaverse
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
19
1
0
25 Oct 2022
Improving Generalization of Deep Neural Network Acoustic Models with
  Length Perturbation and N-best Based Label Smoothing
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Xiaodong Cui
G. Saon
Tohru Nagano
Masayuki Suzuki
Takashi Fukuda
Brian Kingsbury
Gakuto Kurata
31
7
0
29 Mar 2022
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating
  Inverse Short-Time Fourier Transform
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform
Takuhiro Kaneko
Kou Tanaka
Hirokazu Kameoka
Shogo Seki
17
60
0
04 Mar 2022
The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for
  Improved Dysarthric Speech Recognition
The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition
L. Prananta
B. Halpern
Siyuan Feng
O. Scharenborg
16
16
0
13 Jan 2022
Music Sentiment Transfer
Music Sentiment Transfer
Miles Sigel
Michael X. Zhou
Jiebo Luo
13
1
0
12 Oct 2021
High Fidelity Speech Synthesis with Adversarial Networks
High Fidelity Speech Synthesis with Adversarial Networks
Mikolaj Binkowski
Jeff Donahue
Sander Dieleman
Aidan Clark
Erich Elsen
Norman Casagrande
Luis C. Cobo
Karen Simonyan
226
239
0
25 Sep 2019
1