Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations

9 April 2018

Papers citing "Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations"

Voice Conversion-based Privacy through Adversarial Information Hiding J. Webber O. Watts G. Henter Jennifer Williams Simon King 45 0 0 23 Sep 2024
Learning Disentangled Speech Representations Yusuf Brima U. Krumnack Simone Pika Gunther Heidemann CoGe DRL 43 3 0 04 Nov 2023
VaSAB: The variable size adaptive information bottleneck for disentanglement on speech and singing voice F. Bous Axel Roebel 18 0 0 05 Oct 2023
Learn to Sing by Listening: Building Controllable Virtual Singer by Unsupervised Learning from Voice Recordings Wei Xue Yiwen Wang Qi-fei Liu Yi-Ting Guo 39 1 0 09 May 2023
Disentangled Representation Learning for RF Fingerprint Extraction under Unknown Channel Statistics Renjie Xie Wei Xu Jiabao Yu A. Hu Derrick Wing Kwan Ng A. L. Swindlehurst 40 18 0 04 Aug 2022
Speak Like a Dog: Human to Non-human creature Voice Conversion Kohei Suzuki Shoki Sakamoto T. Taniguchi Hirokazu Kameoka 27 2 0 09 Jun 2022
ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers Kaizhi Qian Yang Zhang Heting Gao Junrui Ni Cheng-I Jeff Lai David D. Cox M. Hasegawa-Johnson Shiyu Chang DRL 30 110 0 20 Apr 2022
Noise-robust voice conversion with domain adversarial training Hongqiang Du Lei Xie Haizhou Li 19 11 0 26 Jan 2022
How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition Haoran Sun Lantian Li T. Zheng Dong Wang CVBM 19 0 0 24 Nov 2021
Toward Degradation-Robust Voice Conversion Chien-yu Huang Kai-Wei Chang Hung-yi Lee 30 7 0 14 Oct 2021
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion Yinghao Aaron Li A. Zare N. Mesgarani 35 98 0 21 Jul 2021
Learning Robust Latent Representations for Controllable Speech Synthesis Shakti Kumar Jithin Pradeep Hussain Zaidi DRL 41 6 0 10 May 2021
CDPAM: Contrastive learning for perceptual audio similarity Pranay Manocha Zeyu Jin Richard Y. Zhang Adam Finkelstein 27 68 0 09 Feb 2021
Optimizing voice conversion network with cycle consistency loss of speaker identity Hongqiang Du Xiaohai Tian Lei Xie Haizhou Li 21 17 0 17 Nov 2020
AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization Yen-Hao Chen Da-Yi Wu Tsung-Han Wu Hung-yi Lee 34 107 0 31 Oct 2020
Upsampling artifacts in neural audio synthesis Jordi Pons Santiago Pascual Giulio Cengarle Joan Serrà 33 62 0 27 Oct 2020
GAZEV: GAN-Based Zero-Shot Voice Conversion over Non-parallel Speech Corpus Zining Zhang Bingsheng He Zhenjie Zhang 16 19 0 24 Oct 2020
The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS Wen-Chin Huang Tomoki Hayashi Shinji Watanabe T. Toda DRL 13 39 0 06 Oct 2020
Intra-class variation reduction of speaker representation in disentanglement framework Yoohwan Kwon Soo-Whan Chung Hong-Goo Kang DRL 14 21 0 04 Aug 2020
VQVC+: One-Shot Voice Conversion by Vector Quantization and U-Net architecture Da-Yi Wu Yen-Hao Chen Hung-yi Lee 8 99 0 07 Jun 2020
Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations Janek Ebbers Michael Kuhlmann Tobias Cord-Landwehr Reinhold Haeb-Umbach DRL CoGe SSL 31 4 0 26 May 2020
Vector-quantized neural networks for acoustic unit discovery in the ZeroSpeech 2020 challenge Benjamin van Niekerk Leanne Nortje Herman Kamper 16 115 0 19 May 2020
Cotatron: Transcription-Guided Speech Encoder for Any-to-Many Voice Conversion without Parallel Data Seung-won Park Doo-young Kim Myun-chul Joe 18 40 0 07 May 2020
F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder Kaizhi Qian Zeyu Jin M. Hasegawa-Johnson G. J. Mysore 29 107 0 15 Apr 2020
Many-to-Many Voice Conversion using Conditional Cycle-Consistent Adversarial Networks Shindong Lee Bonggu Ko Keonnyeong Lee In-Chul Yoo Dongsuk Yook GAN 30 33 0 15 Feb 2020
Content Based Singing Voice Extraction From a Musical Mixture Pritish Chandna Merlijn Blaauw J. Bonada E. Gómez 28 14 0 12 Feb 2020
Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion Wen-Chin Huang Hao Luo Hsin-Te Hwang Chen-Chou Lo Yu-Huai Peng Yu Tsao Hsin-Min Wang DRL 17 42 0 22 Jan 2020
Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations Jing-Xuan Zhang Zhenhua Ling Lirong Dai 22 99 0 25 Jun 2019
Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion Wen-Chin Huang Yi-Chiao Wu Chen-Chou Lo Patrick Lumban Tobing Tomoki Hayashi Kazuhiro Kobayashi T. Toda Yu Tsao H. Wang DRL 21 13 0 02 May 2019
Sequence-to-Sequence Acoustic Modeling for Voice Conversion Jing-Xuan Zhang Zhenhua Ling Li-Juan Liu Yuan Jiang Lirong Dai 16 129 0 16 Oct 2018
Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences Cheng-chieh Yeh Po-Chun Hsu Ju-Chieh Chou Hung-yi Lee Lin-Shan Lee 33 23 0 09 Aug 2018
Conditional Image Synthesis With Auxiliary Classifier GANs Augustus Odena C. Olah Jonathon Shlens GAN 250 3,192 0 30 Oct 2016
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network Wenzhe Shi Jose Caballero Ferenc Huszár J. Totz Andrew P. Aitken Rob Bishop Daniel Rueckert Zehan Wang SupR 234 5,180 0 16 Sep 2016