ResearchTrend.AI
Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations

9 April 2018
Ju-Chieh Chou
Cheng-chieh Yeh
Hung-yi Lee
Lin-Shan Lee

Papers citing "Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations"

33 / 33 papers shown
Voice Conversion-based Privacy through Adversarial Information Hiding
J. Webber
O. Watts
G. Henter
Jennifer Williams
Simon King
45
0
0
23 Sep 2024
Learning Disentangled Speech Representations
Yusuf Brima
U. Krumnack
Simone Pika
Gunther Heidemann
CoGe
DRL
43
3
0
04 Nov 2023
VaSAB: The variable size adaptive information bottleneck for disentanglement on speech and singing voice
F. Bous
Axel Roebel
18
0
0
05 Oct 2023
Learn to Sing by Listening: Building Controllable Virtual Singer by Unsupervised Learning from Voice Recordings
Wei Xue
Yiwen Wang
Qi-fei Liu
Yi-Ting Guo
39
1
0
09 May 2023
Disentangled Representation Learning for RF Fingerprint Extraction under Unknown Channel Statistics
Renjie Xie
Wei Xu
Jiabao Yu
A. Hu
Derrick Wing Kwan Ng
A. L. Swindlehurst
40
18
0
04 Aug 2022
Speak Like a Dog: Human to Non-human creature Voice Conversion
Kohei Suzuki
Shoki Sakamoto
T. Taniguchi
Hirokazu Kameoka
27
2
0
09 Jun 2022
ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers
Kaizhi Qian
Yang Zhang
Heting Gao
Junrui Ni
Cheng-I Jeff Lai
David D. Cox
M. Hasegawa-Johnson
Shiyu Chang
DRL
30
110
0
20 Apr 2022
Noise-robust voice conversion with domain adversarial training
Hongqiang Du
Lei Xie
Haizhou Li
19
11
0
26 Jan 2022
How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition
Haoran Sun
Lantian Li
T. Zheng
Dong Wang
CVBM
19
0
0
24 Nov 2021
Toward Degradation-Robust Voice Conversion
Chien-yu Huang
Kai-Wei Chang
Hung-yi Lee
30
7
0
14 Oct 2021
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
Yinghao Aaron Li
A. Zare
N. Mesgarani
35
98
0
21 Jul 2021
Learning Robust Latent Representations for Controllable Speech Synthesis
Shakti Kumar
Jithin Pradeep
Hussain Zaidi
DRL
41
6
0
10 May 2021
CDPAM: Contrastive learning for perceptual audio similarity
Pranay Manocha
Zeyu Jin
Richard Y. Zhang
Adam Finkelstein
27
68
0
09 Feb 2021
Optimizing voice conversion network with cycle consistency loss of speaker identity
Hongqiang Du
Xiaohai Tian
Lei Xie
Haizhou Li
21
17
0
17 Nov 2020
AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization
Yen-Hao Chen
Da-Yi Wu
Tsung-Han Wu
Hung-yi Lee
34
107
0
31 Oct 2020
Upsampling artifacts in neural audio synthesis
Jordi Pons
Santiago Pascual
Giulio Cengarle
Joan Serrà
33
62
0
27 Oct 2020
GAZEV: GAN-Based Zero-Shot Voice Conversion over Non-parallel Speech Corpus
Zining Zhang
Bingsheng He
Zhenjie Zhang
16
19
0
24 Oct 2020
The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS
Wen-Chin Huang
Tomoki Hayashi
Shinji Watanabe
T. Toda
DRL
13
39
0
06 Oct 2020
Intra-class variation reduction of speaker representation in disentanglement framework
Yoohwan Kwon
Soo-Whan Chung
Hong-Goo Kang
DRL
14
21
0
04 Aug 2020
VQVC+: One-Shot Voice Conversion by Vector Quantization and U-Net architecture
Da-Yi Wu
Yen-Hao Chen
Hung-yi Lee
8
99
0
07 Jun 2020
Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations
Janek Ebbers
Michael Kuhlmann
Tobias Cord-Landwehr
Reinhold Haeb-Umbach
DRL
CoGe
SSL
31
4
0
26 May 2020
Vector-quantized neural networks for acoustic unit discovery in the ZeroSpeech 2020 challenge
Benjamin van Niekerk
Leanne Nortje
Herman Kamper
16
115
0
19 May 2020
Cotatron: Transcription-Guided Speech Encoder for Any-to-Many Voice Conversion without Parallel Data
Seung-won Park
Doo-young Kim
Myun-chul Joe
18
40
0
07 May 2020
F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder
Kaizhi Qian
Zeyu Jin
M. Hasegawa-Johnson
G. J. Mysore
29
107
0
15 Apr 2020
Many-to-Many Voice Conversion using Conditional Cycle-Consistent Adversarial Networks
Shindong Lee
Bonggu Ko
Keonnyeong Lee
In-Chul Yoo
Dongsuk Yook
GAN
30
33
0
15 Feb 2020
Content Based Singing Voice Extraction From a Musical Mixture
Pritish Chandna
Merlijn Blaauw
J. Bonada
E. Gómez
28
14
0
12 Feb 2020
Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion
Wen-Chin Huang
Hao Luo
Hsin-Te Hwang
Chen-Chou Lo
Yu-Huai Peng
Yu Tsao
Hsin-Min Wang
DRL
17
42
0
22 Jan 2020
Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
22
99
0
25 Jun 2019
Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion
Wen-Chin Huang
Yi-Chiao Wu
Chen-Chou Lo
Patrick Lumban Tobing
Tomoki Hayashi
Kazuhiro Kobayashi
T. Toda
Yu Tsao
H. Wang
DRL
21
13
0
02 May 2019
Sequence-to-Sequence Acoustic Modeling for Voice Conversion
Jing-Xuan Zhang
Zhenhua Ling
Li-Juan Liu
Yuan Jiang
Lirong Dai
16
129
0
16 Oct 2018
Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences
Cheng-chieh Yeh
Po-Chun Hsu
Ju-Chieh Chou
Hung-yi Lee
Lin-Shan Lee
33
23
0
09 Aug 2018
Conditional Image Synthesis With Auxiliary Classifier GANs
Augustus Odena
C. Olah
Jonathon Shlens
GAN
250
3,192
0
30 Oct 2016
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network
Wenzhe Shi
Jose Caballero
Ferenc Huszár
J. Totz
Andrew P. Aitken
Rob Bishop
Daniel Rueckert
Zehan Wang
SupR
234
5,180
0
16 Sep 2016